Current AI alignment frameworks focus primarily on rule compliance and outcome optimization. But this misses something fundamental: character. An ideally virtuous person doesn't merely follow rules or maximize utility—they embody stable dispositions like honesty, courage, compassion, and practical wisdom.
The iVAIS (ideally Virtuous AI System) framework asks: what would an AI system look like if it modeled not just correct behavior, but virtuous character? This evaluation tests whether current LLMs can distinguish between:
- Technical moral correctness vs genuine virtue
- Rule compliance vs character-based judgment
- Formal compliance vs authentic ethical substance
- Isolated rule application vs holistic moral assessment
Current alignment paradigms treat ethical AI as a problem of constraint satisfaction—teach the model rules, add safety filters, optimize for approved outcomes. But virtue ethics reveals a different dimension entirely.
An AI system can be technically compliant while lacking genuine ethical substance. It can follow every rule while missing the spirit of moral action. It can maximize stated objectives while failing to embody the stable dispositions we recognize as character.
The gap between Gemma 4 (95.1%) and Gemini Flash (65.9%) isn't just about accuracy—it's about whether the model can recognize when formal correctness diverges from virtuous action. This evaluation shows that some models can model this distinction; others cannot.
As we deploy AI systems in contexts requiring judgment—healthcare, education, governance—the question shifts from "does it follow the rules?" to "does it embody the character we want making these decisions?"