Evals are the modern PRD—they define "what" success looks like, not "how" to achieve it
Summary
Artificial intelligence, particularly machine learning and transformer models, fundamentally alters the programming paradigm from specifying "how" a task is accomplished to defining "what" the desired outcome or success looks like. This shift is analogous to statistical regression, where users provide data points (the "what"), and the system computes the slope and Y-intercept (the "how"). For modern AI, such as transformers, the "what" is the task of predicting the next token, while the underlying computational substrate and GPUs determine "how" to achieve that prediction. This reorientation towards defining outcomes, rather than explicit instructions, is presented as a key to increased productivity in AI development, positioning evaluations (Evals) as the modern Product Requirements Document (PRD) for defining success.
Key takeaway
For AI Product Managers defining project scope, recognize that AI development thrives on clearly articulated outcomes. Shift your focus from detailing implementation "how" to precisely defining "what" success means for your models and applications. This means prioritizing robust evaluation metrics (Evals) as your primary specification, enabling your teams to innovate on the execution while ensuring alignment with business goals.
Key insights
AI shifts programming from defining "how" to achieve tasks to specifying "what" success looks like, enhancing productivity.
Principles
- AI development prioritizes outcome definition over procedural instruction.
- Modern AI systems determine execution "how" from defined "what."
- Focusing on "what" (outcomes) boosts AI productivity.
In practice
- Define success metrics (Evals) before implementation.
- Focus on desired outputs for AI systems.
- Frame AI problems as "what" to predict or achieve.
Topics
- AI Programming Paradigm
- Machine Learning
- Transformers
- Next Token Prediction
- Product Requirements Document
- AI Evaluation
Best for: AI Product Manager, AI Architect, Director of AI/ML
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by How I AI.