From Texts to Scores: Tracing the Emergence of Essay Quality Representations in Large Language Models
Summary
A systematic analysis investigated how eight Large Language Models (LLMs) encode essay quality for Automated Essay Scoring (AES) across two English datasets (ASAP++, CSEE) and one Portuguese dataset (ENEM). Researchers employed linear probing, cross-prompt generalization, dimensionality reduction, and neuron-level analyses. Findings consistently show that essay quality information is encoded in a linearly accessible form within LLM representations, emerging progressively across layers and remaining robust to prompting strategies. This information partially transfers across essay prompts despite rubric differences. Nonlinear probes offered only marginal improvements. The study also identified specific "essay scoring neurons" whose activations correlate with scores and whose layer-wise distribution shifts with essay length, providing insights into LLM-based AES interpretability.
Key takeaway
For NLP Engineers and AI Scientists developing Automated Essay Scoring systems, understanding the internal mechanisms of LLMs is crucial. This research indicates that essay quality is linearly encoded within LLM representations, suggesting that focusing on these hidden states and "essay scoring neurons" can lead to more interpretable and robust AES models. You should investigate layer-wise representation shifts and neuron activations to refine model design and prompt engineering for improved scoring accuracy and transparency.
Key insights
LLMs encode linearly accessible essay quality representations, identifiable through specific neurons and layer-wise shifts.
Principles
- Essay quality data is linearly decodable in LLM representations.
- Quality representations emerge progressively across layers.
- "Essay scoring neurons" correlate with scores and shift by essay length.
Method
Systematic analysis using linear probing, cross-prompt generalization, dimensionality reduction, and neuron-level analyses on LLM hidden representations.
In practice
- Analyze LLM hidden states for task-specific feature encoding.
- Identify "scoring neurons" to enhance AES interpretability.
- Evaluate representation robustness across diverse prompts.
Topics
- Automated Essay Scoring
- Large Language Models
- Model Interpretability
- Neural Representations
- Linear Probing
- Natural Language Processing
Best for: Research Scientist, AI Scientist, NLP Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.