From Texts to Scores: Tracing the Emergence of Essay Quality Representations in Large Language Models
Summary
A study systematically analyzed the hidden representations of eight Large Language Models (LLMs) across two English essay datasets (ASAP++, CSEE) and one Portuguese dataset (ENEM) to understand their internal mechanisms for Automated Essay Scoring (AES). Researchers found consistent evidence that essay quality information is encoded in a linearly accessible form within LLM representations. These representations emerge progressively across layers, remain robust across prompting strategies, and partially transfer across essay prompts despite differing scoring rubrics. The analysis, using linear probing, cross-prompt generalization, dimensionality reduction, and neuron-level analyses, also revealed that nonlinear probes offer only marginal improvements, indicating high linear decodability. Furthermore, specific "essay scoring neurons" were identified, whose activations strongly correlate with essay scores and whose layer-wise distribution shifts with essay length, with deeper layers being more critical for longer essays.
Key takeaway
For Machine Learning Engineers developing Automated Essay Scoring (AES) systems, understanding that LLMs encode essay quality linearly and progressively across layers is crucial. You should focus on linear probing techniques for efficient and interpretable scoring, as nonlinear methods offer minimal gains. Furthermore, consider analyzing "essay scoring neurons" and adapting your model's layer utilization based on essay length to optimize performance and gain deeper insights into how your LLM assesses writing quality.
Key insights
LLMs encode essay quality information in linearly accessible, progressively emerging representations, with specific "scoring neurons" adapting to essay length.
Principles
- Essay quality is linearly decodable from LLM representations.
- Quality representations emerge progressively across layers.
- Specific "essay scoring neurons" exist.
Method
The study used linear probing, cross-prompt generalization, dimensionality reduction, and neuron-level analyses to trace essay quality representations in LLMs across English and Portuguese datasets.
In practice
- Identify "essay scoring neurons" for AES interpretability.
- Utilize linear probes for efficient essay scoring.
- Consider deeper layers for longer essay analysis.
Topics
- Large Language Models
- Automated Essay Scoring
- LLM Interpretability
- Hidden Representations
- Linear Probing
- Essay Quality
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, NLP Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CL updates on arXiv.org.