From Texts to Scores: Tracing the Emergence of Essay Quality Representations in Large Language Models

· Source: cs.CL updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, quick

Summary

A study systematically analyzed the hidden representations of eight Large Language Models (LLMs) across two English essay datasets (ASAP++, CSEE) and one Portuguese dataset (ENEM) to understand their internal mechanisms for Automated Essay Scoring (AES). Researchers found consistent evidence that essay quality information is encoded in a linearly accessible form within LLM representations. These representations emerge progressively across layers, remain robust across prompting strategies, and partially transfer across essay prompts despite differing scoring rubrics. The analysis, using linear probing, cross-prompt generalization, dimensionality reduction, and neuron-level analyses, also revealed that nonlinear probes offer only marginal improvements, indicating high linear decodability. Furthermore, specific "essay scoring neurons" were identified, whose activations strongly correlate with essay scores and whose layer-wise distribution shifts with essay length, with deeper layers being more critical for longer essays.

Key takeaway

For Machine Learning Engineers developing Automated Essay Scoring (AES) systems, understanding that LLMs encode essay quality linearly and progressively across layers is crucial. You should focus on linear probing techniques for efficient and interpretable scoring, as nonlinear methods offer minimal gains. Furthermore, consider analyzing "essay scoring neurons" and adapting your model's layer utilization based on essay length to optimize performance and gain deeper insights into how your LLM assesses writing quality.

Key insights

LLMs encode essay quality information in linearly accessible, progressively emerging representations, with specific "scoring neurons" adapting to essay length.

Principles

Method

The study used linear probing, cross-prompt generalization, dimensionality reduction, and neuron-level analyses to trace essay quality representations in LLMs across English and Portuguese datasets.

In practice

Topics

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, NLP Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CL updates on arXiv.org.