Feedback-to-Rubrics: Can We Learn Expert Criteria from Inline Comments?
Summary
A novel problem setting is proposed for learning reusable natural-language rubrics from accumulated inline comments on artifacts like human-written or LLM-generated drafts. This method addresses the challenge of eliciting tacit, undocumented, and context-dependent criteria, such as expert preferences or organization-specific conventions, which are crucial for effective LLM-based writing and review support. The approach infers rubrics from these comments and iteratively refines them by identifying comment-wise mismatches between rubric-conditioned predictions and reference comments. Evaluated in both real-world review settings and controlled environments with reference rubrics, the results demonstrate that inline comments can be effectively distilled into reusable rubrics. These learned rubrics support comment prediction, enhance rubric understanding, and facilitate automatic artifact revision.
Key takeaway
For NLP Engineers developing LLM-powered writing or review systems, this method offers a direct path to operationalize tacit expert criteria. You can distill organizational conventions and expert preferences from existing inline comments, significantly enhancing your model's contextual relevance. Implement this approach to generate reusable rubrics, improving comment prediction, fostering rubric understanding, and enabling more effective automatic artifact revision within your applications.
Key insights
Inline comments can be distilled into reusable natural-language rubrics, enhancing LLM-driven writing and review support.
Principles
- Expert criteria are often tacit and hard to elicit.
- Rubrics can be inferred from inline comments.
- Iterative refinement improves rubric accuracy.
Method
Rubrics are inferred from inline comments, then iteratively refined by observing comment-wise mismatches between rubric-conditioned predictions and reference comments.
In practice
- Predict comments based on learned rubrics.
- Improve understanding of review criteria.
- Automate artifact revision processes.
Topics
- Large Language Models
- Natural Language Processing
- Rubric Learning
- Inline Comments
- Automated Review
- Artifact Revision
Best for: Research Scientist, AI Scientist, NLP Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.