Feedback-to-Rubrics: Can We Learn Expert Criteria from Inline Comments?

2026-05-28 · Source: Machine Learning · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Emerging Technologies & Innovation · Depth: Expert, quick

Summary

The paper proposes a new problem setting: learning reusable natural-language rubrics from accumulated inline comments on artifacts such as human-written or LLM-generated drafts. The setting targets criteria that are tacit, undocumented, and context-dependent, such as expert preferences or organization-specific conventions, which are hard to elicit directly yet crucial for effective LLM-based writing and review support. The approach infers rubrics from the comments, then iteratively refines them by identifying comment-wise mismatches between rubric-conditioned predictions and reference comments. It is evaluated both in real-world review settings and in controlled environments with reference rubrics. The results indicate that inline comments can be distilled into reusable rubrics that support comment prediction, rubric understanding, and automatic artifact revision.

Key takeaway

For NLP engineers building LLM-powered writing or review systems, this method offers a way to operationalize tacit expert criteria. Organizational conventions and expert preferences can be distilled from existing inline comments into reusable rubrics. According to the paper, those rubrics support comment prediction, rubric understanding, and automatic artifact revision.

Key insights

Inline comments can be distilled into reusable natural-language rubrics, enhancing LLM-driven writing and review support.

Principles

Expert criteria are often tacit and hard to elicit.
Rubrics can be inferred from inline comments.
Iterative refinement improves rubric accuracy.

Method

Rubrics are inferred from inline comments, then iteratively refined by observing comment-wise mismatches between rubric-conditioned predictions and reference comments.
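
The infer-then-refine loop described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: the names `Comment`, `find_mismatches`, `learn_rubric`, and every `toy_*` function are hypothetical, and the toy stand-ins use simple word matching where a real system would presumably call an LLM and work with natural-language rubric items.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class Comment:
    span: str  # text the inline comment is attached to
    text: str  # the reviewer's remark


Rubric = List[str]
Example = Tuple[str, List[Comment]]  # (artifact, reference comments)


def find_mismatches(predicted, reference, matches):
    """Return reference comments nobody predicted and predictions with no reference."""
    missed = [r for r in reference if not any(matches(p, r) for p in predicted)]
    spurious = [p for p in predicted if not any(matches(p, r) for r in reference)]
    return missed, spurious


def learn_rubric(
    examples: List[Example],
    infer: Callable[[List[Example]], Rubric],
    predict: Callable[[str, Rubric], List[Comment]],
    matches: Callable[[Comment, Comment], bool],
    revise: Callable[[Rubric, List[Comment], List[Comment]], Rubric],
    max_rounds: int = 5,
) -> Rubric:
    rubric = infer(examples)  # initial rubric inferred from the comments
    for _ in range(max_rounds):
        missed, spurious = [], []
        for artifact, reference in examples:
            m, s = find_mismatches(predict(artifact, rubric), reference, matches)
            missed += m
            spurious += s
        if not missed and not spurious:
            break  # predictions agree with every reference comment
        rubric = revise(rubric, missed, spurious)
    return rubric


# Toy stand-ins (assumptions for illustration only): a rubric item is a word to flag.
def toy_infer(examples):
    _, first_comments = examples[0]  # deliberately partial, so refinement has work to do
    return [c.span for c in first_comments]


def toy_predict(artifact, rubric):
    return [Comment(w, "avoid this word") for w in artifact.split() if w in rubric]


def toy_matches(predicted, reference):
    return predicted.span == reference.span


def toy_revise(rubric, missed, spurious):
    drop = {c.span for c in spurious}
    kept = [item for item in rubric if item not in drop]
    for c in missed:
        if c.span not in kept:
            kept.append(c.span)
    return kept


examples = [
    ("we utilize very simple methods",
     [Comment("utilize", "avoid this word"), Comment("very", "avoid this word")]),
    ("results are very good and leverage data",
     [Comment("very", "avoid this word"), Comment("leverage", "avoid this word")]),
]
learned = learn_rubric(examples, toy_infer, toy_predict, toy_matches, toy_revise)
print(learned)
```

Passing the four steps in as callables keeps the loop independent of any particular model or matching rule, so each stand-in can be swapped for an LLM-backed version without changing the control flow.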

In practice

Predict comments based on learned rubrics.
Improve understanding of review criteria.
Automate artifact revision processes.

Topics

Large Language Models
Natural Language Processing
Rubric Learning
Inline Comments
Automated Review
Artifact Revision

Best for: Research Scientist, AI Scientist, NLP Engineer

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.