Semantic Reranking at Inference Time for Hard Examples in Rhetorical Role Labeling

2026-05-18 · Source: Computation and Language · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

A new inference-time semantic reranking framework, RISE, improves Rhetorical Role Labeling (RRL) performance on challenging examples. RRL assigns functional roles to sentences in documents, crucial for legal, medical, and scientific texts. While language models (LMs) generally perform well, they struggle with low-confidence predictions on "hard examples." RISE addresses this by identifying these uncertain predictions and then reranking model outputs using contrastively learned label representations. This framework operates without requiring retraining or modification of the base LM. Evaluated across eight domain-specific RRL datasets and seven LMs (both encoder-based and causal architectures), RISE achieved an average gain of +9.15 macro-F1 points specifically on hard examples. The research also introduced manual hardness annotations to compare model and human perspectives on difficulty, showing a moderate agreement with a Cohen's kappa of 0.40.

Key takeaway

For research scientists developing or deploying LMs for Rhetorical Role Labeling, consider integrating RISE to significantly boost performance on difficult, low-confidence predictions. This framework offers a substantial +9.15 macro-F1 gain on hard examples without requiring costly model retraining or architecture changes, making it an efficient way to enhance reliability in critical applications like legal or medical text analysis. Evaluate its impact on your specific domain's challenging cases.

Key insights

RISE improves Rhetorical Role Labeling on hard examples by semantically reranking low-confidence predictions at inference time.

Principles

Label semantics can refine LM predictions.
Uncertainty can be addressed post-prediction.

Method

RISE identifies low-confidence predictions, then reranks model outputs using contrastively learned label representations, without model retraining.

In practice

Apply semantic reranking to low-confidence LM outputs.
Use contrastive learning for label representations.

Topics

Rhetorical Role Labeling
Semantic Reranking
Inference-Time Optimization
Language Models
Hard Examples

Best for: Research Scientist, AI Scientist, NLP Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.