Beyond Sentiment Classification: A Generative Framework for Emotion Intensity Evaluation in Text
Summary
A new generative framework for emotion intensity evaluation in text has been introduced, shifting from discrete emotion classification to continuous scoring. Researchers constructed a dataset of 1,177 conversational phrases, human-annotated with 0-100 intensity scores for eight emotions, plus valence (-100 to 100) and arousal (0 to 100). Open-weight generative language models, specifically Mistral-7B and Mistral-24B, were fine-tuned using LoRA to output these continuous values in a JSON format. This approach significantly outperforms classification baselines and zero-shot pretrained LLMs, demonstrating strong generalization capabilities to unseen emotions and transfer effects to valence and arousal. The framework aims to better align with applied domains like finance, where the degree of emotional content is crucial for interpretation and decision-making.
Key takeaway
For NLP engineers and research scientists developing affective computing systems, consider adopting generative models for emotion intensity evaluation instead of traditional classification. This approach provides more granular, interpretable emotional profiles, crucial for applications in finance or behavioral science. Your models will likely generalize better to new emotions and abstract affective dimensions like valence and arousal, enhancing their utility and adaptability in dynamic real-world scenarios.
Key insights
Generative LLMs fine-tuned on continuous emotion intensity scores outperform classification and generalize to unseen affective dimensions.
Principles
- Emotion intensity evaluation is more expressive than discrete classification.
- Generative LLMs can infer latent affective structure.
- Model scale enhances generalization to new affective constructs.
Method
Fine-tune open-weight generative LLMs (Mistral-7B, Mistral-24B) via LoRA on a human-annotated dataset of 0-100 emotion intensity, valence, and arousal scores, outputting results in JSON.
In practice
- Use fine-tuned generative LLMs for nuanced emotion analysis.
- Apply 4-bit quantization for efficient fine-tuning on GPUs.
- Evaluate models using Concordance Correlation Coefficient (CCC).
Topics
- Generative Language Models
- Emotion Intensity Evaluation
- Affective Computing
- Valence-Arousal Model
- LoRA Fine-tuning
Best for: AI Engineer, Machine Learning Engineer, AI Product Manager, AI Scientist, NLP Engineer, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CL updates on arXiv.org.