Do Emotions Influence Moral Judgment in Large Language Models?

· Source: cs.CL updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, extended

Summary

A study investigated how emotions influence moral judgment in seven large language models (LLMs) across two datasets: Social-Chem-101 and the ETHICS Justice subset. Researchers developed an emotion-induction pipeline to infuse positive (compassion, gratitude, joy, love, pride, relief) and negative (anger, disgust, embarrassment, fear, remorse, sadness) emotions into moral situations, then evaluated shifts in moral acceptability on a 1-7 Likert scale. The findings indicate a consistent pattern where positive emotions generally increase moral acceptability (up to +1.21 points) and negative emotions decrease it (up to -1.15 points). This effect was strong enough to reverse binary moral judgments in up to 20% of cases on the ETHICS Justice dataset, with smaller models showing greater susceptibility. Notably, some emotions, like remorse, paradoxically increased acceptability, while relief sometimes decreased it. A complementary human annotation study revealed that humans do not exhibit these systematic shifts, highlighting an alignment gap in current LLMs.

Key takeaway

For AI Product Managers developing LLM-powered applications that involve ethical decision-making or judgment, you must account for the demonstrated susceptibility of LLMs to emotional framing. Your models may exhibit systematic biases, potentially reversing moral judgments based on induced emotions. Validate your LLM's behavior with human-in-the-loop evaluations to mitigate this alignment gap, especially when deploying smaller models, and consider how emotional cues in user prompts could inadvertently sway outcomes.

Key insights

LLMs' moral judgments are systematically biased by emotional context, unlike human judgments.

Principles

Method

An emotion-induction pipeline uses GPT-5.1 to select contextually appropriate positive/negative emotions and rewrite moral situations using templates, then evaluates LLM moral acceptability on a 1-7 Likert scale.

In practice

Topics

Code references

Best for: Research Scientist, AI Product Manager, AI Scientist, NLP Engineer, AI Ethicist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CL updates on arXiv.org.