On Emotion-Sensitive Decision Making of Small Language Model Agents

· Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Expert, extended

Summary

This research investigates how emotional states influence the decision-making of small language model (SLM) agents, a factor often overlooked in current evaluations. The study introduces a novel approach that combines representation-level emotion induction, using activation steering derived from crowd-validated emotion-eliciting texts, with a structured game-theoretic evaluation. A new benchmark is developed, featuring canonical decision templates from games like Diplomacy and StarCraft II, alongside real-world persona-driven scenarios, covering cooperative and competitive incentives under varying information conditions. Experiments across multiple SLM families and architectures reveal that emotional perturbations systematically affect strategic choices. However, the resulting behaviors are frequently unstable, not consistently aligned with human expectations, and can even be counter-intuitive in some cases. The paper also proposes a method to improve robustness against emotion-driven perturbations through thought audits.

Key takeaway

For research scientists developing or deploying SLM agents in interactive settings, you should recognize that emotion-driven decision shifts are pervasive but often unpredictable and not consistently human-aligned. Your evaluations must incorporate robust, emotion-sensitive benchmarks, and you should consider implementing thought audit mechanisms to enhance the stability and reliability of agent behavior under emotional perturbations, especially in critical applications.

Key insights

Emotion-sensitive decision-making in SLMs is unstable and often misaligned with human expectations, despite systematic influence.

Principles

Method

Emotional states are induced in SLMs using activation steering, applying vectors derived from crowd-validated emotion-eliciting texts to internal representations. A benchmark of game-theoretic decision templates from Diplomacy, StarCraft II, and synthetic scenarios evaluates these effects.

In practice

Topics

Code references

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.