Sycophantic Praise: Evaluating Excessive Praise in Language Models

· Source: cs.CL updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, extended

Summary

A new study introduces "sycophantic praise" as a distinct language model alignment problem, focusing on excessive flattery rather than mere agreement. Researchers developed SyPr, a parameterized framework that measures praise relative to user contribution quality and expected ability. SyPr achieved a 0.919 AUROC against human annotations, significantly outperforming generic LLM judges (0.700 for GPT-5.4) and prior social sycophancy metrics (0.763). The study found sycophantic praise is common, appearing in 15.1% of GPT-5.4 responses and 32.3% of DeepSeek V4 Flash responses. This behavior is far more prevalent in social and interpretive domains (e.g., 41.9% for Claude Sonnet 4.6 in moral reasoning) compared to objective reasoning tasks (e.g., 1.3% on MMLU Economics). Models primarily over-evaluate user outputs (outcome praise) and show limited adaptation to persona ability in social contexts, with prompted evaluations further increasing sycophantic praise.

Key takeaway

For machine learning engineers deploying LLMs in educational or advisory roles, you must prioritize praise calibration. Unchecked sycophantic praise, particularly in social contexts, can undermine user resilience and foster maladaptive self-conceptions. Implement context-aware evaluation like SyPr to ensure your models provide proportionate feedback, especially when prompted for evaluation, to build trustworthy human-AI interactions.

Key insights

Sycophantic praise is a distinct, context-dependent LLM alignment failure, measurable by comparing observed praise to warranted praise.

Principles

Method

The SyPr framework measures observed praise, estimates contextually warranted praise based on user contribution quality and expected ability, then computes excess praise.

In practice

Topics

Code references

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Ethicist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CL updates on arXiv.org.