Staying with the Uncertainty: Uncertainty-Scaffolding Strategies for Artificial Moral Advisors in LLM-to-LLM Simulated Conversations

· Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems, Social Sciences & Behavioral Studies · Depth: Expert, extended

Summary

A study investigated "uncertainty-scaffolding" strategies for Large Language Model (LLM) Artificial Moral Advisors (AMAs) in simulated ethical dialogues. Researchers proposed three strategies—Perspective-Multiplying, Tension-Preserving, and Process-Reflecting—and compared them against Baseline, Persuasive, and Sycophantic controls. Using a multi-agent framework, 6,400 conversations were generated with gpt-oss-120B as both user and AMA, and gpt-5-mini for specific comparisons. Findings indicate open-source LLMs like gpt-oss-120b (AUROC 0.67) simulate moral ambiguity through diverse stances across personas, while proprietary models like gpt-5-mini (AUROC 0.65) use individual hedging. Declarative persona prompts yielded greater initial stance diversity, whereas narrative prompts led to more realistic belief revisions post-conversation. All six AMA strategies produced distinct conversational patterns (overall F1 0.887). Uncertainty strategies improved engagement quality: Process-Reflecting fostered genuine stance shifts (18% revision rate, 3.70 helpfulness), Perspective-Multiplying clarified weaker arguments (0.24 weaker position support growth), and Tension-Preserving increased empathy (0.71 weaker position relatability). Control strategies generally reinforced initial user beliefs.

Key takeaway

For AI Ethicists designing conversational agents, you should integrate uncertainty-scaffolding strategies to cultivate richer ethical engagement. If your goal is genuine stance revision, implement Process-Reflecting. To broaden perspectives, use Perspective-Multiplying. For increased empathy towards opposing views, apply Tension-Preserving. Avoid default or purely persuasive approaches, as they often reinforce existing beliefs, limiting productive deliberation and user satisfaction.

Key insights

Effective AI moral advisors can foster productive ethical deliberation by strategically expressing uncertainty, not just providing answers.

Principles

Method

A multi-agent simulation framework compares AMA strategies by having a user-agent LLM engage in ethical dilemmas, completing pre/post-conversation questionnaires to measure stance, certainty, relatability, and reasoning clarity shifts.

In practice

Topics

Best for: Research Scientist, AI Scientist, AI Ethicist, NLP Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.