Whose Alignment? Comparing LLM Process Alignment Across Diverse Organizational Decision Contexts

· Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Legal & Regulatory, Finance & Economics · Depth: Expert, extended

Summary

The Contextualized Alignment Lens Model (CALM) framework measures "process alignment" in LLMs, assessing whether a model weights information similarly to an organization's decision policy, rather than just matching outcomes. Applied to ECHR Article 6 decisions, CALM found process alignment strongly predicts output accuracy (r=0.85, p<.001), with explicit knowledge externalization significantly improving alignment for poorly-aligned models (e.g., GPT-5.4-nano improved by Δ=+0.906). However, in German consumer credit decisions, this relationship collapsed (r=0.15, p=.60), interventions were inconsistent, and the benchmark encoded potentially discriminatory historical patterns. This contrast highlights that high process alignment is not always achievable or desirable, especially in contested domains where output agreement alone cannot reveal underlying policy adherence or ethical conflicts.

Key takeaway

For AI scientists and policy makers deploying LLMs in high-stakes decision contexts, you should prioritize process-level evaluation using tools like CALM. Relying solely on output accuracy or demographic parity metrics risks overlooking whether your models are replicating historically discriminatory decision policies or reasoning in normatively inappropriate ways. Use process alignment as an audit tool to surface underlying cue-weighting policies, especially when organizational benchmarks may encode contested values, ensuring decisions are reached legitimately.

Key insights

Process alignment measures how LLMs weight information like organizations, revealing policy adherence beyond mere output accuracy.

Principles

Method

CALM estimates behavioral policies using ridge-regularized logistic regression on observed decisions, then computes cosine similarity between organizational and LLM policy vectors to measure alignment.

In practice

Topics

Best for: Research Scientist, AI Scientist, AI Ethicist, Policy Maker

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.