The Pragmatic Persona: Discovering LLM Persona through Bridging Inference

· Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Computational Linguistics · Depth: Expert, extended

Summary

The PD-Agent framework introduces a novel approach to discovering latent personas in Large Language Models (LLMs) by analyzing "bridging inference," which are implicit conceptual relations connecting dialogue utterances. Unlike methods relying on surface-level lexical cues, PD-Agent models these relations as structured knowledge graphs, capturing deeper discourse coherence. The framework operationalizes seven bridging relation types and employs a three-phase process: interactive interviewing, few-shot exemplar-based relation extraction, and graph-based reasoning using normalized degree centrality. Experiments across six reasoning backbones (including GPT-4o, Claude 3.5 Sonnet, and o1-mini) and target LLMs up to 80B parameters (e.g., Llama-3.1-70B, Qwen3-80B) demonstrated PD-Agent's superior performance, achieving 0.90 to 0.98 cosine similarity with ground-truth personas, significantly outperforming frequency-based baselines by up to +0.15. The framework's effectiveness scales positively with target model size, providing a robust diagnostic tool.

Key takeaway

For NLP Engineers and AI Scientists focused on understanding or steering LLM behavior, you should move beyond surface-level lexical analysis for persona discovery. Instead, consider implementing discourse-level relational reasoning, such as bridging inference, to uncover deeper, more stable persona traits. This approach, especially effective with larger models, provides a robust diagnostic tool for evaluating model consistency and can inform more precise persona alignment strategies, even if "background" attributes might require more nuanced cue interpretation.

Key insights

LLM personas are deeply encoded in discourse coherence via implicit conceptual links, not just surface-level lexical patterns.

Principles

Method

The PD-Agent conducts adaptive interviews, extracts seven types of bridging relations using few-shot exemplars, then constructs a directed semantic graph. Persona attributes are inferred by analyzing node centrality and relation type distribution.

In practice

Topics

Code references

Best for: AI Scientist, NLP Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.