Structured Intent as a Protocol-Like Communication Layer: Cross-Model Robustness, Framework Comparison, and the Weak-Model Compensation Effect

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Natural Language Processing, Human-AI Interaction · Depth: Expert, quick

Summary

A study investigated the reliability of structured intent representations in preserving user goals across various AI models, languages, and prompting frameworks. Building on previous work with PPS (Prompt Protocol Specification), this research evaluated cross-model robustness using Claude, GPT-4o, and Gemini 2.5 Pro, compared it with CO-STAR and RISEN frameworks, and conducted a user study (N=50) on AI-assisted intent expansion. Analyzing 3,240 model outputs across three languages, six conditions, three models, three domains, and 20 tasks, an independent judge (DeepSeek-V3) found that structured prompting significantly reduced cross-language score variance, decreasing sigma from 0.470 to approximately 0.020 in the strongest conditions. The study also identified a "weak-model compensation" effect, where Gemini, the lowest-baseline model, showed a substantially larger gain (+1.006) than Claude (+0.217). 5W3H, CO-STAR, and RISEN achieved similar high goal-alignment scores, indicating that dimensional decomposition is a key factor. The user study demonstrated that AI-expanded 5W3H prompts reduced interaction rounds by 60% and increased user satisfaction from 3.16 to 4.04.

Key takeaway

For AI Engineers designing human-AI interaction systems, adopting structured intent representations like 5W3H can dramatically improve goal alignment and reduce cross-language variability across different large language models. This approach not only enhances the robustness of your applications but also significantly boosts user satisfaction and reduces interaction overhead, especially when integrating models with varying performance baselines. Consider implementing structured prompting as a core communication layer.

Key insights

Structured intent representations significantly enhance goal alignment and reduce variance across diverse AI models and languages.

Principles

Method

The study compared 5W3H, CO-STAR, and RISEN frameworks across Claude, GPT-4o, and Gemini 2.5 Pro, using 3,240 outputs evaluated by DeepSeek-V3, and included a user study (N=50) on AI-assisted intent expansion.

In practice

Topics

Best for: AI Engineer, Machine Learning Engineer, Research Scientist, AI Scientist, NLP Engineer, Prompt Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.