Wisdom Of The (AI) Crowd: Investigating Artificial Swarm Intelligence In Large Language Models
Summary
A study investigated whether large language models (LLMs) can replicate human swarm intelligence effects, which typically offer high collective accuracy but face scalability issues. Researchers conducted a controlled experiment using 960 manually executed prompts across GPT-5, Gemini 2.5 Pro, and Claude Sonnet 4.5. The experiment tested both intra-model sampling and inter-model aggregation on eight distinct estimation tasks. Findings consistently showed error reduction through both aggregation methods, with significant reductions up to 37 percentage points in Mean Absolute Percentage Error (MAPE) across various strategies. The research also identified positive correlations (Spearman's ρ=0.242-0.568, all p<0.001) between relative confidence interval widths and estimation errors, suggesting LLMs exhibit metacognitive awareness regarding uncertainty. This work explores deploying LLM swarms in organizational decision-making.
Key takeaway
For Directors of AI/ML evaluating decision support systems, consider implementing LLM swarm intelligence. Your teams can achieve up to 37 percentage points error reduction by aggregating responses from models like GPT-5 or Gemini 2.5 Pro. This approach enhances accuracy and leverages LLMs' inherent metacognitive awareness of uncertainty. Explore intra-model sampling and inter-model aggregation strategies to improve organizational decision-making processes.
Key insights
LLM swarms can reduce estimation errors and exhibit metacognitive awareness, mimicking human swarm intelligence.
Principles
- Aggregation reduces LLM estimation errors.
- LLMs show metacognitive uncertainty awareness.
- Intra- and inter-model aggregation are effective.
Method
A controlled experiment used 960 prompts across GPT-5, Gemini 2.5 Pro, and Claude Sonnet 4.5, testing intra-model sampling and inter-model aggregation on eight estimation tasks.
In practice
- Deploy LLM swarms for decision-making.
- Use aggregation to improve LLM accuracy.
- Explore LLM uncertainty for better estimates.
Topics
- Large Language Models
- Swarm Intelligence
- Model Aggregation
- Metacognition
- Decision Support Systems
- GPT-5
Best for: Research Scientist, AI Architect, AI Engineer, AI Scientist, Director of AI/ML, Consultant
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.