The World Leaks the Future: Harness Evolution for Future Prediction Agents

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

Milkyway is a self-evolving agent system designed to improve future prediction capabilities by updating a persistent "future prediction harness" rather than the base large language model (LLM). This system addresses the challenge of making consequential decisions before outcomes are known, a common problem in future prediction tasks. Milkyway operates by revisiting unresolved questions over time, extracting "internal feedback" from temporal contrasts between earlier and later predictions. This feedback identifies omissions in the prediction process and is used to write reusable guidance back into the harness, enabling improvements before the final outcome is known. Once a question is resolved, a "retrospective check" is performed, and the updated harness is carried forward. On the FutureX benchmark, Milkyway improved scores from 44.07 to 60.90, and on FutureWorld, scores increased from 62.22 to 77.96, achieving the best overall performance among compared methods.

Key takeaway

For research scientists developing future prediction agents, you should consider adopting a self-evolving harness approach like Milkyway. This method allows for continuous improvement of factor tracking, evidence gathering, and uncertainty handling without retraining the base LLM, significantly boosting prediction accuracy on evolving public information.

Key insights

Milkyway improves LLM future prediction by evolving a persistent harness with internal feedback, not the base model.

Principles

Method

Milkyway extracts internal feedback from repeated predictions on unresolved questions, writes reusable guidance to a persistent harness, and performs a retrospective check after resolution.

In practice

Topics

Best for: Research Scientist, AI Scientist, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.