🤖AI Agents Weekly: LLMs in 2025, YOLO in the Sandbox, Plan Caching for Agents, DeepTutor

· Source: AI Newsletter · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Emerging Technologies & Innovation, Software Development & Engineering · Depth: Advanced, quick

Summary

Simon Willison's annual review, "2025: The Year in LLMs," details 26 significant trends that shaped the Large Language Model landscape in 2025. Key developments include the breakthrough of coding agents, with Claude Code achieving $1 billion in run-rate revenue by December, and other major labs releasing their own CLI coding agents like Codex CLI and Gemini CLI. Reasoning models proved crucial for driving multi-step agents, enabling effective AI-assisted search and complex code debugging. Chinese models, such as GLM-4.7 and DeepSeek V3.2, dominated open-weight rankings, with DeepSeek R1's January release notably impacting NVIDIA's market cap by $593 billion. The report also highlights the normalization of risky AI practices, termed "YOLO mode," and the rapid increase in long task capabilities, with AI task duration doubling every 7 months.

Key takeaway

For Machine Learning Engineers evaluating LLM trends, recognize that 2025 saw coding agents become a major revenue driver and reasoning models unlock advanced agent capabilities. You should prioritize integrating robust reasoning models into your agent designs to handle complex, multi-step tasks and consider the implications of "YOLO mode" for deployment safety, especially as AI task duration continues to rapidly expand.

Key insights

Coding agents and reasoning models significantly advanced LLM capabilities and market impact in 2025.

Principles

In practice

Topics

Best for: Machine Learning Engineer, NLP Engineer, CTO, AI Engineer, AI Researcher, AI Product Manager

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by AI Newsletter.