Anthropic: Our AI just created a tool that can ‘automate all white collar work’, Me:
Summary
Anthropic's new Claude Co-work tool, powered by Claude Opus 4.5, has drawn significant attention for automating non-coding tasks, and some commentators have gone as far as calling its underlying model AGI. Practical testing shows that Claude Co-work can generate impressive initial drafts and plans but remains brittle and makes factual errors, such as reporting incorrect sports league positions. Despite these limitations, an OpenAI paper from October 2025 suggests that a "productivity multiplier" is achieved when humans review and edit AI-generated drafts rather than creating content from scratch. Data from Oxford Economics (January 7, 2026) indicates that AI's impact on overall labor productivity and unemployment remains limited, and that companies may be attributing job cuts to AI for investor messaging rather than because of actual productivity gains. The article explores why LLMs show both advanced capabilities and fundamental errors. It attributes this to two things: "understanding" comes in multiple levels, from simple conceptual connections to principled derivation of rules, and models pragmatically toggle between deep algorithms and shallow memorization.
Key takeaway
For Machine Learning Engineers and NLP Engineers evaluating the practical utility of advanced LLMs like Claude Co-work, recognize that while these tools are not AGI and can make errors, they offer substantial productivity gains when integrated into a human-in-the-loop workflow. Your focus should shift from expecting perfect autonomous output to leveraging AI for rapid drafting and iteration, reserving human expertise for critical review and refinement to achieve optimal results.
Key insights
LLMs offer significant productivity gains through human-AI co-creation, despite inherent brittleness and factual inaccuracies.
Principles
- AI productivity is maximized by human review and editing.
- LLMs exhibit multiple levels of understanding.
- Models toggle between deep algorithms and shallow heuristics.
Method
Achieve a "productivity multiplier" by having models generate the initial drafts and having humans review, edit, and iterate on them, rather than writing everything from scratch.
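The draft-then-review workflow can be sketched as a small loop. This is a minimal illustration only, not anything described in the original article: `Review`, `draft_review_loop`, `generate`, `review`, and `max_rounds` are all hypothetical names, and `generate` stands in for whichever model call you use while `review` stands in for the human step.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Review:
    """Outcome of one human pass over a draft (hypothetical structure)."""
    approved: bool      # True once the reviewer signs off
    edited_text: str    # the draft after the reviewer's own edits
    notes: str = ""     # feedback passed to the next model revision


def draft_review_loop(
    task: str,
    generate: Callable[[str, Optional[str]], str],
    review: Callable[[str], Review],
    max_rounds: int = 3,
) -> tuple[str, bool]:
    """Return (text, approved). The model drafts; a human edits and decides."""
    text = generate(task, None)  # the model writes the first draft
    for _ in range(max_rounds):
        result = review(text)  # the human fact-checks and edits
        if result.approved:
            return result.edited_text, True
        # Feed the human's edits and notes back for another revision.
        feedback = f"{result.edited_text}\n\nReviewer notes: {result.notes}"
        text = generate(task, feedback)
    return text, False  # never approved: treat as an unreviewed draft


# Usage with stand-in functions (assumptions, not a real model or reviewer).
def fake_model(task: str, feedback: Optional[str]) -> str:
    return "draft v1" if feedback is None else "draft v2"


def fake_reviewer(text: str) -> Review:
    return Review(approved=(text == "draft v2"), edited_text=text,
                  notes="check the league table")


final, ok = draft_review_loop("summarise standings", fake_model, fake_reviewer)
```

The loop returns an explicit `approved` flag so that a draft which runs out of review rounds is never mistaken for checked output, which matches the advice to treat AI text as a draft rather than a final product.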
In practice
- Use AI for drafting and ideation, not final output.
- Always fact-check AI-generated content.
- Compare different frontier models for specific use cases.
Topics
- Claude Co-work
- AI Automation
- LLM Brittleness
- AI Productivity
- Labor Market Impact
Best for: Machine Learning Engineer, NLP Engineer, AI Engineer, Software Engineer, AI Researcher
Editorial summary, takeaway, and curation by AIssential. Original article published by AI Explained.