I don’t think we are close to “AI scientists”
Summary
LLMs exhibit an uncanny ability to recognize authors from unpublished prose, as demonstrated by ChatGPT identifying Timothy B. Lee from his 2012 "Great Canadian Maple Syrup Heist" essay after nine paragraphs, despite being unable to articulate specific reasons. This phenomenon highlights that LLMs, like humans, possess implicit knowledge, but unlike humans, their learning capacity is frozen post-training. The article explores the architecture of modern AI agents such as Claude Code and OpenClaw, which utilize external file systems, Unix shells, and cron jobs to manage context and persistent state, leading to increased demand for local hardware like Apple Mac minis. However, this reliance on explicit file storage means agents lose implicit knowledge between sessions, hindering their ability to perform complex, context-dependent tasks requiring continuous learning and deep, evolving understanding, making the prospect of "AI scientists" based on current architectures distant.
Key takeaway
For AI Engineers and Directors of AI/ML evaluating agent capabilities for complex, long-term projects, understand that current LLM-based agents, despite their utility, are fundamentally limited by their reliance on explicit file-based memory and lack of continuous implicit learning. This architecture hinders their ability to develop deep, evolving context crucial for roles like "AI scientists." You should prioritize exploring novel agentic frameworks and transformer architectures that enable persistent implicit knowledge acquisition to overcome these limitations.
Key insights
Current LLM-based AI agents lack continuous implicit learning, limiting their ability to achieve human-level scientific or complex knowledge work.
Principles
- LLMs' implicit knowledge is fixed post-training.
- Agent coherence relies on explicit file-based state.
- Human expertise compounds through implicit knowledge.
Method
The architecture for AI agents combines an LLM, Unix shell, file system (often Markdown), and a cron job for a persistent loop and heartbeat.
In practice
- Use local agents like OpenClaw on Mac minis.
- Agents can self-modify by rewriting their own files.
- Compaction can lead to data loss in chatbots.
Topics
- LLM Limitations
- AI Agent Architectures
- Implicit Knowledge
- Continuous Learning
- Context Management
- OpenClaw Framework
Best for: AI Scientist, AI Engineer, Director of AI/ML
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Understanding AI.