I don’t think we are close to “AI scientists”

· Source: Understanding AI · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Advanced, long

Summary

LLMs exhibit an uncanny ability to recognize authors from unpublished prose, as demonstrated by ChatGPT identifying Timothy B. Lee from his 2012 "Great Canadian Maple Syrup Heist" essay after nine paragraphs, despite being unable to articulate specific reasons. This phenomenon highlights that LLMs, like humans, possess implicit knowledge, but unlike humans, their learning capacity is frozen post-training. The article explores the architecture of modern AI agents such as Claude Code and OpenClaw, which utilize external file systems, Unix shells, and cron jobs to manage context and persistent state, leading to increased demand for local hardware like Apple Mac minis. However, this reliance on explicit file storage means agents lose implicit knowledge between sessions, hindering their ability to perform complex, context-dependent tasks requiring continuous learning and deep, evolving understanding, making the prospect of "AI scientists" based on current architectures distant.

Key takeaway

For AI Engineers and Directors of AI/ML evaluating agent capabilities for complex, long-term projects, understand that current LLM-based agents, despite their utility, are fundamentally limited by their reliance on explicit file-based memory and lack of continuous implicit learning. This architecture hinders their ability to develop deep, evolving context crucial for roles like "AI scientists." You should prioritize exploring novel agentic frameworks and transformer architectures that enable persistent implicit knowledge acquisition to overcome these limitations.

Key insights

Current LLM-based AI agents lack continuous implicit learning, limiting their ability to achieve human-level scientific or complex knowledge work.

Principles

Method

The architecture for AI agents combines an LLM, Unix shell, file system (often Markdown), and a cron job for a persistent loop and heartbeat.

In practice

Topics

Best for: AI Scientist, AI Engineer, Director of AI/ML

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Understanding AI.