I don’t think we are close to “AI scientists”

2023-07-27 · Source: Understanding AI · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Advanced, long

Summary

LLMs exhibit an uncanny ability to recognize authors from unpublished prose, as demonstrated by ChatGPT identifying Timothy B. Lee from his 2012 "Great Canadian Maple Syrup Heist" essay after nine paragraphs, despite being unable to articulate specific reasons. This phenomenon highlights that LLMs, like humans, possess implicit knowledge, but unlike humans, their learning capacity is frozen post-training. The article explores the architecture of modern AI agents such as Claude Code and OpenClaw, which utilize external file systems, Unix shells, and cron jobs to manage context and persistent state, leading to increased demand for local hardware like Apple Mac minis. However, this reliance on explicit file storage means agents lose implicit knowledge between sessions, hindering their ability to perform complex, context-dependent tasks requiring continuous learning and deep, evolving understanding, making the prospect of "AI scientists" based on current architectures distant.

Key takeaway

For AI Engineers and Directors of AI/ML evaluating agent capabilities for complex, long-term projects, understand that current LLM-based agents, despite their utility, are fundamentally limited by their reliance on explicit file-based memory and lack of continuous implicit learning. This architecture hinders their ability to develop deep, evolving context crucial for roles like "AI scientists." You should prioritize exploring novel agentic frameworks and transformer architectures that enable persistent implicit knowledge acquisition to overcome these limitations.

Key insights

Current LLM-based AI agents lack continuous implicit learning, limiting their ability to achieve human-level scientific or complex knowledge work.

Principles

LLMs' implicit knowledge is fixed post-training.
Agent coherence relies on explicit file-based state.
Human expertise compounds through implicit knowledge.

Method

The architecture for AI agents combines an LLM, Unix shell, file system (often Markdown), and a cron job for a persistent loop and heartbeat.

In practice

Use local agents like OpenClaw on Mac minis.
Agents can self-modify by rewriting their own files.
Compaction can lead to data loss in chatbots.

Topics

LLM Limitations
AI Agent Architectures
Implicit Knowledge
Continuous Learning
Context Management
OpenClaw Framework

Best for: AI Scientist, AI Engineer, Director of AI/ML

Related on AIssential

See Counsel's argued verdicts on the open AI decisions leaders are weighing →

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Understanding AI.