How far from "Her"

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Emerging Technologies & Innovation · Depth: Intermediate, medium

Summary

The discussion analyzes the feasibility of replicating the AI from the 2013 film "Her," specifically focusing on its real-time interpretation, instantaneous response, and autonomous consciousness. While some argue that current AI, like GPT-4o, can interpret live video and voice in real-time, the core challenge remains persistent, cross-session memory. Large Language Models (LLMs) are described as stateless calculations that simulate conversation through programmatic loops feeding historical context, but they face hard limits on context size and degrade significantly before reaching the end of their context window. The consensus is that true "Her"-level AI, particularly regarding autonomous consciousness and long-term memory, is still years away, with estimates ranging from 1-4 years for advanced capabilities to much longer for transcendence.

Key takeaway

For research scientists evaluating the frontier of AI capabilities, you should recognize that while real-time multimodal interaction is advancing, the fundamental challenge of persistent, cross-session memory remains unsolved for LLMs. Focus your efforts on novel architectural approaches that move beyond context window limitations, rather than solely scaling existing stateless models, to progress towards truly autonomous and continuously aware AI systems.

Key insights

Achieving "Her"-level AI hinges on solving persistent, cross-session memory, a current limitation of stateless LLMs.

Principles

In practice

Topics

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Product Manager

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.