Gemma 4 Runs on Your Phone

Source: unwind ai · Field: Technology & Digital (Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems, Software Development & Engineering) · Depth: Intermediate, medium

Summary

Google has released Gemma 4, a new family of open-source, multimodal AI models under the Apache 2.0 license, ranging from a 2B edge model for devices like Raspberry Pi to a 31B dense model that ranks #3 on Arena AI's text leaderboard. These models, built on Gemini 3 research, are designed for agentic workflows with native function calling, structured output, and up to 256K context windows. Concurrently, Cursor 3 has launched, transforming its IDE into a unified workspace for managing fleets of coding agents across multiple repositories, featuring Composer 2, an integrated browser, and a plugin marketplace. Anthropic also introduced a "NO_FLICKER" mode for Claude Code, enhancing terminal-based coding sessions by virtualizing the viewport to eliminate screen flickering.

Key takeaway

For AI Architects evaluating model deployment strategies, Gemma 4's Apache 2.0 license and edge-native capabilities for devices like Raspberry Pi present a compelling option for cost-effective, offline agentic applications. Consider integrating these smaller, efficient models into your infrastructure to expand AI functionality beyond cloud-dependent solutions, especially for scenarios requiring low latency or privacy-sensitive local processing.

Key insights

Agentic AI models are getting smaller and more efficient, and agent tooling is moving directly into developer workflows such as IDEs and terminal sessions.

Method

In Karpathy's workflow for self-improving wikis, an LLM compiles raw research into interconnected Markdown files, and the answers to later queries are written back into the wiki as new pages.
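
The loop described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not Karpathy's actual implementation: the helper names (`write_page`, `ask`, `page_titles`), the file layout, and the `fake_llm` stand-in are all hypothetical, and a real setup would pass a function that calls an actual model.

```python
import re
import tempfile
from pathlib import Path
from typing import Callable


def slugify(title: str) -> str:
    """Turn a page title into a file-name-safe slug."""
    return re.sub(r"[^a-z0-9]+", "-", title.lower()).strip("-")


def page_titles(wiki: Path) -> dict[str, str]:
    """Map each existing page title (its first '# ' line) to its file stem."""
    titles = {}
    for path in sorted(wiki.glob("*.md")):
        first_line = path.read_text(encoding="utf-8").splitlines()[0]
        titles[first_line.removeprefix("# ")] = path.stem
    return titles


def write_page(wiki: Path, title: str, body: str) -> Path:
    """Save one Markdown page, linking mentions of other existing pages."""
    for other, slug in page_titles(wiki).items():
        if other != title:
            link = f"[{other}]({slug}.md)"
            body = re.sub(rf"\b{re.escape(other)}\b", lambda m: link, body)
    path = wiki / f"{slugify(title)}.md"
    path.write_text(f"# {title}\n\n{body}\n", encoding="utf-8")
    return path


def ask(wiki: Path, question: str, llm: Callable[[str], str]) -> Path:
    """Answer a query from the wiki, then store the answer as a new page."""
    context = "\n\n".join(
        p.read_text(encoding="utf-8") for p in sorted(wiki.glob("*.md"))
    )
    answer = llm(f"Wiki so far:\n{context}\n\nQuestion: {question}")
    return write_page(wiki, question, answer)


def fake_llm(prompt: str) -> str:
    """Hypothetical stand-in for a real model call."""
    return "Edge Models suit offline use."


wiki = Path(tempfile.mkdtemp())
write_page(wiki, "Edge Models", "Small models that run on local hardware.")
new_page = ask(wiki, "When to run locally", fake_llm)
print(new_page.read_text(encoding="utf-8"))
```

Because `ask` writes its answer through the same `write_page` path used for raw research, each query leaves the wiki one page larger and cross-linked to earlier pages, which is the feedback loop the workflow relies on.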

Best for: CTO, VP of Engineering/Data, AI Architect, AI Engineer, Machine Learning Engineer, Director of AI/ML

Editorial summary, takeaway, and curation by AIssential. Original article published by unwind ai.