Gemma 4 Runs on Your Phone
Summary
Google has released Gemma 4, a new family of open-source, multimodal AI models under the Apache 2.0 license, ranging from a 2B edge model for devices like Raspberry Pi to a 31B dense model that ranks #3 on Arena AI's text leaderboard. These models, built on Gemini 3 research, are designed for agentic workflows with native function calling, structured output, and up to 256K context windows. Concurrently, Cursor 3 has launched, transforming its IDE into a unified workspace for managing fleets of coding agents across multiple repositories, featuring Composer 2, an integrated browser, and a plugin marketplace. Anthropic also introduced a "NO_FLICKER" mode for Claude Code, enhancing terminal-based coding sessions by virtualizing the viewport to eliminate screen flickering.
Key takeaway
For AI Architects evaluating model deployment strategies, Gemma 4's Apache 2.0 license and edge-native capabilities for devices like Raspberry Pi present a compelling option for cost-effective, offline agentic applications. Consider integrating these smaller, efficient models into your infrastructure to expand AI functionality beyond cloud-dependent solutions, especially for scenarios requiring low latency or privacy-sensitive local processing.
Key insights
Agentic AI models are becoming smaller, more efficient, and integrated into developer workflows.
Principles
- Edge-native models enable offline, low-latency AI applications.
- Agentic design requires built-in reasoning and structured output.
- Parallel agent management enhances developer productivity.
Method
Karpathy's workflow for self-improving wikis involves LLMs compiling raw research into interconnected Markdown files, with queries generating new content that loops back into the wiki.
In practice
- Deploy Gemma 4's 2B model on a Raspberry Pi for offline AI.
- Use Cursor 3 to manage multiple coding agents simultaneously.
- Enable CLAUDE_CODE_NO_FLICKER=1 for flicker-free Claude Code sessions.
Topics
- Gemma 4
- AI Agents
- Open-Source Models
- Coding Agents
- Edge AI
Code references
Best for: CTO, VP of Engineering/Data, AI Architect, AI Engineer, Machine Learning Engineer, Director of AI/ML
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by unwind ai.