Google AI Edge Gallery
Summary
Google has released the "Google AI Edge Gallery" app, enabling users to run Gemma 4 (E2B and E4B sizes) and some Gemma 3 models directly on an iPhone. The E2B model, a 2.54GB download, demonstrates fast and useful performance. The app includes features like image-based questioning and audio transcription for up to 30 seconds using the smaller Gemma 4 models. It also showcases an "agent skills" demo, illustrating tool calling against eight interactive HTML widgets such as an interactive map, Wikipedia query, and QR code generator. This marks a notable instance of a local model vendor providing an official app for on-device model execution on iPhones, though it currently lacks permanent conversation logs.
Key takeaway
For AI Product Managers evaluating edge AI solutions, Google's AI Edge Gallery demonstrates a viable path for deploying large language models and agentic capabilities directly on mobile devices. Your team should explore the app's performance and tool-calling features to understand the potential for offline functionality and reduced latency in future applications, despite the current lack of persistent conversation logs.
Key insights
Google's new app enables on-device execution of Gemma models and tool calling directly on iPhones.
Principles
- On-device AI enhances privacy and speed.
- Tool calling expands local model utility.
In practice
- Run Gemma 4 E2B/E4B models locally.
- Experiment with agentic tool calling demos.
Topics
- Google AI Edge Gallery
- Gemma Models
- On-device AI
- iPhone Applications
- Tool Calling
Best for: NLP Engineer, AI Product Manager, Entrepreneur, AI Engineer, Machine Learning Engineer, AI Student
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Simon Willison's Weblog.