Google AI Edge Gallery

· Source: Simon Willison's Weblog · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering, Emerging Technologies & Innovation · Depth: Intermediate, quick

Summary

Google has released the "Google AI Edge Gallery" app, enabling users to run Gemma 4 (E2B and E4B sizes) and some Gemma 3 models directly on an iPhone. The E2B model, a 2.54GB download, demonstrates fast and useful performance. The app includes features like image-based questioning and audio transcription for up to 30 seconds using the smaller Gemma 4 models. It also showcases an "agent skills" demo, illustrating tool calling against eight interactive HTML widgets such as an interactive map, Wikipedia query, and QR code generator. This marks a notable instance of a local model vendor providing an official app for on-device model execution on iPhones, though it currently lacks permanent conversation logs.

Key takeaway

For AI Product Managers evaluating edge AI solutions, Google's AI Edge Gallery demonstrates a viable path for deploying large language models and agentic capabilities directly on mobile devices. Your team should explore the app's performance and tool-calling features to understand the potential for offline functionality and reduced latency in future applications, despite the current lack of persistent conversation logs.

Key insights

Google's new app enables on-device execution of Gemma models and tool calling directly on iPhones.

Principles

In practice

Topics

Best for: NLP Engineer, AI Product Manager, Entrepreneur, AI Engineer, Machine Learning Engineer, AI Student

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Simon Willison's Weblog.