A Smarter Google AI Edge Gallery: MCP integration, notifications, and session continuity
Summary
Google AI Edge Gallery, an on-device AI showcase app for Android and iOS, has been updated to support the Model Context Protocol (MCP), local notification reminders, and persistent chat history. This expansion enables developers to build more connected, proactive, and persistent agentic experiences using Gemma and other open models directly on mobile devices. The MCP integration, currently an experimental feature on Android, allows on-device LLMs to interact with external tools and data sources via Streamable HTTP, with reasoning and decision-making occurring locally. New "Schedule Notification" skills facilitate automated routines, such as mood tracking or calendar briefings. Additionally, the app now offers persistent chat history, leveraging LiteRT-LM backend's fast prefill capability (exceeding 3,000 tokens per second), and allows direct editing of custom system prompts for enhanced developer control.
Key takeaway
For AI Architects and Machine Learning Engineers building mobile-first AI applications, the Google AI Edge Gallery updates offer critical capabilities for creating sophisticated on-device agents. You should explore integrating the Model Context Protocol (MCP) to enable your LLMs to interact with external tools and data sources, and leverage the new notification and persistent chat features to build proactive, stateful user experiences. Consider the open-source toolkit and community contributions for accelerating your development of tailored, utility-focused workflows.
Key insights
Google AI Edge Gallery enhances on-device AI with MCP, notifications, and persistent chat for agentic mobile experiences.
Principles
- On-device LLMs require standardized external interaction.
- Smaller context windows necessitate concise tool descriptions.
- Proactive interactions enhance automated use cases.
Method
Integrate MCP via a registered URL to dynamically import tool definitions and resource schemas into the on-device model's system prompt, enabling local reasoning and tool execution by an MCP server.
In practice
- Use MCP for on-device LLM tool calling.
- Keep MCP tool descriptions short for efficiency.
- Experiment with custom system prompts for agent personas.
Topics
- Google AI Edge Gallery
- Model Context Protocol
- On-device AI
- Agentic Workflows
- Local Notifications
Code references
Best for: AI Architect, Machine Learning Engineer, NLP Engineer, AI Engineer, Software Engineer, Prompt Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Google Developers Blog - AI.