๐บ Google just gave away its best AI
Summary
Four distinct open-source AI models have been released under the Apache 2.0 license, collectively covering a wide range of compute needs from edge devices to cloud agents. Google's Gemma 4, available in four sizes, includes an edge model for Raspberry Pi (under 1.5GB memory) and a 31B model ranking #3 among open models. PrismML introduced 1-bit Bonsai, an 8B model compressed to 1.15GB, achieving 44 tokens/second on an iPhone. H Company launched Holo3, a 10B active parameter computer-use agent that set a new desktop automation benchmark. Arcee AI released Trinity-Large-Thinking, a 400B reasoning model (13B active) that scores #2 on agentic benchmarks at 96% less cost than Claude Opus 4.6. This collective release signifies a major shift, making competitive AI accessible and deployable across the entire compute spectrum.
Key takeaway
For CTOs and VPs of Engineering evaluating AI infrastructure, this proliferation of open-source, Apache 2.0 licensed models fundamentally alters your strategy. You can now own and customize frontier AI weights for diverse applications, from mobile to cloud, at a fraction of proprietary costs, enabling greater control and innovation without vendor lock-in.
Key insights
Open-source AI models now provide competitive, cost-effective solutions across all compute scales, from mobile to data centers.
Principles
- Open licensing (Apache 2.0) removes enterprise adoption friction.
- Specialized AI models complement generalists for diverse tasks.
Method
Deploying a suite of open-source AI models (Gemma 4, Bonsai, Holo3, Trinity) allows for optimized performance and cost efficiency across various device types and computational requirements, from edge to cloud.
In practice
- Run Gemma 4 E2B on Android for offline AI assistance.
- Utilize Bonsai for efficient AI on mobile devices.
- Deploy Holo3 for desktop automation tasks.
Topics
- Google Gemma 4
- Open-Source AI Models
- Apache 2.0 Licensing
- Edge AI
- AI Agents
Code references
Best for: CTO, VP of Engineering/Data, Investor, AI Engineer, Machine Learning Engineer, Director of AI/ML
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by The Neuron.