๐Ÿ˜บ Google just gave away its best AI

ยท Source: The Neuron ยท Field: Technology & Digital โ€” Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems, Emerging Technologies & Innovation ยท Depth: Intermediate, long

Summary

Four distinct open-source AI models have been released under the Apache 2.0 license, collectively covering a wide range of compute needs from edge devices to cloud agents. Google's Gemma 4, available in four sizes, includes an edge model for Raspberry Pi (under 1.5GB memory) and a 31B model ranking #3 among open models. PrismML introduced 1-bit Bonsai, an 8B model compressed to 1.15GB, achieving 44 tokens/second on an iPhone. H Company launched Holo3, a 10B active parameter computer-use agent that set a new desktop automation benchmark. Arcee AI released Trinity-Large-Thinking, a 400B reasoning model (13B active) that scores #2 on agentic benchmarks at 96% less cost than Claude Opus 4.6. This collective release signifies a major shift, making competitive AI accessible and deployable across the entire compute spectrum.

Key takeaway

For CTOs and VPs of Engineering evaluating AI infrastructure, this proliferation of open-source, Apache 2.0 licensed models fundamentally alters your strategy. You can now own and customize frontier AI weights for diverse applications, from mobile to cloud, at a fraction of proprietary costs, enabling greater control and innovation without vendor lock-in.

Key insights

Open-source AI models now provide competitive, cost-effective solutions across all compute scales, from mobile to data centers.

Principles

Method

Deploying a suite of open-source AI models (Gemma 4, Bonsai, Holo3, Trinity) allows for optimized performance and cost efficiency across various device types and computational requirements, from edge to cloud.

In practice

Topics

Code references

Best for: CTO, VP of Engineering/Data, Investor, AI Engineer, Machine Learning Engineer, Director of AI/ML

Related on AIssential

Open in AIssential โ†’

Editorial summary, takeaway, and curation by AIssential. Original article published by The Neuron.