Generative AI News Rundown - LLM Palooza with Llama, Mistral, Phi & Grok, Plus New Funding, Adobe, Apple, and More - Voicebot Podcast Ep 379

· Source: The Voicebot Podcast · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Emerging Technologies & Innovation, Cloud Computing & IT Infrastructure · Depth: Intermediate, extended

Summary

The generative AI landscape experienced significant activity over two weeks, highlighted by Meta's Llama 3 launch, featuring 8B and 70B parameter models with a 400B model anticipated, demonstrating substantial performance gains over Llama 2. Other notable model releases included X.ai's Grok-1.5V multimodal model, Mistral's efficient 8x22B LLM, and Microsoft's Phi AI models, which are small enough to run on phones. Enterprise adoption data from Andreessen Horowitz indicates 93% of companies use multiple models, with OpenAI dominating production use at nearly 70% market share. Control, customizability, and cost are key drivers for open-source adoption, with 60% of enterprises prioritizing control. The period also saw nine-figure funding rounds for Cognition Labs ($175M at $2B valuation) and Augment ($252M), a $700M acquisition of Run.ai by Nvidia, and Perplexity's enterprise debut alongside a $1B valuation. Apple acquired French AI startup Datakalab, reinforcing its on-device AI focus.

Key takeaway

For CTOs and VP of Engineering evaluating generative AI solutions, the rapid proliferation of high-quality, efficient models like Llama 3 and Microsoft Phi necessitates a flexible, multi-model strategy. Your teams should prioritize open-source options for greater control and cost-efficiency, especially as smaller models achieve "good enough" performance for many use cases. This shift reduces reliance on single vendors and mitigates supply chain risks for GPUs, enabling broader deployment across diverse platforms, including edge devices.

Key insights

The generative AI market is rapidly diversifying with new models, significant investments, and a strong enterprise shift towards multi-model and open-source strategies.

Principles

Method

Microsoft's Phi models demonstrate that high-quality, curated training data (e.g., "textbooks are all you need") can achieve GPT-3.5 level performance with significantly fewer parameters, enabling efficient small language models.

In practice

Topics

Best for: CTO, VP of Engineering/Data, Machine Learning Engineer, AI Engineer, AI Product Manager, Director of AI/ML

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by The Voicebot Podcast.