Generative AI News Rundown - LLM Palooza with Llama, Mistral, Phi & Grok, Plus New Funding, Adobe, Apple, and More - Voicebot Podcast Ep 379
Summary
The generative AI landscape experienced significant activity over two weeks, highlighted by Meta's Llama 3 launch, featuring 8B and 70B parameter models with a 400B model anticipated, demonstrating substantial performance gains over Llama 2. Other notable model releases included X.ai's Grok-1.5V multimodal model, Mistral's efficient 8x22B LLM, and Microsoft's Phi AI models, which are small enough to run on phones. Enterprise adoption data from Andreessen Horowitz indicates 93% of companies use multiple models, with OpenAI dominating production use at nearly 70% market share. Control, customizability, and cost are key drivers for open-source adoption, with 60% of enterprises prioritizing control. The period also saw nine-figure funding rounds for Cognition Labs ($175M at $2B valuation) and Augment ($252M), a $700M acquisition of Run.ai by Nvidia, and Perplexity's enterprise debut alongside a $1B valuation. Apple acquired French AI startup Datakalab, reinforcing its on-device AI focus.
Key takeaway
For CTOs and VP of Engineering evaluating generative AI solutions, the rapid proliferation of high-quality, efficient models like Llama 3 and Microsoft Phi necessitates a flexible, multi-model strategy. Your teams should prioritize open-source options for greater control and cost-efficiency, especially as smaller models achieve "good enough" performance for many use cases. This shift reduces reliance on single vendors and mitigates supply chain risks for GPUs, enabling broader deployment across diverse platforms, including edge devices.
Key insights
The generative AI market is rapidly diversifying with new models, significant investments, and a strong enterprise shift towards multi-model and open-source strategies.
Principles
- Enterprise AI adoption favors multi-model strategies.
- Control and customizability drive open-source preference.
- Data quality and quantity are critical for model performance.
Method
Microsoft's Phi models demonstrate that high-quality, curated training data (e.g., "textbooks are all you need") can achieve GPT-3.5 level performance with significantly fewer parameters, enabling efficient small language models.
In practice
- Evaluate multiple LLMs for diverse use cases.
- Prioritize open-source models for control and customization.
- Consider small language models for on-device or cost-sensitive applications.
Topics
- Large Language Models
- Generative AI Adoption
- Llama 3
- Small Language Models
- AI Funding & Acquisitions
Best for: CTO, VP of Engineering/Data, Machine Learning Engineer, AI Engineer, AI Product Manager, Director of AI/ML
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by The Voicebot Podcast.