The Sequence Radar #783: Softbank, DeepSeek, MiniMax and The Sequence 2026

2025-07-08 · Source: TheSequence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Emerging Technologies & Innovation, Cloud Computing & IT Infrastructure · Depth: Advanced, medium

Summary

The first week of 2026 established a clear bifurcation in the AI ecosystem, marked by SoftBank's aggressive infrastructure consolidation and DeepSeek's advancements in algorithmic efficiency. SoftBank Group committed $41 billion to OpenAI and acquired DigitalBridge for $4 billion, aiming to vertically integrate AI models with their underlying data center and edge infrastructure. Concurrently, DeepSeek released DeepSeek-V3.2, an improved Mixture-of-Experts (MoE) architecture featuring Group Relative Policy Optimization (GRPO). This innovation allows DeepSeek to match the reasoning capabilities of models like GPT-5 and Gemini 3.0 Pro with significantly reduced GPU hours. Additionally, Chinese AI unicorn MiniMax filed for a $539 million IPO on the Hong Kong Stock Exchange, signaling a major test for the applied AI market. These events highlight a dual trend of capital-driven dominance and innovation-led efficiency in the evolving AI landscape.

Key takeaway

For AI Architects and Machine Learning Engineers evaluating model deployment strategies, recognize that the industry is splitting between large-scale infrastructure plays and algorithmic efficiency. You should investigate advanced MoE architectures like DeepSeek-V3.2 to achieve competitive reasoning capabilities with fewer compute resources, potentially reducing operational costs and hardware dependencies, especially when facing capital constraints or export controls.

Key insights

AI's future is bifurcated between capital-intensive infrastructure consolidation and efficiency-driven algorithmic innovation.

Principles

Algorithmic innovation can rival capital dominance.
Vertical integration blurs model and infrastructure providers.

Method

DeepSeek-V3.2 utilizes Group Relative Policy Optimization (GRPO) within a Mixture-of-Experts architecture, enabling self-evaluation of group outputs without a resource-heavy critic model during reinforcement learning.

In practice

Explore MoE architectures for training efficiency.
Consider vertical integration for AI deployments.

Topics

AI Investment Strategy
Algorithmic Efficiency
AI Infrastructure
Large Language Models
World Models

Best for: AI Architect, Machine Learning Engineer, NLP Engineer, AI Engineer, AI Researcher, CTO

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by TheSequence.