China’s DeepSeek previews new AI model a year after jolting US rivals

· Source: The Verge · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Emerging Technologies & Innovation · Depth: Fundamental Awareness, quick

Summary

Chinese AI company DeepSeek has released a preview of its next-generation AI model, V4, claiming it can compete with leading closed-source systems from US rivals like Anthropic, Google, and OpenAI. DeepSeek states that V4 significantly improves over previous models, particularly in coding capabilities, which are crucial for AI agents and tools such as ChatGPT Codex and Claude Code. This release also marks a notable achievement for China's chip industry, as DeepSeek explicitly highlights V4's compatibility with domestic Huawei technology. The company previously gained attention with its R1 model, which it claimed was trained at a fraction of the cost of US systems, though V4's training costs and hardware remain undisclosed. DeepSeek has faced accusations of using banned Nvidia chips and misusing Anthropic's Claude to enhance its products.

Key takeaway

For AI Architects evaluating new foundation models, DeepSeek's V4 preview suggests a competitive open-source option, particularly for coding applications and those prioritizing compatibility with Huawei hardware. You should assess V4's performance benchmarks against established models like Anthropic's Claude and OpenAI's systems, especially if seeking alternatives to US-centric AI infrastructure. Be mindful of ongoing geopolitical and intellectual property concerns surrounding DeepSeek's training practices.

Key insights

DeepSeek's V4 model aims to challenge top US AI systems, emphasizing coding prowess and domestic chip compatibility.

Principles

In practice

Topics

Best for: CTO, VP of Engineering/Data, AI Architect, AI Scientist, Director of AI/ML, Tech Journalist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by The Verge.