NOR Flash Next in AI-Driven Memory Crunch
Summary
The NOR flash memory market is experiencing a significant supply crunch, primarily driven by surging demand from AI servers and data centers. The number of NOR devices per server rack has increased dramatically, from 3-5 units to over 30, with NOR content in Nvidia's GB200 NVL72 system potentially exceeding $900 per rack within two years. NOR flash is critical for AI server safe boot, initialization, and firmware storage due to its fast random-access read speeds, high reliability, and low latency. It also supports independent power management for HBM devices and is vital for edge AI in storing operating systems and neural network weights. This increased demand is intensifying competition for production capacity, with reports indicating potential price hikes from major suppliers like Macronix, which is also reportedly scaling down NOR capacity to boost MLC NAND flash production. The industry is also looking towards 3D NOR flash, which promises higher densities and lower latency, but it is still years away from widespread commercial availability.
Key takeaway
For CTOs and VPs of Engineering managing AI infrastructure, the escalating NOR flash supply crunch and potential price hikes necessitate proactive supply chain management. You should evaluate your current and projected NOR flash requirements, explore alternative suppliers, and consider the long-term implications of 3D NOR flash development for future system designs. Prepare for potential cost increases and lead time extensions for critical boot and firmware storage components.
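As a rough aid for the evaluation suggested above, the arithmetic can be sketched in a few lines of Python. This is a minimal sketch, not a forecasting model: the function name `nor_exposure`, the 100-rack deployment, and the 20% price increase are hypothetical placeholders; only the 3-5 and 30 device counts and the $900-per-rack figure come from the summary.

```python
def nor_exposure(racks, cost_per_rack_usd, price_increase_pct):
    """Return (baseline, stressed) NOR spend in USD for a planned deployment."""
    baseline = racks * cost_per_rack_usd
    # Apply the assumed percentage increase to the baseline spend.
    stressed = baseline * (100 + price_increase_pct) / 100
    return baseline, stressed


# Device-count growth cited in the summary: from 3-5 units to over 30 per rack.
growth_low = 30 / 5   # at least 6x when starting from 5 devices
growth_high = 30 / 3  # at least 10x when starting from 3 devices

# Hypothetical inputs: 100 racks and a 20% price increase (both assumptions),
# combined with the summary's $900-per-rack NOR content figure.
baseline, stressed = nor_exposure(racks=100, cost_per_rack_usd=900, price_increase_pct=20)

print(f"NOR devices per rack: {growth_low:.0f}x to {growth_high:.0f}x or more")
print(f"Baseline spend: ${baseline:,.0f}; with assumed increase: ${stressed:,.0f}")
```

Swapping in your own rack counts and supplier quotes gives a first-order view of budget exposure; lead-time risk would need separate treatment.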
Key insights
AI server demand is causing a NOR flash supply crunch. The industry is looking to 3D NOR for higher density and lower latency, but it remains years from widespread commercial availability.
Principles
- High reliability is crucial for AI server boot processes.
- Deterministic read latencies benefit edge AI model storage.
In practice
- Use NOR flash for AI server firmware and boot code.
- Use NOR flash to support independent power management for HBM devices.
- Store neural network weights on NOR for edge inference.
Topics
- NOR Flash Memory
- AI Servers
- Memory Supply Chain
- 3D NOR Flash
- Edge AI
Best for: Investor, CTO, VP of Engineering/Data, AI Engineer, AI Architect, AI Operations Specialist
Editorial summary, takeaway, and curation by AIssential. Original article published by Big Data & AI News - EE Times.