What Matters in Practical Learned Image Compression

· Source: Apple Machine Learning Research · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Internet of Things (IoT) & Connected Devices · Depth: Expert, quick

Summary

Apple researchers have developed a new learned image codec, optimized for both perceptual quality and on-device runtime, aiming to bridge the gap between potential and practical application of learned codecs. The study involved a comprehensive analysis of key modeling choices and novel techniques, followed by a performance-aware neural architecture search across millions of configurations. This process identified models that meet specific on-device runtime targets while maximizing compression performance based on perceptual metrics. The resulting codec demonstrates significant improvements, achieving 2.3–3x bitrate savings compared to traditional codecs like AV1, AV2, VVC, ECM, and JPEG-AI, and 20–40% savings against leading learned codecs. Notably, it encodes 12MP images on an iPhone 17 Pro Max in 230ms and decodes them in 150ms, outperforming many ML-based codecs running on a V100 GPU.

Key takeaway

For AI Engineers developing mobile-first image processing applications, this research indicates that jointly optimizing for perceptual quality and on-device runtime can yield substantial performance gains. You should explore neural architecture search techniques to identify codec configurations that meet specific hardware constraints, as demonstrated by the 230ms encode and 150ms decode times for 12MP images on an iPhone 17 Pro Max, while also achieving significant bitrate savings.

Key insights

Optimizing learned image codecs for both perceptual quality and on-device speed yields superior compression and performance.

Principles

Method

A comprehensive study of modeling choices, including novel techniques, followed by performance-aware neural architecture search to identify optimal backbone configurations for target runtime and perceptual quality.

In practice

Topics

Code references

Best for: AI Engineer, Computer Vision Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, AI Hardware Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Apple Machine Learning Research.