From Pixels to DNA: Why the Future of Compression Is About Every Kind of Data

· Source: Towards Data Science · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering, Emerging Technologies & Innovation · Depth: Advanced, long

Summary

The landscape of data compression has expanded significantly beyond traditional audio and video, now encompassing diverse data types like genomes, point clouds, 3D scenes, and neural networks, driven by an exponential increase in global data generation, projected to reach 175 zettabytes by 2025. Organizations like ISO/IEC JTC 1/SC 29, responsible for JPEG and MPEG standards, are broadening their scope from "media for humans" to "data for humans and machines." Key developments include JPEG AI, a learning-based image coding standard using latent tensors for both human viewing and machine analysis; JPEG Trust, a framework for digital image authenticity and provenance; JPEG Pleno for plenoptic data like light fields and holograms; JPEG XS for ultra-low latency video; and JPEG DNA for storing images in biological molecules. MPEG continues to evolve video codecs, with the H.267 standard aiming for a ~40% bitrate reduction over VVC by 2028, while also prioritizing energy efficiency and integrating AI-based tools like Neural Network Video Coding (NNVC). Furthermore, new standards like Video Coding for Machines (VCM) and Feature Coding for Machines (FCM) are emerging to optimize visual data for machine tasks, with FCM offering up to 97% bandwidth savings by compressing intermediate neural features instead of pixels. Neural Network Coding (NNC) can compress deep neural networks by up to 97% without accuracy loss, and Gaussian Splat Coding (GSC) addresses 3D scene compression. Audio standards like MPEG-H Audio are moving towards object-based approaches for immersive and personalized experiences, including features like Dialog+ for enhanced speech clarity.

Key takeaway

For Computer Vision Engineers developing systems that process visual data, you should prioritize compression standards that support both human and machine consumption, such as JPEG AI and MPEG's VCM/FCM. These advancements enable significant bandwidth savings and enhanced privacy by processing intermediate features rather than raw pixels, while also integrating authenticity frameworks like JPEG Trust. Your adoption of these evolving standards will be crucial for building scalable, energy-efficient, and trustworthy AI-driven applications.

Key insights

Data compression is evolving into a foundational technology for all digital data, shifting from human-centric media to machine-centric semantics.

Principles

Method

JPEG AI transforms images into latent tensors for compression, enabling both human viewing and direct machine analysis. FCM compresses intermediate neural features from edge devices for cloud inference, reducing bandwidth and enhancing privacy.

In practice

Topics

Best for: Computer Vision Engineer, AI Engineer, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Towards Data Science.