Normalizing Flows with Iterative Denoising

· Source: Apple Machine Learning Research · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

Iterative TARFlow (iTARFlow) is a new Normalizing Flow (NF) generative model that achieves competitive performance on image modeling tasks, specifically across ImageNet resolutions of 64, 128, and 256 pixels. This model advances prior efforts like TARFlow by maintaining a fully end-to-end, likelihood-based objective during training, distinguishing it from diffusion models. During sampling, iTARFlow employs an autoregressive generation step followed by an iterative denoising procedure, drawing inspiration from diffusion-style methods. The research also includes an analysis of characteristic artifacts produced by iTARFlow, providing insights for future enhancements to Normalizing Flow generative models. Source code for iTARFlow is publicly available on GitHub.

Key takeaway

For research scientists exploring generative models, iTARFlow presents a strong alternative to diffusion models, particularly for image generation tasks. You should consider evaluating iTARFlow's performance on your specific datasets, especially if an end-to-end likelihood-based objective is a priority. Investigating its characteristic artifacts could also inform novel architectural improvements for Normalizing Flows.

Key insights

iTARFlow advances Normalizing Flows with a likelihood-based objective and diffusion-inspired iterative denoising for image generation.

Principles

Method

iTARFlow trains with an end-to-end likelihood objective, then samples via autoregressive generation followed by iterative denoising inspired by diffusion models.

In practice

Topics

Code references

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Computer Vision Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Apple Machine Learning Research.