Sculpting NeRF Geometry: Human-Preference Fine-Tuning of a 3D-Aware Face GAN

· Source: Computer Vision and Pattern Recognition · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Computer Vision · Depth: Expert, quick

Summary

A novel method fine-tunes a pretrained 3D-aware generative model, specifically EG3D, to sculpt Neural Radiance Field (NeRF) geometry using human preferences. Unlike existing reinforcement learning from human feedback (RLHF) pipelines that optimize explicit surface representations, this approach directly learns a reward over radiance-field density (σ) values without external mesh or shape priors. The reward model, which requires no pretraining and trains on a small set of preference samples, directly reads the continuous 3D density field of the NeRF, providing a geometry-only learning signal. This process avoids text conditioning, mesh extraction, or multi-view rendering. While a density-consistency constraint maintains 2D appearance, the FID-50k score rises from 4.09 to 6.66. The fine-tuned generator, based on a single annotator's preferences, achieved user preference in 74.4% of pairwise comparisons for face geometries.

Key takeaway

For Computer Vision Engineers developing 3D generative models, consider directly fine-tuning NeRF-based GANs with human preference reinforcement learning. This method allows you to sculpt 3D geometry by learning rewards over radiance-field density, bypassing mesh extraction or multi-view rendering. You can achieve significant user preference for geometry, even with a small dataset of human preferences, though expect a measurable increase in FID-50k.

Key insights

Fine-tuning 3D-aware GANs directly with human preference rewards on NeRF density improves geometry without explicit surface supervision.

Principles

Method

Fine-tune a pretrained 3D-aware GAN (EG3D) using a learned reward model over radiance-field density (σ) values. The reward model, trained on preference samples, directly reads the NeRF's 3D density field, applying a geometry-only signal.

In practice

Topics

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Computer Vision Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.