Sculpting NeRF Geometry: Human-Preference Fine-Tuning of a 3D-Aware Face GAN
Summary
A novel method fine-tunes a pretrained 3D-aware generative model, specifically EG3D, to sculpt Neural Radiance Field (NeRF) geometry using human preferences. Unlike existing reinforcement learning from human feedback (RLHF) pipelines that optimize explicit surface representations, this approach directly learns a reward over radiance-field density (σ) values without external mesh or shape priors. The reward model, which requires no pretraining and trains on a small set of preference samples, directly reads the continuous 3D density field of the NeRF, providing a geometry-only learning signal. This process avoids text conditioning, mesh extraction, or multi-view rendering. While a density-consistency constraint maintains 2D appearance, the FID-50k score rises from 4.09 to 6.66. The fine-tuned generator, based on a single annotator's preferences, achieved user preference in 74.4% of pairwise comparisons for face geometries.
Key takeaway
For Computer Vision Engineers developing 3D generative models, consider directly fine-tuning NeRF-based GANs with human preference reinforcement learning. This method allows you to sculpt 3D geometry by learning rewards over radiance-field density, bypassing mesh extraction or multi-view rendering. You can achieve significant user preference for geometry, even with a small dataset of human preferences, though expect a measurable increase in FID-50k.
Key insights
Fine-tuning 3D-aware GANs directly with human preference rewards on NeRF density improves geometry without explicit surface supervision.
Principles
- Direct density-field reward enables geometry-only learning.
- Small preference samples can yield robust 3D geometry improvement.
- Density-consistency helps preserve 2D appearance during reshaping.
Method
Fine-tune a pretrained 3D-aware GAN (EG3D) using a learned reward model over radiance-field density (σ) values. The reward model, trained on preference samples, directly reads the NeRF's 3D density field, applying a geometry-only signal.
In practice
- Apply human preference RLHF directly to NeRF density fields.
- Use density-consistency constraints to maintain 2D quality.
- Train reward models with minimal preference data for 3D geometry.
Topics
- Neural Radiance Fields
- Generative Adversarial Networks
- Reinforcement Learning from Human Feedback
- 3D Face Generation
- EG3D
- Computer Vision
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Computer Vision Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.