Sculpting NeRF Geometry: Human-Preference Fine-Tuning of a 3D-Aware Face GAN

2026-06-25 · Source: Computer Vision and Pattern Recognition · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Computer Vision · Depth: Expert, quick

Summary

A novel method fine-tunes a pretrained 3D-aware generative model, specifically EG3D, to sculpt Neural Radiance Field (NeRF) geometry using human preferences. Unlike existing reinforcement learning from human feedback (RLHF) pipelines that optimize explicit surface representations, this approach directly learns a reward over radiance-field density (σ) values without external mesh or shape priors. The reward model, which requires no pretraining and trains on a small set of preference samples, directly reads the continuous 3D density field of the NeRF, providing a geometry-only learning signal. This process avoids text conditioning, mesh extraction, or multi-view rendering. While a density-consistency constraint maintains 2D appearance, the FID-50k score rises from 4.09 to 6.66. The fine-tuned generator, based on a single annotator's preferences, achieved user preference in 74.4% of pairwise comparisons for face geometries.

Key takeaway

For Computer Vision Engineers developing 3D generative models, consider directly fine-tuning NeRF-based GANs with human preference reinforcement learning. This method allows you to sculpt 3D geometry by learning rewards over radiance-field density, bypassing mesh extraction or multi-view rendering. You can achieve significant user preference for geometry, even with a small dataset of human preferences, though expect a measurable increase in FID-50k.

Key insights

Fine-tuning 3D-aware GANs directly with human preference rewards on NeRF density improves geometry without explicit surface supervision.

Principles

Direct density-field reward enables geometry-only learning.
Small preference samples can yield robust 3D geometry improvement.
Density-consistency helps preserve 2D appearance during reshaping.

Method

Fine-tune a pretrained 3D-aware GAN (EG3D) using a learned reward model over radiance-field density (σ) values. The reward model, trained on preference samples, directly reads the NeRF's 3D density field, applying a geometry-only signal.

In practice

Apply human preference RLHF directly to NeRF density fields.
Use density-consistency constraints to maintain 2D quality.
Train reward models with minimal preference data for 3D geometry.

Topics

Neural Radiance Fields
Generative Adversarial Networks
Reinforcement Learning from Human Feedback
3D Face Generation
EG3D
Computer Vision

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Computer Vision Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.