PortraitGen: Exemplar-Driven GRPO with Dual-Reward Guidance for Photorealistic Portrait Generation
Summary
PortraitGen is a new framework designed for photorealistic portrait generation, addressing limitations in existing Reinforcement Learning methods like Group Relative Policy Optimization (GRPO) for text-to-image post-training. Current GRPO approaches often produce superficial aesthetics but fail to resolve critical AI artifacts and biological implausibilities, primarily due to the absence of real images during post-training and a lack of specific rewards for fine-grained artifact suppression. PortraitGen tackles these issues by directly incorporating real images into GRPO sampling groups via image inversion, thereby breaking inherent generative boundaries. Furthermore, it introduces a dual-reward mechanism, comprising OmniReward for general quality and AI-Portrait for human-centric fidelity, to explicitly guide the model towards photorealism. Evaluated on the newly curated PortraitBench benchmark, PortraitGen significantly outperforms existing baselines, effectively suppressing AI artifacts and achieving unprecedented photorealism.
Key takeaway
For Machine Learning Engineers developing photorealistic image generation models, you should consider integrating real images directly into your reinforcement learning post-training. This approach, exemplified by PortraitGen's use of image inversion and dual-reward guidance (OmniReward, AI-Portrait), significantly suppresses AI artifacts and improves human-centric fidelity. Adopt domain-specific benchmarks like PortraitBench to validate true photorealism beyond superficial aesthetics.
Key insights
Photorealistic portrait generation improves by integrating real images and dual-reward guidance into RL post-training.
Principles
- Generative boundaries are broken by real image integration.
- Dual-reward mechanisms enhance fine-grained fidelity.
- RL post-training benefits from targeted artifact suppression.
Method
PortraitGen integrates real images into GRPO sampling via image inversion and employs a dual-reward system: OmniReward for general quality and AI-Portrait for human-centric fidelity.
In practice
- Use image inversion to introduce real data into RL training.
- Implement specific rewards for fine-grained artifact reduction.
- Develop custom benchmarks for domain-specific evaluation.
Topics
- Photorealistic Portrait Generation
- Reinforcement Learning
- GRPO
- Image Inversion
- Dual-Reward Systems
- AI Artifact Suppression
- PortraitBench
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Computer Vision Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.