AMD-Powered 3D Gaussian Splatting for Autonomous Driving Scenes
Summary
AMD has released a new fork of Street Gaussians (SG), an extension of 3D Gaussian Splatting (3DGS), optimized for efficient training on AMD GPUs using ROCm™. This implementation addresses the limitation of original 3DGS to static scenes by explicitly separating dynamic street scenes into static backgrounds and multiple dynamic foreground objects, each represented by 3D Gaussians. The new release migrates SG to the latest gsplat API (v1.5.3), simplifying the rendering pipeline and reducing Python-side overhead. The workflow, validated on AMD Instinct™ MI300X GPUs, includes installation on ROCm™, data pre-processing using the Waymo Open Dataset, training, and evaluation. New rendering features enable smoother video generation via pose interpolation and scene editing capabilities, such as adjusting vehicle speeds, trajectories, or removing/duplicating objects, to generate novel training data for autonomous driving systems. Performance benchmarks on Waymo scenes show PSNR values up to 37.25, SSIM up to 0.95, and LPIPS as low as 0.19.
Key takeaway
For AI Engineers developing autonomous driving systems, this AMD-optimized Street Gaussians implementation offers a robust platform for dynamic scene reconstruction and editing. You can leverage its explicit object separation to generate synthetic training data for critical scenarios, such as altered vehicle trajectories or speeds, which are difficult to collect in real-world conditions. This capability can significantly enhance the robustness and safety of your autonomous driving models.
Key insights
Street Gaussians on AMD GPUs enables dynamic 3D scene reconstruction and editing for autonomous driving.
Principles
- Decompose dynamic scenes into static and moving components.
- Use Fourier coefficients for time-varying object appearance.
- Combine L1, SSIM, depth, sky, and semantic losses for quality.
Method
The method involves building data pre-processing and Street Gaussian Docker files, downloading and extracting Waymo data, running a data processing pipeline, performing training, and then rendering and evaluating the trained model.
In practice
- Generate novel training data by editing vehicle trajectories.
- Interpolate training poses for smoother, higher framerate videos.
- Isolate moving vehicles or static backgrounds for analysis.
Topics
- 3D Gaussian Splatting
- Street Gaussians
- Autonomous Driving Scenes
- AMD Instinct MI300X
- ROCm Platform
Code references
- LightwheelAI/street-gaussians-ns
- ROCm/gsplat
- nerfstudio-project/gsplat
- silogen/ai-samples
- silogen/ai-samples
Best for: Machine Learning Engineer, Computer Vision Engineer, AI Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by AMD ROCm Blogs.