DeepJEB++: Foundation Model-Driven Large-Scale 3D Engineering Dataset via 2D Latent Space Augmentation

· Source: Machine Learning · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics, Engineering & Applied Sciences · Depth: Expert, quick

Summary

DeepJEB++ is a foundation-model-driven data-augmentation framework designed to generate large-scale, simulation-labeled 3D engineering datasets from small seed sets. It addresses limitations in existing 3D augmentation techniques and automates the simulation-labeling process. The framework expands a small set of jet engine brackets (fewer than 400 designs) into 15,360 simulation-labeled 3D brackets, achieving a 40x expansion using a single GPU per stage. Its core methodology involves augmenting data in a 2D latent space before transferring to 3D. This process includes fine-tuning a 2D latent diffusion model, synthesizing novel views with a vision-language-model (VLM) quality filter, lifting validated images to 3D meshes, and automatically assigning finite-element labels like mass, stress, and displacement.

Key takeaway

For Machine Learning Engineers or Research Scientists constrained by limited 3D engineering data or manual simulation labeling, DeepJEB++ offers a scalable solution. You should consider adopting this 2D-to-3D augmentation strategy to rapidly generate large, labeled 3D datasets. This approach significantly reduces resource demands and manual intervention for complex engineering simulations, enabling faster iteration in data-driven design.

Key insights

2D latent space augmentation efficiently expands small 3D engineering datasets with physics-based labels.

Principles

Method

Fine-tune a 2D latent diffusion model, synthesize views with VLM filtering, lift 2D images to 3D meshes via a generative foundation model, then automate finite-element label assignment.

In practice

Topics

Best for: AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.