Connecting the Models: A Global Mega-model of MDE Projects on GitHub
Summary
A new study introduces ModelGraph, a global mega-model and dataset derived from 7,436 GitHub projects containing over 325,000 Model-Driven Engineering (MDE) artefacts. Published in March 2026, this resource details relationships between MDE artefacts like Ecore, ATL, and Xtext. The methodology involved mining GitHub, recovering project-specific mega-models, and constructing a global mega-model by analyzing near-duplicates. This comprehensive dataset, available with a web-based exploration tool, aims to foster empirical analysis of MDE tool usage and inter-project dependencies. It addresses a long-standing gap in understanding MDE in practice.
Key takeaway
For Research Scientists studying Model-Driven Engineering, this ModelGraph dataset offers an unprecedented resource to empirically analyze MDE tool usage and inter-project dependencies. You can explore technology combinations, identify reuse patterns, and train specialized AI models, overcoming previous data limitations. Utilize the provided web tool and APIs to investigate MDE project maturity and evolution.
Key insights
A global mega-model and dataset reveals MDE artefact relationships and reuse patterns across 7,436 GitHub projects.
Principles
- MDE lacks package managers, leading to copy-paste reuse.
- Implicit artefact dependencies require specialized recovery methods.
Method
Mine GitHub for MDE artefacts, recover project-level mega-models by analyzing dependencies, then merge into a global mega-model via near-duplicate detection.
In practice
- Explore MDE project health and technology combinations.
- Train AI models using deduplicated MDE artefacts.
- Identify central hubs and structural patterns.
Topics
- Model-Driven Engineering
- GitHub Mining
- Mega-models
- EMF Ecosystem
- Artefact Duplication
- Empirical Software Engineering
Code references
- utwente-fmt/attop
- upohl/mechatronicuml
- fraunhofer-iem/uppaal-model
- fraunhofer-iem/mechatronicuml
- NCIP/lexevs
Best for: AI Scientist, Research Scientist, Software Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.SE updates on arXiv.org.