Neobank Monzo Builds Governed Data Mesh Across 100 Teams and 12000 dbt Models
Summary
Neobank Monzo has successfully redesigned its data warehouse to support over 100 teams and 12,000 dbt models, implementing a "meshy" approach that reduced warehouse costs by approximately 40% and improved data delivery speed by 25%. This rebuild, initiated in the last year, involved defining clear modeling layers, formalizing cross-team data dependencies with explicit interface models, and enforcing validation of structure, naming, and access patterns through CI. The migration, currently 30% complete, aims to manage distributed data ownership at scale, ensuring performance, consistency, and high quality even with increased contributions, including those from AI-assisted coding. Monzo's architecture relies on automated guardrails and shared tooling, including a command-line tool called Modelgen for generating SQL and YAML models.
Key takeaway
For CTOs or VPs of Engineering managing large-scale data platforms, Monzo's data mesh implementation offers a blueprint for cost reduction and improved data delivery. Your teams should consider adopting explicit data interfaces, automated validation via CI, and a layered data architecture to manage distributed ownership effectively. This approach can mitigate the risks of inconsistent data and escalating costs as your dbt model count grows.
Key insights
Monzo's data mesh approach uses governed interfaces and CI-enforced standards to scale data ownership and reduce costs.
Principles
- Enforce clear data standards.
- Formalize data sharing via explicit interfaces.
- Automate quality checks with CI.
Method
Monzo structures data into four layers: automated landing, generated normalized, logical, and presentation models. Consistency is enforced via Modelgen and CI-backed data standards.
In practice
- Define unique keys for each data model.
- Implement freshness tests for data quality.
- Use CI for strict naming and metadata validation.
Topics
- Data Mesh
- dbt Models
- Data Governance
- CI/CD
- Neobank
Best for: CTO, VP of Engineering/Data, Director of AI/ML, Data Engineer, Analytics Engineer, MLOps Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by InfoQ.