Your Knowledge Graph Has Amnesia. This Paper From Bosch Fixes It.
Summary
A new paper from Bosch introduces a provenance engine and the PROV-STAR ontology designed to address the "amnesia" inherent in traditional knowledge graphs (KGs). Existing KGs typically lose historical data when triples are updated, making it impossible to track changes over time. This solution, evaluated on 5.15 million triples of Bosch manufacturing data, records every modification in real-time on a separate provenance KG. It utilizes RDF-star to embed provenance metadata directly into individual triples, avoiding reification bloat and enabling the reconstruction of any past version of the entire KG using a single SPARQL-star query. The system demonstrated a storage overhead of only 1.9% for tracking over 10,800 changes.
Key takeaway
For data architects and compliance officers managing critical knowledge graphs, this Bosch solution offers a robust method to overcome data loss from updates. Implementing PROV-STAR allows your organization to maintain a complete, auditable history of all data changes, crucial for regulatory compliance and debugging. You can reconstruct any past state of your KG, significantly enhancing data governance and reliability without substantial storage overhead.
Key insights
PROV-STAR and its provenance engine enable granular, real-time change tracking for RDF knowledge graphs.
Principles
- Provenance metadata should be granular.
- Separate provenance KGs minimize overhead.
Method
The solution employs a provenance engine (middleware), the PROV-STAR ontology, and query transformation to record every change on a separate provenance KG, using RDF-star to attach metadata directly to triples.
In practice
- Track changes in product catalogs.
- Maintain compliance audit trails.
- Reconstruct past data states.
Topics
- Knowledge Graphs
- Data Provenance
- RDF-star
- PROV-STAR Ontology
- SPARQL-star
Best for: CTO, VP of Engineering/Data, Director of AI/ML, Data Engineer, Data Scientist, AI Architect
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by AI Advances - Medium.