Data quality is the AI strategy
Summary
NYU Langone Health, a prominent academic health system, has strategically modernized its data infrastructure using Databricks' unified data and AI platform, migrating from an on-premises data lake and enterprise data warehouse. This initiative, led by Chief Digital and Information Officer Nader Mherabi, began in 2017 to prepare for emerging AI capabilities, focusing on improving data quality at the source rather than filtering it downstream. The institution has invested in common transactional platforms like a single electronic health record and ERP system, establishing master data sources and unified identifiers to ensure data accuracy and trustworthiness. This foundational work supports a broad community of clinicians, analysts, and scientists across care delivery, operations, and research, enabling real-time clinical decision support and advanced analytics.
Key takeaway
For CTOs and VPs of Engineering building AI-driven health systems, your primary focus must be on upstream data quality and unification. Investing in common transactional platforms and robust data governance from the outset will create a reliable foundation for AI models and real-time decision support, preventing costly downstream data remediation and ensuring patient safety. Embrace the unpredictability of AI by focusing on value creation and continuous education.
Key insights
Prioritizing data quality at the source is fundamental for effective AI and real-time decision support in healthcare.
Principles
- Fix data quality at transactional systems first.
- Unified data prevents conflicting metrics.
- Unified governance is critical for AI.
Method
Invest in common transactional platforms, establish master data sources, and implement unified identifiers. Use a data catalog like Unity Catalog for discoverability and governance, defining ownership and access controls.
In practice
- Implement a single EHR and ERP system.
- Define master data sources for patient and financial data.
- Invest in data literacy programs for all users.
Topics
- Data Quality Strategy
- Healthcare AI
- Unified Data Platform
- Real-time Clinical Decision Support
- Data Governance
Best for: CTO, VP of Engineering/Data, Executive, Director of AI/ML, AI Architect, IT Professional
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Databricks.