Data quality is the AI strategy

· Source: Databricks · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics, Cloud Computing & IT Infrastructure · Depth: Advanced, medium

Summary

NYU Langone Health, a prominent academic health system, has strategically modernized its data infrastructure using Databricks' unified data and AI platform, migrating from an on-premises data lake and enterprise data warehouse. This initiative, led by Chief Digital and Information Officer Nader Mherabi, began in 2017 to prepare for emerging AI capabilities, focusing on improving data quality at the source rather than filtering it downstream. The institution has invested in common transactional platforms like a single electronic health record and ERP system, establishing master data sources and unified identifiers to ensure data accuracy and trustworthiness. This foundational work supports a broad community of clinicians, analysts, and scientists across care delivery, operations, and research, enabling real-time clinical decision support and advanced analytics.

Key takeaway

For CTOs and VPs of Engineering building AI-driven health systems, your primary focus must be on upstream data quality and unification. Investing in common transactional platforms and robust data governance from the outset will create a reliable foundation for AI models and real-time decision support, preventing costly downstream data remediation and ensuring patient safety. Embrace the unpredictability of AI by focusing on value creation and continuous education.

Key insights

Prioritizing data quality at the source is fundamental for effective AI and real-time decision support in healthcare.

Principles

Method

Invest in common transactional platforms, establish master data sources, and implement unified identifiers. Use a data catalog like Unity Catalog for discoverability and governance, defining ownership and access controls.

In practice

Topics

Best for: CTO, VP of Engineering/Data, Executive, Director of AI/ML, AI Architect, IT Professional

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Databricks.