An AI-First Approach to Data Engineering with Lakeflow and Agent Bricks

· Source: Databricks · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics, Software Development & Engineering · Depth: Intermediate, medium

Summary

Databricks Lakeflow is an AI-native engineering platform designed to integrate and productionize AI models directly within ETL workflows using Agent Bricks AI functions. It enables engineers to orchestrate AI workloads at scale, automating complex pipelines while maintaining full enterprise context. Key AI functions include `ai_extract`, `ai_classify`, `ai_translate`, and the recently launched `ai_parse_document`, which transforms unstructured data into structured formats using multimodal foundation models. Lakeflow also offers `ai_query()` for running AI-driven transformations across large datasets with any LLM, leveraging serverless batch inference for faster, cost-efficient processing. The platform supports use cases like generating new data, structuring and organizing data, and improving data quality, as demonstrated by customers like Kard, Banco Bradesco, and Locala.

Key takeaway

For data engineers focused on building reliable, production-grade pipelines, Lakeflow offers a unified platform to embed AI directly into ETL. You can automate complex data processing, extract insights from unstructured data, and orchestrate AI workloads at scale without introducing new complexity. Consider integrating Lakeflow's AI functions to streamline workflows, reduce manual effort, and unlock new business insights from your data.

Key insights

Databricks Lakeflow integrates AI functions directly into ETL workflows for scalable, context-aware data processing.

Principles

Method

Integrate AI functions like `ai_extract`, `ai_classify`, `ai_parse_document`, and `ai_query` into existing ETL workflows. Orchestrate these AI-powered transformations using Lakeflow Jobs for scalable batch processing.

In practice

Topics

Best for: Data Engineer, MLOps Engineer, AI Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Databricks.