Protect Performance and Reduce Surprise Costs with Default Warehouse
Summary
Databricks SQL has introduced Default Warehouse, a new generally available feature designed to streamline ad hoc query execution by allowing administrators to pre-select a specific SQL warehouse. This feature addresses common challenges such as performance degradation, unpredictable costs, and governance issues that arise when users manually select warehouses or when the system defaults to "Last Selected" or alphabetical ordering. Default Warehouse ensures that exploratory queries are routed to the appropriate compute resources, preventing lightweight queries from consuming large production warehouse capacity. It supports various ad hoc SQL surfaces, including SQL Editor, Catalog Explorer, AI/BI Dashboards, Alerts, and Genie Spaces, offering both workspace-level defaults and user-level customization options.
Key takeaway
For CTOs and VP of Engineering overseeing Databricks SQL environments, implementing Default Warehouse is crucial for optimizing cloud spend and improving operational efficiency. By configuring appropriate default warehouses for ad hoc queries, you can significantly reduce unnecessary large warehouse wake-ups and ensure critical production workloads are not impacted by exploratory analysis. This change can lead to more predictable performance and substantial cost savings, as evidenced by customer data showing a rise from 77% to 96% of Catalog Explorer queries going to smaller warehouses.
Key insights
Default Warehouse automates SQL warehouse selection for ad hoc queries, optimizing cost and performance.
Principles
- Automate resource allocation for ad hoc queries.
- Provide flexibility for both admins and power users.
Method
Admins set a workspace-level default SQL warehouse for ad hoc surfaces. Users can customize their own default, with admin oversight, to ensure queries run on intended compute.
In practice
- Set smaller warehouses as default for exploratory queries.
- Monitor cost reductions for ad hoc workloads.
Topics
- Default Warehouse
- Databricks SQL
- SQL Warehouses
- Cost Optimization
- Workload Isolation
Best for: CTO, VP of Engineering/Data, Data Engineer, Director of AI/ML, IT Professional
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Databricks.