datasette-extract 0.3a0

Source: Simon Willison's Weblog · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Intermediate, quick

Summary

Datasette-extract 0.3a0, released on April 1st, 2026, is an alpha release of the Datasette plugin for importing unstructured data, including text and images, into structured tables. The release now uses datasette-llm to manage model configuration, so you can control which large language models (LLMs) are available for extraction tasks. Model availability is governed by the `extract` purpose in datasette-llm's model configuration.

Key takeaway

For data engineers and analysts working with Datasette, this update means you can control which LLMs are offered for extracting structured data from unstructured text and images. Configure the permitted models through datasette-llm's `extract` purpose. Model choice for extraction is then handled in the same place as the rest of your datasette-llm configuration.

Key insights

Datasette-extract 0.3a0 integrates with datasette-llm for configurable unstructured data extraction.

Method

Datasette-extract uses datasette-llm to define and manage LLM configurations for data extraction, specifying models via the `extract` purpose in LLM model settings.
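The idea of gating models by purpose can be illustrated with a minimal sketch. It does not use datasette-llm's real API or configuration schema, which the source does not show. `ModelEntry`, `models_for_purpose`, `REGISTRY`, and the model IDs are all hypothetical names invented for illustration. The only detail taken from the release note is the purpose name `extract`.

```python
from dataclasses import dataclass, field


# Hypothetical registry entry. This is NOT datasette-llm's configuration
# schema, only an illustration of purpose-based model filtering.
@dataclass(frozen=True)
class ModelEntry:
    model_id: str
    purposes: frozenset = field(default_factory=frozenset)


def models_for_purpose(registry, purpose):
    """Return the IDs of models allowed for the given purpose, in registry order."""
    return [m.model_id for m in registry if purpose in m.purposes]


# Made-up model IDs; a real deployment would list whatever models it has configured.
REGISTRY = [
    ModelEntry("example-vision-model", frozenset({"extract", "chat"})),
    ModelEntry("example-small-model", frozenset({"extract"})),
    ModelEntry("example-chat-only-model", frozenset({"chat"})),
]

# Only models tagged with the "extract" purpose would be offered for extraction.
extract_models = models_for_purpose(REGISTRY, "extract")
print(extract_models)
```

In this sketch a model tagged only for `chat` never appears in the extraction list, which is the kind of control the release note describes. How datasette-llm actually expresses it is left to its own documentation.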

Best for: AI Engineer, NLP Engineer, Computer Vision Engineer, Data Scientist, Data Engineer, Machine Learning Engineer

Editorial summary, takeaway, and curation by AIssential. Original article published by Simon Willison's Weblog.