Granite 4.1 3B SVG Pelican Gallery

· Source: Simon Willison's Weblog · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Intermediate, quick

Summary

IBM recently released its Granite 4.1 family of large language models (LLMs) under an Apache 2.0 license, available in 3B, 8B, and 30B parameter sizes. Concurrently, Unsloth published a collection of 21 GGUF-encoded, quantized variants of the 3B Granite 4.1 model, with file sizes ranging from 1.2GB to 6.34GB, totaling 51.3GB. An experiment was conducted using these Unsloth GGUF files to prompt "Generate an SVG of a pelican riding a bicycle" across different quantized sizes. The results showed no discernible pattern linking output quality to model size; all generated SVGs were of poor quality, with even the smallest model producing a better bicycle image than larger variants.

Key takeaway

For AI Engineers evaluating quantized LLMs for creative content generation, you should not assume that larger quantized models will inherently produce higher quality outputs. Your focus should be on empirical testing across various quantization levels and model sizes for your specific use case, as smaller variants might surprisingly outperform larger ones in certain tasks, challenging conventional wisdom about model scaling.

Key insights

Quantization level does not guarantee improved output quality for specific creative tasks.

Method

The experiment involved prompting 21 different GGUF-quantized variants of the Granite 4.1 3B model with the same SVG generation request to compare output quality against model size.

In practice

Topics

Best for: Machine Learning Engineer, Prompt Engineer, AI Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Simon Willison's Weblog.