CreatiParser: Generative Image Parsing of Raster Graphic Designs into Editable Layers

· Source: Takara TLDR - Daily AI Papers · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Generative Design & Graphic Parsing · Depth: Expert, medium

Summary

CreatiParser is a novel hybrid generative framework designed for raster-to-layer graphic design parsing, enabling the decomposition of a design image into editable text, background, and sticker layers. Unlike traditional multi-stage pipelines that suffer from error accumulation, CreatiParser uses a vision-language model to parse text regions into a rendering protocol for faithful reconstruction and flexible re-editing. Background and sticker layers are generated via a multi-branch diffusion architecture with RGBA support. The framework incorporates ParserReward and Group Relative Policy Optimization to align generation quality with human design preferences. Evaluated on the Parser-40K and Crello datasets, CreatiParser demonstrated superior performance, achieving an average improvement of 23.7% across all metrics compared to existing methods.

Key takeaway

For research scientists developing graphic design tools, CreatiParser offers a robust approach to transform static raster images into editable, layered designs. You should consider integrating hybrid generative frameworks and vision-language models to overcome limitations of multi-stage pipelines, enhancing both controllability and downstream editing capabilities in your systems. This method significantly improves decomposition quality and editability.

Key insights

CreatiParser decomposes raster graphic designs into editable layers using a hybrid generative framework for enhanced editing.

Principles

Method

CreatiParser parses text with a vision-language model, generates background/sticker layers via multi-branch RGBA diffusion, and refines output using ParserReward with Group Relative Policy Optimization.

In practice

Topics

Code references

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Creative Technologist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Takara TLDR - Daily AI Papers.