Usability Analysis of Configurator User Interfaces with Multimodal Large Language Models

· Source: cs.SE updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering, Emerging Technologies & Innovation · Depth: Expert, extended

Summary

A study investigates the use of multimodal large language models (MLLMs) for semi-automated usability analysis of configurator user interfaces (UIs). Researchers synthesized 18 configurator-specific usability criteria from literature and applied them in an MLLM-based analysis of 16 real-world configurators. Using Google's gemini-2.5-flash model and screen recordings, the approach generated severity ratings and improvement suggestions. The MLLM-based analysis averaged 14.4 seconds per criterion and 258.4 seconds per configurator. A review by six software engineers found 88.5% of issue descriptions and 98.0% of improvement recommendations plausible, confirming MLLMs can reliably identify issues and provide domain-aware suggestions, significantly reducing analysis effort.

Key takeaway

For software engineers developing or maintaining complex configurator UIs, this research suggests integrating MLLM-based analysis into your development workflow. You can significantly reduce manual effort in identifying usability issues by leveraging tools like gemini-2.5-flash with screen recordings and domain-specific criteria. While human validation is still necessary, this approach provides rapid, actionable feedback, accelerating iteration cycles and improving user experience, especially for overlooked aspects like variant comparison.

Key insights

MLLMs can semi-automate configurator UI usability analysis using domain-specific criteria and video input, reducing manual effort.

Principles

Method

Provide an MLLM (e.g., gemini-2.5-flash) with screen recordings of user interactions, 18 configurator-specific usability criteria, and detailed prompts to generate severity ratings and improvement suggestions.

In practice

Topics

Best for: AI Scientist, Research Scientist, Software Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.SE updates on arXiv.org.