XPASS-Vis: A Dataset for Cross-Domain Personalized Image Aesthetic Assessment

· Source: Computer Vision and Pattern Recognition · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Computer Vision · Depth: Expert, quick

Summary

XPASS-Vis is introduced as the first dataset specifically designed for cross-domain Personalized Image Aesthetic Assessment (PIAA), addressing the limitation of existing datasets confined to single domains or lacking sufficient per-annotator samples. This new dataset comprises 6,526 stimuli across three visual domains—art, fashion, and landscape—and features ratings from 129 annotators, resulting in 87,836 user-stimulus interactions. Each interaction includes an overall aesthetic score and nine aesthetic-emotion ratings. Crucially, each annotator provided over 200 ratings per domain, facilitating robust personalization both within and across domains. The research also establishes baseline models for cross-domain PIAA using unsupervised domain adaptation (UDA), demonstrating that the top-performing UDA method recovers approximately 60% (Spearman's ρ = .28) of the supervised upper bound, indicating meaningful transferability of preferences while highlighting a need for PIAA-specific adaptation strategies.

Key takeaway

For AI Scientists and Computer Vision Engineers developing personalized image aesthetic assessment models, XPASS-Vis offers a critical resource for exploring cross-domain transferability. You should leverage this dataset to train and evaluate models that generalize aesthetic preferences across diverse visual domains like art, fashion, and landscape. This enables more robust PIAA systems, but be aware that significant gaps remain, necessitating the development of PIAA-specific adaptation strategies to achieve supervised performance levels.

Key insights

Personalized aesthetic preferences are meaningfully transferable across visual domains, despite individual subjectivity.

Principles

Method

Baseline models for cross-domain PIAA are established via unsupervised domain adaptation (UDA), transferring models from a labeled source to an unlabeled target domain.

In practice

Topics

Best for: AI Scientist, Computer Vision Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.