LinguIUTics at PsyDefDetect: Iterative Imbalance-Aware Fine-tuning of Qwen3-8B for Psychological Defense Mechanism Classification

2026-05-30 · Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, quick

Summary

LinguIUTics achieved a macro F1-score of 0.3917 in the PsyDefDetect 2026 shared task for psychological defense mechanism classification, ranking 4th among 21 teams. This represents a 7.7 absolute point (24.4% relative) improvement over the Ministral-8B baseline of 31.48 macro F1. Addressing severe class imbalance that hindered BERT-family encoders and zero-shot LLMs on rare classes, the team employed QLoRA fine-tuning of Qwen3-8B. Their methodology incorporated grouped stratified cross-validation to prevent data leakage, minority-class round-robin lexical augmentation, and a post-processing pipeline featuring logit bias tuning and ensemble blending. These strategies significantly enhanced minority-class recall, boosting the critical "Unclear" class (Level 8) from near-zero performance to an F1 score of 0.797.

Key takeaway

For NLP Engineers developing classification systems with severe class imbalance, particularly in clinical contexts like psychological defense mechanisms, you should prioritize QLoRA fine-tuning of models like Qwen3-8B. Implement grouped stratified cross-validation and minority-class lexical augmentation to improve rare-class recall. Your post-processing pipeline should include logit bias tuning and ensemble blending to achieve substantial performance gains on critical, underrepresented categories.

Key insights

Iterative imbalance-aware fine-tuning of LLMs, combined with augmentation and post-processing, significantly improves rare-class NLP performance.

Principles

Class imbalance severely degrades rare-class performance for BERT-family and zero-shot LLMs.
Grouped stratified cross-validation prevents data leakage in imbalanced datasets.
Combining augmentation, fine-tuning, and post-processing boosts minority-class recall.

Method

QLoRA fine-tuning of Qwen3-8B, grouped stratified cross-validation, minority-class round-robin lexical augmentation, and a post-processing pipeline with logit bias tuning and ensemble blending.

In practice

Apply QLoRA fine-tuning to LLMs for imbalanced text classification.
Use grouped stratified cross-validation for robust model evaluation.
Implement lexical augmentation for underrepresented classes.

Topics

Psychological Defense Mechanisms
Utterance Classification
Class Imbalance
QLoRA Fine-tuning
Qwen3-8B
Lexical Augmentation
Ensemble Blending

Best for: Research Scientist, AI Engineer, AI Scientist, Machine Learning Engineer, NLP Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.