MentalMARBERT: Domain-Adaptive Pre-training and Two-Stage Fine-Tuning for Arabic Mental Health Disorders Detection
Summary
A study addresses the challenges of detecting mental health disorders from Arabic social media text, including dialectal variation, informal language, and limited annotated resources. It proposes a two-phase framework for Arabic mental health text classification. Phase 1 involves Domain-Adaptive and Task-Adaptive Pretraining (DAPT and TAPT) of AraBERT, CAMeLBERT, and MARBERT using a large corpus of unlabeled Arabic mental health tweets to identify the most effective backbone model. Phase 2 evaluates the selected model across four configurations, combining single-stage and hierarchical two-stage classification architectures with full fine-tuning and Low-Rank Adaptation (LoRA). The researchers constructed a novel annotated Arabic mental health dataset of 50,670 tweets across six categories, achieving strong inter-annotator agreement (Krippendorff's Alpha = 0.733). Experimental results show that the domain-adapted MARBERT, named MentalMARBERT, significantly improves performance. The hierarchical two-stage architecture with full fine-tuning achieved the best overall performance, with a macro-F1 of 0.861 and an accuracy of 0.877.
Key takeaway
For NLP Engineers developing Arabic mental health detection systems, this research indicates that domain-adaptive pre-training and hierarchical classification are critical for achieving high performance. You should prioritize fine-tuning models like MARBERT on domain-specific data and implement a two-stage classification architecture. This approach, demonstrated by MentalMARBERT's 0.861 macro-F1, offers a robust strategy to overcome dialectal challenges and class imbalance in Arabic social media text.
Key insights
Domain-adaptive pre-training and hierarchical classification significantly improve Arabic mental health disorder detection.
Principles
- Domain-adaptive pre-training enhances model performance.
- Hierarchical classification improves multi-class detection.
- High-quality annotated datasets are crucial.
Method
A two-phase framework: DAPT/TAPT on pre-trained models, then evaluate selected model with hierarchical two-stage classification and full fine-tuning.
In practice
- Apply DAPT/TAPT for domain-specific NLP tasks.
- Consider hierarchical architectures for multi-class problems.
- Develop high-quality, domain-specific datasets.
Topics
- Mental Health Detection
- Arabic NLP
- Domain-Adaptive Pre-training
- Hierarchical Classification
- MARBERT
- Text Classification
Best for: AI Scientist, NLP Engineer, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.