A Context-Aware Dataset for Stance Detection in Bioethical Controversies on Reddit

· Source: Computation and Language · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, quick

Summary

BioStance is a new context-aware dataset designed for stance detection in bioethical controversies on social media. It comprises 39,600 annotated Post-Comment pairs extracted from Reddit discussions concerning bioethics. The dataset addresses six controversial targets, spanning three dimensions of bioethical conflict: fundamental value conflicts, individual liberty versus collective responsibility, and technological uncertainty. Each data instance retains its hierarchical conversational context and was labeled by three independent annotators using a three-class stance scheme: Favor, Against, and None. The annotations demonstrate substantial reliability, achieving a mean Krippendorff's α of 0.82. BioStance aims to support research in context-aware stance detection, argument mining, and the computational analysis of bioethical discourse, providing a large-scale, domain-specific resource.

Key takeaway

For NLP Engineers and Research Scientists developing models for stance detection or argument mining in sensitive domains, BioStance offers a critical, large-scale resource. You should consider integrating this context-aware dataset, with its 39,600 annotated Reddit pairs and high annotation reliability (α=0.82), to train and evaluate models. This will improve your system's ability to accurately interpret complex bioethical discourse, moving beyond simple keyword matching to capture nuanced conversational context.

Key insights

BioStance offers a large, context-aware dataset for stance detection in complex bioethical social media discourse.

Principles

In practice

Topics

Best for: AI Scientist, NLP Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.