Data Quality in Crowdsourcing and Spamming Behavior Detection

April 30, 2024, 4:42 a.m. | Yang Ba, Michelle V. Mancenido, Erin K. Chiou, Rong Pan

cs.LG updates on arXiv.org arxiv.org

arXiv:2404.17582v1 Announce Type: cross
Abstract: As crowdsourcing emerges as an efficient and cost-effective method for obtaining labels for machine learning datasets, it is important to assess the quality of crowd-provided data in order to improve analysis performance and reduce biases in subsequent machine learning tasks. Because ground truth is unavailable in most crowdsourcing settings, we define data quality in terms of annotators' consistency and credibility. Unlike the simple scenarios where the Kappa coefficient and the intraclass correlation coefficient can usually be applied, …
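
For readers unfamiliar with the Kappa coefficient mentioned in the abstract, the following is a minimal sketch of Cohen's kappa for the simple two-annotator case. It is not the method proposed in the paper, which the truncated abstract does not describe; the function name `cohen_kappa` and the toy labels are hypothetical and chosen only for illustration.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators who labeled the same items.

    Illustrative helper (hypothetical name), not taken from the paper.
    """
    if len(labels_a) != len(labels_b) or len(labels_a) == 0:
        raise ValueError("label sequences must be non-empty and of equal length")
    n = len(labels_a)

    # Observed agreement: fraction of items given the same label by both.
    p_o = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Chance agreement, from each annotator's marginal label frequencies.
    freq_a = Counter(labels_a)
    freq_b = Counter(labels_b)
    p_e = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)

    if p_e == 1.0:
        # Both annotators used one identical label throughout; kappa is undefined.
        return float("nan")
    return (p_o - p_e) / (1.0 - p_e)


# Hypothetical toy data: two annotators labeling six items.
annotator_1 = ["pos", "pos", "neg", "neg", "pos", "neg"]
annotator_2 = ["pos", "neg", "neg", "neg", "pos", "pos"]
kappa = cohen_kappa(annotator_1, annotator_2)
print(round(kappa, 3))
```

Here the annotators agree on 4 of 6 items (observed agreement 2/3) while chance agreement is 1/2, so kappa is 1/3. A statistic like this requires every annotator to label the same items, which is one reason it fits only simple scenarios.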
