March 18, 2024, 4:43 a.m. | Tomasz Limisiewicz, David Mareček, Tomáš Musil

stat.ML updates on arXiv.org (arxiv.org)

arXiv:2310.18913v3 Announce Type: replace-cross
Abstract: Large language models are becoming the go-to solution for an ever-growing number of tasks. However, with growing capacity, models are prone to relying on spurious correlations stemming from biases and stereotypes present in the training data. This work proposes a novel method for detecting and mitigating gender bias in language models. We perform causal analysis to identify problematic model components and discover that mid-upper feed-forward layers are most prone to conveying bias. Based on the …
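To make the causal-analysis idea concrete, here is a minimal sketch (not the paper's exact procedure) of one common style of intervention: zero-ablate the feed-forward (MLP) sub-layers of a mid-upper band of transformer blocks and observe how the gap between gendered continuations changes. The model choice (gpt2), the prompt, the layer range, and zero-ablation itself are illustrative assumptions, not the authors' method.

```python
# Hypothetical sketch: measure a gendered log-probability gap before and after
# ablating the MLP outputs of mid-upper transformer layers in GPT-2.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

prompt = "The nurse said that"          # illustrative prompt, an assumption
he_id = tokenizer(" he")["input_ids"][0]
she_id = tokenizer(" she")["input_ids"][0]

def gender_gap():
    """Log-probability gap between ' he' and ' she' as the next token."""
    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits[0, -1]
    logprobs = torch.log_softmax(logits, dim=-1)
    return (logprobs[he_id] - logprobs[she_id]).item()

baseline_gap = gender_gap()

# Ablate the feed-forward sub-layers of blocks 6-9 (of GPT-2's 12 blocks);
# the band is an assumed stand-in for "mid-upper layers".
def zero_mlp(module, inputs, output):
    return torch.zeros_like(output)

hooks = [model.transformer.h[i].mlp.register_forward_hook(zero_mlp)
         for i in range(6, 10)]
ablated_gap = gender_gap()
for h in hooks:
    h.remove()

print(f"gap before ablation:            {baseline_gap:+.3f}")
print(f"gap after ablating mid-upper MLPs: {ablated_gap:+.3f}")
```

A large shrink in the gap after ablating only those layers would point to the feed-forward components of that band as carriers of the bias, which is the kind of localization the abstract describes.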

