March 18, 2024, 4:43 a.m. | Tomasz Limisiewicz, David Mareček, Tomáš Musil

stat.ML updates on arXiv.org

arXiv:2310.18913v3 Announce Type: replace-cross
Abstract: Large language models are becoming the go-to solution for the ever-growing number of tasks. However, with growing capacity, models are prone to rely on spurious correlations stemming from biases and stereotypes present in the training data. This work proposes a novel method for detecting and mitigating gender bias in language models. We perform causal analysis to identify problematic model components and discover that mid-upper feed-forward layers are most prone to convey bias. Based on the …
