June 25, 2024, 4:50 a.m. | Ying Wang, Tim G. J. Rudner, Andrew Gordon Wilson

cs.LG updates on arXiv.org

arXiv:2312.17174v2 Announce Type: replace-cross
Abstract: Vision-language pretrained models have seen remarkable success, but their application to safety-critical settings is limited by their lack of interpretability. To improve the interpretability of vision-language models such as CLIP, we propose a multi-modal information bottleneck (M2IB) approach that learns latent representations that compress irrelevant information while preserving relevant visual and textual features. We demonstrate how M2IB can be applied to attribution analysis of vision-language pretrained models, increasing attribution accuracy and improving the interpretability of …
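To make the idea concrete, below is a minimal sketch of an information-bottleneck-style attribution for a CLIP-like model, in the spirit of the abstract. It is not the authors' M2IB implementation: the helper names (`m2ib_style_attribution`, `project`, `other_emb`), the single-sample tensor shapes, the Gaussian noise bottleneck, and hyperparameters such as `beta` and the step count are all illustrative assumptions. The sketch optimizes per-token mixing weights that trade cross-modal alignment (relevance) against a KL compression penalty, then reads the weights back as an attribution map.

```python
import torch
import torch.nn.functional as F

def m2ib_style_attribution(feat, other_emb, project, beta=0.1, steps=50, lr=0.5):
    """Sketch of an information-bottleneck attribution map (illustrative, not the paper's code).

    feat      : intermediate features of one modality, shape (1, tokens, dim)
    other_emb : pooled embedding of the other modality, shape (1, dim)
    project   : callable mapping (noised) features to a pooled embedding of shape (1, dim)
    """
    # Per-token mixing weights lambda in (0, 1): lambda ~ 1 keeps the feature,
    # lambda ~ 0 replaces it with Gaussian noise (compression).
    logits = torch.zeros(feat.shape[:2], requires_grad=True)
    opt = torch.optim.Adam([logits], lr=lr)
    mu, std = feat.mean(), feat.std()

    for _ in range(steps):
        lam = torch.sigmoid(logits).unsqueeze(-1)              # (1, tokens, 1)
        noise = mu + std * torch.randn_like(feat)
        z = lam * feat + (1.0 - lam) * noise                   # stochastic bottleneck
        emb = F.normalize(project(z), dim=-1)
        fit = (emb * F.normalize(other_emb, dim=-1)).sum()     # cross-modal alignment

        # KL between the bottlenecked Gaussian N(lam*f + (1-lam)*mu, ((1-lam)*std)^2)
        # and the feature-level prior N(mu, std^2): penalizes retained information.
        kl = (-torch.log(1.0 - lam + 1e-6)
              + ((1.0 - lam) ** 2 * std ** 2 + (lam * (feat - mu)) ** 2) / (2 * std ** 2)
              - 0.5).mean()

        loss = -fit + beta * kl
        opt.zero_grad()
        loss.backward()
        opt.step()

    # Tokens that survive compression while preserving alignment score highest.
    return torch.sigmoid(logits).detach()
```

Run once per modality (image patch tokens against the text embedding, and text tokens against the image embedding) to obtain attribution maps for both sides of the model; the actual method and hyperparameters are described in the paper.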

