March 29, 2024, 4:44 a.m. | Anees Ur Rehman Hashmi, Dwarikanath Mahapatra, Mohammad Yaqub

cs.CV updates on arXiv.org

arXiv:2403.18996v1 Announce Type: new
Abstract: Explaining deep learning models is becoming increasingly important as new multimodal models emerge daily, particularly in safety-critical domains like medical imaging. However, the lack of detailed investigations into how explainability methods perform on these models is widening the gap between their development and safe deployment. In this work, we analyze the performance of various explainable AI methods on a vision-language model, MedCLIP, to demystify its inner workings. We also provide a simple …
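To make the setup concrete, the sketch below shows one common way an explainability method can be applied to a CLIP-style vision-language model such as MedCLIP: a gradient-based saliency map over the image-text similarity score. The `vlm_similarity` callable is a hypothetical stand-in for the model's scoring function, not the actual MedCLIP API, and the paper evaluates several such methods rather than this specific one.

```python
# Minimal sketch, assuming a generic vision-language model exposed as a
# callable `vlm_similarity(image, text_tokens) -> scalar score`.
# This is NOT the MedCLIP API; it only illustrates the general recipe of
# gradient-based saliency for an image-text similarity model.
import torch


def saliency_map(vlm_similarity, image: torch.Tensor, text_tokens) -> torch.Tensor:
    """Return |d(similarity)/d(pixel)| as a coarse visual explanation.

    image: tensor of shape (1, C, H, W); text_tokens: whatever the model expects.
    """
    image = image.clone().requires_grad_(True)   # track gradients w.r.t. pixels
    score = vlm_similarity(image, text_tokens)   # scalar image-text similarity
    score.backward()                             # gradients flow back to the image
    # Aggregate gradient magnitude over channels to get an H x W heatmap.
    return image.grad.abs().max(dim=1).values.squeeze(0)
```

The resulting heatmap can then be overlaid on the input scan to inspect which regions drive the model's image-text alignment, which is the kind of behavior the paper's analysis probes.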

