Feb. 7, 2024, 5:47 a.m. | Fudan Zheng Jindong Cao Weijiang Yu Zhiguang Chen Nong Xiao Yutong Lu

cs.CV updates on arXiv.org arxiv.org

Most advances in medical image recognition supporting clinical auxiliary diagnosis meet challenges due to the low-resource situation in the medical field, where annotations are highly expensive and professional. This low-resource problem can be alleviated by leveraging the transferable representations of large-scale pre-trained vision-language models via relevant medical text prompts. However, existing pre-trained vision-language models require domain experts to carefully design the medical prompts, which greatly increases the burden on clinicians. To address this problem, we propose a weakly supervised prompt …

advances annotations challenges classification clinical cs.cv diagnosis image image recognition language language models low medical medical field professional prompt prompt learning prompts recognition scale text via vision vision-language models

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne