April 2, 2024, 7:47 p.m. | Fan Bai, Yuxin Du, Tiejun Huang, Max Q. -H. Meng, Bo Zhao

cs.CV updates on arXiv.org arxiv.org

arXiv:2404.00578v1 Announce Type: new
Abstract: Medical image analysis is essential to clinical diagnosis and treatment, which is increasingly supported by multi-modal large language models (MLLMs). However, previous research has primarily focused on 2D medical images, leaving 3D images under-explored, despite their richer spatial information. This paper aims to advance 3D medical image analysis with MLLMs. To this end, we present a large-scale 3D multi-modal medical dataset, M3D-Data, comprising 120K image-text pairs and 662K instruction-response pairs specifically tailored for various 3D …

analysis arxiv cs.cv image language language models large language large language models medical modal multi-modal type

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Analyst (Commercial Excellence)

@ Allegro | Poznan, Warsaw, Poland

Senior Machine Learning Engineer

@ Motive | Pakistan - Remote

Summernaut Customer Facing Data Engineer

@ Celonis | Raleigh, US, North Carolina

Data Engineer Mumbai

@ Nielsen | Mumbai, India