March 7, 2024, 5:46 a.m. | Shuchang Ye, Mingyuan Meng, Mingjian Li, Dagan Feng, Jinman Kim

cs.CV updates on arXiv.org arxiv.org

arXiv:2401.13267v2 Announce Type: replace
Abstract: With increasing reliance on medical imaging in clinical practices, automated report generation from medical images is in great demand. Existing report generation methods typically adopt an encoder-decoder deep learning framework to build a uni-directional image-to-report mapping. However, such a framework ignores the bi-directional mutual associations between images and reports, thus incurring difficulties in associating the intrinsic medical meanings between them. Recent generative representation learning methods have demonstrated the benefits of dual-modal learning from both image …

abstract arxiv automated build clinical cs.cv decoder deep learning deep learning framework demand dynamic encoder encoder-decoder framework however image images imaging mapping medical medical imaging modal practices reliance report type

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Director, Clinical Data Science

@ Aura | Remote USA

Research Scientist, AI (PhD)

@ Meta | Menlo Park, CA | New York City