March 14, 2024, 4:41 a.m. | Teng Xiao, Chao Cui, Huaisheng Zhu, Vasant G. Honavar

cs.LG updates on arXiv.org arxiv.org

arXiv:2403.08167v1 Announce Type: new
Abstract: Recent advancements in biology and chemistry have leveraged multi-modal learning, integrating molecules and their natural language descriptions to enhance drug discovery. However, current pre-training frameworks are limited to two modalities, and designing a unified network to process different modalities (e.g., natural language, 2D molecular graphs, 3D molecular conformations, and 3D proteins) remains challenging due to inherent gaps among them. In this work, we propose MolBind, a framework that trains encoders for multiple modalities through contrastive …

abstract alignment arxiv biology chemistry cs.cl cs.lg current designing discovery drug discovery frameworks graphs however language modal molecules multi-modal multimodal natural natural language network pre-training process proteins q-bio.qm training type

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Principal Machine Learning Engineer (AI, NLP, LLM, Generative AI)

@ Palo Alto Networks | Santa Clara, CA, United States

Consultant Senior Data Engineer F/H

@ Devoteam | Nantes, France