Jan. 31, 2024, 3:42 p.m. | Shengchao Liu Weili Nie Chengpeng Wang Jiarui Lu Zhuoran Qiao Ling Liu Jian Tang Chaowei Xiao

cs.CL updates on arXiv.org arxiv.org

There is increasing adoption of artificial intelligence in drug discovery. However, existing studies use machine learning to mainly utilize the chemical structures of molecules but ignore the vast textual knowledge available in chemistry. Incorporating textual knowledge enables us to realize new drug design objectives, adapt to text-based instructions and predict complex biological activities. Here we present a multi-modal molecule structure-text model, MoleculeSTM, by jointly learning molecules' chemical structures and textual descriptions via a contrastive learning strategy. To train MoleculeSTM, we …

adapt adoption artificial artificial intelligence chemistry cs.cl cs.lg design discovery drug design drug discovery editing intelligence knowledge machine machine learning modal molecules multi-modal q-bio.qm retrieval stat.ml studies text textual vast

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Principal Machine Learning Engineer (AI, NLP, LLM, Generative AI)

@ Palo Alto Networks | Santa Clara, CA, United States

Consultant Senior Data Engineer F/H

@ Devoteam | Nantes, France