ProtT3: Protein-to-Text Generation for Text-based Protein Understanding
May 22, 2024, 4:47 a.m. | Zhiyuan Liu, An Zhang, Hao Fei, Enzhi Zhang, Xiang Wang, Kenji Kawaguchi, Tat-Seng Chua
cs.CL updates on arXiv.org arxiv.org
Abstract: Language Models (LMs) excel in understanding textual descriptions of proteins, as evident in biomedical question-answering tasks. However, their capability falters with raw protein data, such as amino acid sequences, due to a deficit in pretraining on such data. Conversely, Protein Language Models (PLMs) can understand and convert protein data into high-quality representations, but struggle to process texts. To address their limitations, we introduce ProtT3, a framework for Protein-to-Text Generation for Text-based Protein Understanding. ProtT3 empowers …