March 5, 2024, 2:43 p.m. | Rui Sun, Lirong Wu, Haitao Lin, Yufei Huang, Stan Z. Li

cs.LG updates on arXiv.org arxiv.org

arXiv:2403.00875v1 Announce Type: cross
Abstract: Augmentation is an effective alternative to utilize the small amount of labeled protein data. However, most of the existing work focuses on design-ing new architectures or pre-training tasks, and relatively little work has studied data augmentation for proteins. This paper extends data augmentation techniques previously used for images and texts to proteins and then benchmarks these techniques on a variety of protein-related tasks, providing the first comprehensive evaluation of protein augmentation. Furthermore, we propose two …

abstract architectures arxiv augmentation benchmark cs.ai cs.lg data design ing paper predictive predictive models pre-training protein proteins q-bio.bm q-bio.qm small tasks training type via work

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Scientist

@ Publicis Groupe | New York City, United States

Bigdata Cloud Developer - Spark - Assistant Manager

@ State Street | Hyderabad, India