March 5, 2024, 2:43 p.m. | Rui Sun, Lirong Wu, Haitao Lin, Yufei Huang, Stan Z. Li

cs.LG updates on arXiv.org arxiv.org

arXiv:2403.00875v1 Announce Type: cross
Abstract: Augmentation is an effective alternative to utilize the small amount of labeled protein data. However, most of the existing work focuses on design-ing new architectures or pre-training tasks, and relatively little work has studied data augmentation for proteins. This paper extends data augmentation techniques previously used for images and texts to proteins and then benchmarks these techniques on a variety of protein-related tasks, providing the first comprehensive evaluation of protein augmentation. Furthermore, we propose two …

abstract architectures arxiv augmentation benchmark cs.ai cs.lg data design ing paper predictive predictive models pre-training protein proteins q-bio.bm q-bio.qm small tasks training type via work

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US