April 17, 2024, 4:42 a.m. | Noah Lewis, Jean Luca Bez, Suren Byna

cs.LG updates on arXiv.org arxiv.org

arXiv:2404.10386v1 Announce Type: cross
Abstract: High-Performance Computing (HPC) systems excel in managing distributed workloads, and the growing interest in Artificial Intelligence (AI) has resulted in a surge in demand for faster methods of Machine Learning (ML) model training and inference. In the past, research on HPC I/O focused on optimizing the underlying storage system for modeling and simulation applications and checkpointing the results, causing writes to be the dominant I/O operation. These applications typically access large portions of the data …

abstract applications artificial artificial intelligence arxiv computing cs.ai cs.dc cs.lg demand distributed excel faster hpc inference intelligence machine machine learning machine learning applications performance research survey systems training type workloads

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Senior Data Scientist

@ ITE Management | New York City, United States