Nov. 5, 2023, 6:42 a.m. | Nikhil Shenoy, Prudencio Tossou, Emmanuel Noutahi, Hadrien Mary, Dominique Beaini, Jiarui Ding

cs.LG updates on arXiv.org arxiv.org

In the field of Machine Learning Interatomic Potentials (MLIPs),
understanding the intricate relationship between data biases, specifically
conformational and structural diversity, and model generalization is critical
in improving the quality of Quantum Mechanics (QM) data generation efforts. We
investigate these dynamics through two distinct experiments: a fixed budget
one, where the dataset size remains constant, and a fixed molecular set one,
which focuses on fixed structural diversity while varying conformational
diversity. Our results reveal nuanced patterns in generalization metrics.
Notably, …

arxiv biases budget data diversity dynamics machine machine learning model generalization physics quality quantum quantum mechanics relationship role through understanding

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Lead Data Modeler

@ Sherwin-Williams | Cleveland, OH, United States