Feb. 5, 2024, 3:42 p.m. | Zeliang Kan Shae McFadden Daniel Arp Feargus Pendlebury Roberto Jordaney Johannes Kinder Fabio Pierazz

cs.LG updates on arXiv.org arxiv.org

Machine learning (ML) plays a pivotal role in detecting malicious software. Despite the high F1-scores reported in numerous studies reaching upwards of 0.99, the issue is not completely solved. Malware detectors often experience performance decay due to constantly evolving operating systems and attack methods, which can render previously learned knowledge insufficient for accurate decision-making on new inputs. This paper argues that commonly reported results are inflated due to two pervasive sources of experimental bias in the detection task: spatial bias …

attack methods bias classification cs.cr cs.lg cs.pf experience experimental issue machine machine learning malware malware classification operating systems performance pivotal role software space space and time studies systems tesseract

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Principal Data Engineering Manager

@ Microsoft | Redmond, Washington, United States

Machine Learning Engineer

@ Apple | San Diego, California, United States