June 23, 2022, 1:10 a.m. | Marcell Stippinger, Dávid Hanák, Marcell T. Kurbucz, Gergely Hanczár, Olivér M. Törteli, Zoltán Somogyvári

cs.LG updates on arXiv.org arxiv.org

The lack of freely available (real-life or synthetic) high or ultra-high
dimensional, multi-class datasets may hamper the rapidly growing research on
feature screening, especially in the field of biometrics, where the usage of
such datasets is common. This paper reports a Python package called
BiometricBlender, which is an ultra-high dimensional, multi-class synthetic
data generator to benchmark a wide range of feature screening methods. During
the data generation process, the overall usefulness and the intercorrelations
of blended features can be controlled …

arxiv biometric data feature generator lg space synthetic data

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Analytics & Insight Specialist, Customer Success

@ Fortinet | Ottawa, ON, Canada

Account Director, ChatGPT Enterprise - Majors

@ OpenAI | Remote - Paris