May 20, 2022, 1:11 a.m. | Cezar Sas, Andrea Capiluppi, Claudio Di Sipio, Juri Di Rocco, Davide Di Ruscio

cs.LG updates on arXiv.org

GitHub is the world's largest host of source code, with more than 150M
repositories. However, most of these repositories are unlabeled or
inadequately labeled, making it harder for users to find relevant projects. There
have been various proposals for software application domain classification over
the past years. However, these approaches lack a well-defined taxonomy that is
hierarchical, grounded in a knowledge base, and free of irrelevant terms. This
work proposes GitRanking, a framework for creating a classification ranked into …

arxiv classification github ranking sampling software topics
