May 1, 2024, 4:42 a.m. | Stefan Grafberger, Paul Groth, Sebastian Schelter

cs.LG updates on arXiv.org arxiv.org

arXiv:2404.19591v1 Announce Type: cross
Abstract: Data scientists develop ML pipelines in an iterative manner: they repeatedly screen a pipeline for potential issues, debug it, and then revise and improve its code according to their findings. However, this manual process is tedious and error-prone. Therefore, we propose to support data scientists during this development cycle with automatically derived interactive suggestions for pipeline improvements. We discuss our vision to generate these suggestions with so-called shadow pipelines, hidden variants of the original pipeline …

abstract arxiv code cs.db cs.lg cs.se data data preparation data scientists debug error however improving iterative ml pipelines pipeline pipelines process scientists shadow support type via

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York