all AI news
Towards Interactively Improving ML Data Preparation Code via "Shadow Pipelines"
May 1, 2024, 4:42 a.m. | Stefan Grafberger, Paul Groth, Sebastian Schelter
cs.LG updates on arXiv.org arxiv.org
Abstract: Data scientists develop ML pipelines in an iterative manner: they repeatedly screen a pipeline for potential issues, debug it, and then revise and improve its code according to their findings. However, this manual process is tedious and error-prone. Therefore, we propose to support data scientists during this development cycle with automatically derived interactive suggestions for pipeline improvements. We discuss our vision to generate these suggestions with so-called shadow pipelines, hidden variants of the original pipeline …
abstract arxiv code cs.db cs.lg cs.se data data preparation data scientists debug error however improving iterative ml pipelines pipeline pipelines process scientists shadow support type via
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Data Engineer
@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US
Research Engineer
@ Allora Labs | Remote
Ecosystem Manager
@ Allora Labs | Remote
Founding AI Engineer, Agents
@ Occam AI | New York