June 14, 2024, 2:38 p.m. | /u/RaeudigerRaffi

Machine Learning www.reddit.com

I'm excited to share an updated open-source resource we've been working on: an improved version of the Spider dataset, originally published by Yale University for Text2SQL tasks. You can check it out here: [https://huggingface.co/datasets/RaffaSch121/fixed_spider](https://huggingface.co/datasets/RaffaSch121/fixed_spider)

During our own model training at [Turbular](http://www.turbular.com), we identified several issues in the original dataset. To help the community and give back, we decided to address these problems and release a corrected version. We hope this enhanced dataset will benefit everyone working on Text2SQL and similar projects. …

