Aug. 31, 2023, 5:28 p.m. | /u/nihit-d

Machine Learning www.reddit.com

Hi everyone,

Wanted to share an open source project we've been working on for the last few weeks: [Autolabel](https://github.com/refuel-ai/autolabel) is an open source Python library to label and enrich text datasets with LLMs (Large Language Models).

**Why?**

Access to clean, labeled data is a huge bottleneck for most ML/data science teams. From [experiments](https://www.refuel.ai/blog-posts/llm-labeling-technical-report) across a variety of NLP tasks and datasets, we have found that the most capable LLMs are able to label data at better quality than human annotators, …

call code csv data data labeling dataset feedback import json labeling labels library llms machinelearning writing

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne