July 13, 2023, 10:57 p.m. | /u/Educational_Grass_38

Machine Learning www.reddit.com

Here's a free colab notebook that I've been playing around with to generate JSON datasets from PDFs for fine-tuning LLMs and evaluate outputs/prompts for Toxicity, Bias, Quality etc.

Colab: [https://colab.research.google.com/drive/1KCn1HIeD3fQy8ecT74yHa3xgJZvdNvqL?usp=sharing](https://colab.research.google.com/drive/1KCn1HIeD3fQy8ecT74yHa3xgJZvdNvqL?usp=sharing)

GitHub Repo: [https://github.com/kw2828/guardrail-ml](https://github.com/kw2828/guardrail-ml)

bias colab dataset datasets etc fine-tuning free json llms machinelearning notebook playing prompts quality toxicity

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Scientist

@ Publicis Groupe | New York City, United States

Bigdata Cloud Developer - Spark - Assistant Manager

@ State Street | Hyderabad, India