May 9, 2023, 6:51 p.m. | /u/lewtun

Machine Learning www.reddit.com

Hi folks, it’s Lewis here from the research team at Hugging Face 👋.

We’ve been tinkering with [BigCode’s StarCoder model for code generation](https://huggingface.co/bigcode/starcoder) the last few days and wondered whether it could be turned into a coding assistant with a little bit of fine-tuning.

Somewhat surprisingly, the answer is yes! We fine-tuned StarCoder on two high-quality datasets that have been created by the community:

- [OpenAssistant’s dataset](https://huggingface.co/datasets/OpenAssistant/oasst1) of 40k+ conversations, spanning a diverse range of topics from philosophy to poetry. …

assistant call coding community datasets face hugging face machinelearning multiple quality research research team starcoder team

Data Engineer

@ Bosch Group | San Luis Potosí, Mexico

DATA Engineer (H/F)

@ Renault Group | FR REN RSAS - Le Plessis-Robinson (Siège)

Advisor, Data engineering

@ Desjardins | 1, Complexe Desjardins, Montréal

Data Engineer Intern

@ Getinge | Wayne, NJ, US

Software Engineer III- Java / Python / Pyspark / ETL

@ JPMorgan Chase & Co. | Jersey City, NJ, United States

Lead Data Engineer (Azure/AWS)

@ Telstra | Telstra ICC Bengaluru