April 3, 2024, 7:58 p.m. | /u/iordanissh

Machine Learning www.reddit.com

I have tried to do RLHF on my own. Using libraries such as:


[https://huggingface.co/docs/trl/en/index](https://huggingface.co/docs/trl/en/index)


And outside the examples, it doesn't work.

Have anyone successfully used RLHF (outside academia) or is struggling with it as well?

Any specific use-cases you can share would be helpful.

academia cases examples libraries machinelearning rlhf work

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Machine Learning Engineer

@ Apple | Sunnyvale, California, United States