May 4, 2023, 11 a.m. | Jesus Rodriguez

TheSequence thesequence.substack.com

The new framework builds on the scalability capabilities of DeepSpeed to fine tune LLMs using RLHF.

chat chatgpt deepspeed edge feedback framework human human feedback inside llms microsoft rlhf scalability

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote