all AI news
Computer Vision Meetup: Who needs RLHF When You Have SFT?
DEV Community dev.to
This talk will center around Reinforcement Learning from Human Feedback, and more importantly, “Why” is it even needed over Supervised Fine-Tuning? We will also understand in easy terms some current open problems in RLHF as far as research in academia is concerned.
Speaker: Srishti Gureja is an ML engineer and researcher broadly interested in two things: ML efficiency techniques, including but not limited to designing algorithms that make maximum use of the hardware at hand, and the alignment in LLMs …
academia ai center computer computer vision computervision current datascience easy engineer feedback fine-tuning human human feedback machinelearning meetup ml engineer reinforcement reinforcement learning research rlhf sft speaker supervised fine-tuning talk terms vision will