all AI news
Meet StableVicuna, The First Large-Scale Open-Source RLHF Chatbot by Stability AI
Stories by ODSC - Open Data Science on Medium medium.com
The development and release of chatbots have been significant in recent months. Open-source alternatives have further fueled interest in tuning large language models for a chat. However, there is a lack of open-source models that have applied both instruction finetuning and reinforcement learning through human feedback (RLHF) training.
In a blog post, Stability AI introduced StableVicuna, the first large-scale open-source chatbot trained via reinforcement learning through human feedback or RLHF. It is a further instruction fine-tuned and RLHF-trained version …
artificial intelligence chat chatbot chatbots data science development feedback finetuning human human feedback language language models large language models open-data reinforcement reinforcement learning release rlhf scale stability ai stablevicuna through training