March 7, 2024, 6:35 p.m. | Kaggle


About this project: The Yoruba-RAG project aims to improve the performance of large language models, such as GPT-3, on questions posed in low-resource languages like Yoruba. The project scrapes a Yoruba blog with Beautiful Soup, stores the data in a text file, and splits it into smaller chunks. To process Yoruba text effectively, the Language-agnostic BERT Sentence Embedding (LaBSE) model is used to embed the chunks, and the results are stored in a Chroma database. This enriched database significantly improves GPT's ability …
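The pipeline described above (scrape → chunk → embed with LaBSE → store in Chroma) can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the blog URL, chunk sizes, and collection name are hypothetical, and the `build_index` function assumes the `requests`, `beautifulsoup4`, `sentence-transformers`, and `chromadb` packages are installed.

```python
from typing import List


def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> List[str]:
    """Split text into overlapping character-based chunks for embedding.

    Overlap helps keep sentences that straddle a chunk boundary
    retrievable from both neighboring chunks.
    """
    chunks = []
    start = 0
    step = chunk_size - overlap
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        start += step
    return chunks


def build_index(url: str) -> None:
    """Hedged sketch of the full pipeline; names and parameters are assumptions."""
    # Third-party imports kept local so the chunker above stays self-contained.
    import requests
    from bs4 import BeautifulSoup
    from sentence_transformers import SentenceTransformer
    import chromadb

    # 1. Scrape the blog page and strip the HTML down to plain text.
    html = requests.get(url).text
    text = BeautifulSoup(html, "html.parser").get_text(separator=" ", strip=True)

    # 2. Split the text into overlapping chunks.
    chunks = chunk_text(text)

    # 3. Embed each chunk with the multilingual LaBSE model.
    model = SentenceTransformer("sentence-transformers/LaBSE")
    embeddings = model.encode(chunks)

    # 4. Store chunks and embeddings in a Chroma collection for retrieval.
    client = chromadb.Client()
    collection = client.create_collection(name="yoruba_blog")  # hypothetical name
    collection.add(
        documents=chunks,
        embeddings=[e.tolist() for e in embeddings],
        ids=[f"chunk-{i}" for i in range(len(chunks))],
    )
```

At query time, the user's question would be embedded with the same LaBSE model and the nearest chunks retrieved from Chroma and passed to GPT as context.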

