June 30, 2023, 3 p.m. | Venelin Valkov

Venelin Valkov www.youtube.com

Are Open LLMs any good when it comes to longer texts?

In this video, we dive into the world of Long Sequence Modeling, exploring a 7B LLM, XGen-7B. With an impressive 8K input sequence length and fine-tuning on public-domain instructional data, XGen-7B promises a competition against state-of-the-art LLMs. We'll look at performance on standard NLP benchmarks, long sequence modeling tasks, and code generation.

I'll take you through the process of loading the instruction model in a Google Colab Notebook and …

code colab competition data dataset fine-tuning good google llm llms modeling overview public tokens video world

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US