Dec. 8, 2023, 4:30 p.m. | Venelin Valkov

Venelin Valkov www.youtube.com

Let's explore "Gemini", Google's new multimodal AI model family. We'll examine how Gemini advances the state-of-the-art in large-scale language modeling, image and audio processing, and video understanding, building upon foundational work in sequence models, neural networks, and machine learning systems. Gemini Ultra sets new records in 30 out of 32 benchmarks, including text and reasoning, image understanding, video understanding, and speech benchmarks. Notably, Gemini Ultra marks a milestone in human-expert performance on MMLU, a benchmark for knowledge and reasoning.

Gemini …

advances ai model architecture art audio building chatgpt dataset explore family gemini gemini ultra google google gemini gpt4 image language learning systems machine machine learning modeling multimodal multimodal ai networks neural networks new multimodal processing records report scale state systems technical training understanding video video understanding work

More from www.youtube.com / Venelin Valkov

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Data Science Analyst

@ Mayo Clinic | AZ, United States

Sr. Data Scientist (Network Engineering)

@ SpaceX | Redmond, WA