Dec. 6, 2023, 3:43 p.m. | /u/becausecurious

Artificial Intelligence www.reddit.com

* https://deepmind.google/technologies/gemini/#capabilities

* Benchmarks: https://imgur.com/DWNQcaY ([Table 2 on Page 7](https://storage.googleapis.com/deepmind-media/gemini/gemini_1_report.pdf)) - Gemini Pro (the launched model) is worse than ChatGPT4, but a bit better than GPT3.5. All the examples are for Ultra (actual state of the art outperforming GPT4), which won't be available until 2024.

* Promo video: https://www.youtube.com/watch?v=UIZAiXYceBI (& see other videos on that channel for more)

* Technical paper: https://goo.gle/GeminiPaper

Some details ([source](https://news.ycombinator.com/item?id=38545044)):

- 32k context length

- efficient attention mechanisms (for e.g. multi-query attention (Shazeer, 2019))

- …

32k context artificial attention attention mechanisms audio chen context encoding features figure gemini query speech visual work

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Director, Clinical Data Science

@ Aura | Remote USA

Research Scientist, AI (PhD)

@ Meta | Menlo Park, CA | New York City