Topic: benchmarks
Items published with this topic over the last 90 days.
Latest
Nvidia Tops Llama 2, Stable Diffusion Speed Trials
1 day, 3 hours ago | spectrum.ieee.org
Introducing DBRX: A New State-of-the-Art Open LLM
1 day, 3 hours ago | www.databricks.com
Steampipe dashboards and benchmarks for your data
1 day, 13 hours ago | www.infoworld.com
InternLM2 Technical Report
1 day, 17 hours ago | arxiv.org
The Unreasonable Ineffectiveness of the Deeper Layers
1 day, 17 hours ago | arxiv.org
Open Source Conversational LLMs do not know most Spanish words
2 days, 17 hours ago | arxiv.org
RakutenAI-7B: Extending Large Language Models for Japanese
2 days, 17 hours ago | arxiv.org
Construction of a Japanese Financial Benchmark for Large Language Models
3 days, 17 hours ago | arxiv.org
The Death of the Static AI Benchmark
5 days, 23 hours ago | towardsdatascience.com
Reference-based Metrics Disprove Themselves in Question Generation
1 week, 1 day ago | arxiv.org
Evaluating Language Model Agency through Negotiations
1 week, 2 days ago | arxiv.org
Deep learning for dynamic graphs: models and benchmarks
1 week, 2 days ago | arxiv.org
Do CLIPs Always Generalize Better than ImageNet Models?
1 week, 2 days ago | arxiv.org
Self-Consistency Boosts Calibration for Math Reasoning
1 week, 3 days ago | arxiv.org
Renovating Names in Open-Vocabulary Segmentation Benchmarks
1 week, 6 days ago | arxiv.org
Topic trend (last 90 days)
Top (last 7 days)
Introducing DBRX: A New State-of-the-Art Open LLM
1 day, 3 hours ago | www.databricks.com
Construction of a Japanese Financial Benchmark for Large Language Models
3 days, 17 hours ago | arxiv.org
The Death of the Static AI Benchmark
5 days, 23 hours ago | towardsdatascience.com
Nvidia Tops Llama 2, Stable Diffusion Speed Trials
1 day, 3 hours ago | spectrum.ieee.org
Steampipe dashboards and benchmarks for your data
1 day, 13 hours ago | www.infoworld.com
Open Source Conversational LLMs do not know most Spanish words
2 days, 17 hours ago | arxiv.org
InternLM2 Technical Report
1 day, 17 hours ago | arxiv.org