Topic: benchmarks
Items published with this topic over the last 90 days.
Latest
How Good is Phi-3-Mini for RAG, Routing, Agents
1 day, 15 hours ago |
www.youtube.com
Building-PCC: Building Point Cloud Completion Benchmarks
2 days, 5 hours ago |
arxiv.org
CVPR 2024 Datasets and Benchmarks - Part 1: Datasets
4 days, 3 hours ago |
dev.to
Stability AI Releases 3D Model Generation AI Stable Video 3D
4 days, 12 hours ago |
www.infoq.com
BEST LLMs for Coding, Long Context, Overall Perform
4 days, 13 hours ago |
www.youtube.com
From LLM to NMT: Advancing Low-Resource Machine Translation with Claude
4 days, 20 hours ago |
arxiv.org
Quoting Phi-3 Technical Report
4 days, 22 hours ago |
simonwillison.net
[D] Llama-3 may have just killed proprietary AI models
5 days, 10 hours ago |
www.reddit.com
AI now surpasses humans in almost all performance benchmarks
5 days, 12 hours ago |
www.reddit.com
Penske Introduces Catalyst AI™
1 week, 1 day ago |
ai-techpark.com
Sampling-based Pseudo-Likelihood for Membership Inference Attacks
1 week, 2 days ago |
arxiv.org
Quality Assessment of Prompts Used in Code Generation
1 week, 3 days ago |
arxiv.org
Revealing data leakage in protein interaction benchmarks
1 week, 3 days ago |
arxiv.org
On the Calibration of Multilingual Question Answering LLMs
1 week, 4 days ago |
arxiv.org
Progressive Knowledge Graph Completion
1 week, 4 days ago |
arxiv.org
RankCLIP: Ranking-Consistent Language-Image Pretraining
1 week, 4 days ago |
arxiv.org
AI Competitions and Benchmarks: Dataset Development
1 week, 4 days ago |
arxiv.org
Topic trend (last 90 days)
Top (last 7 days)
Stability AI Releases 3D Model Generation AI Stable Video 3D
4 days, 12 hours ago |
www.infoq.com
AI now surpasses humans in almost all performance benchmarks
5 days, 12 hours ago |
www.reddit.com
Quoting Phi-3 Technical Report
4 days, 22 hours ago |
simonwillison.net
BEST LLMs for Coding, Long Context, Overall Perform
4 days, 13 hours ago |
www.youtube.com
CVPR 2024 Datasets and Benchmarks - Part 1: Datasets
4 days, 3 hours ago |
dev.to
How Good is Phi-3-Mini for RAG, Routing, Agents
1 day, 15 hours ago |
www.youtube.com
[D] Llama-3 may have just killed proprietary AI models
5 days, 10 hours ago |
www.reddit.com
From LLM to NMT: Advancing Low-Resource Machine Translation with Claude
4 days, 20 hours ago |
arxiv.org
Building-PCC: Building Point Cloud Completion Benchmarks
2 days, 5 hours ago |
arxiv.org