Topic: evaluation
Items published with this topic over the last 90 days.
Latest
Benchmarking LLMs via Uncertainty Quantification
1 day, 11 hours ago | arxiv.org
ICDM 2020 Knowledge Graph Contest: Consumer Event-Cause Extraction
1 day, 22 hours ago | arxiv.org
Narrative Action Evaluation with Prompt-Guided Multimodal Interaction
3 days, 11 hours ago | arxiv.org
Developing VTable Custom Edit Component with React
4 days, 6 hours ago | dev.to
Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models
4 days, 11 hours ago | arxiv.org
Evaluating Retrieval Quality in Retrieval-Augmented Generation
4 days, 11 hours ago | arxiv.org
LLM Evaluators Recognize and Favor Their Own Generations
4 days, 11 hours ago | arxiv.org
AutoAD III: The Prequel -- Back to the Pixels
4 days, 11 hours ago | arxiv.org
CrossScore: Towards Multi-View Image Evaluation and Scoring
4 days, 11 hours ago | arxiv.org
Holistic Safety and Responsibility Evaluations of Advanced AI Models
4 days, 11 hours ago | arxiv.org
The Solution for the CVPR2024 NICE Image Captioning Challenge
5 days, 11 hours ago | arxiv.org
Top (last 7 days)
Benchmarking LLMs via Uncertainty Quantification
1 day, 11 hours ago | arxiv.org
ICDM 2020 Knowledge Graph Contest: Consumer Event-Cause Extraction
1 day, 22 hours ago | arxiv.org
LLM Evaluators Recognize and Favor Their Own Generations
4 days, 11 hours ago | arxiv.org
Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models
4 days, 11 hours ago | arxiv.org
Developing VTable Custom Edit Component with React
4 days, 6 hours ago | dev.to
The Solution for the CVPR2024 NICE Image Captioning Challenge
5 days, 11 hours ago | arxiv.org
Evaluating Retrieval Quality in Retrieval-Augmented Generation
4 days, 11 hours ago | arxiv.org
Holistic Safety and Responsibility Evaluations of Advanced AI Models
4 days, 11 hours ago | arxiv.org
Narrative Action Evaluation with Prompt-Guided Multimodal Interaction
3 days, 11 hours ago | arxiv.org