Can AI Replace Developers? Princeton and University of Chicago's SWE-bench Tests AI on Real Coding Issues [N]
Oct. 16, 2023, 1:04 p.m. | /u/AIsupercharged
Machine Learning www.reddit.com
**A New Approach to Evaluating AI Models**
* Researchers use real-world software engineering problems from GitHub to assess language models' coding problem-solving skills.
* SWE-bench, introduced by Princeton and …