Feb. 12, 2024, 5:42 a.m. | Ashish Hooda Mihai Christodorescu Miltos Allamanis Aaron Wilson Kassem Fawaz Somesh Jha

cs.LG updates on arXiv.org arxiv.org

Large Language Models' success on text generation has also made them better at code generation and coding tasks. While a lot of work has demonstrated their remarkable performance on tasks such as code completion and editing, it is still unclear as to why. We help bridge this gap by exploring to what degree auto-regressive models understand the logical constructs of the underlying programs. We propose Counterfactual Analysis for Programming Concept Predicates (CACP) as a counterfactual testing framework to evaluate whether …

box bridge code code completion code generation coding concepts cs.ai cs.lg cs.pl cs.se editing gap language language models large language large language models performance programming success tasks text text generation them work

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote