Feb. 26, 2024, 5:41 a.m. | Charlie Cowen-Breen, Creston Brooks, Robert Calef, Anna Sappington

cs.LG updates on arXiv.org

arXiv:2402.15020v1 Announce Type: new
Abstract: Beam search with masked language models (MLMs) is challenging in part because joint probability distributions over sequences are not readily available, unlike for autoregressive models. Nevertheless, estimating such distributions has applications in many domains, including protein engineering and ancient text restoration. We present probabilistically sound methods for beam search with MLMs. First, we clarify the conditions under which it is theoretically sound to perform text infilling with MLMs using standard beam search. When these conditions fail, …
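
To make the setting concrete, here is a minimal sketch of standard left-to-right beam search for infilling masked positions. It is not the method proposed in the paper. It assumes a hypothetical scorer `cond_logprob(partial, pos, tok)` standing in for an MLM's conditional log-probability of a token at one masked position given the rest of the sequence; the toy scorer, the two-token vocabulary, and all names are illustrative assumptions. Summing these conditional scores treats them as if they were chain-rule factors of a joint distribution, which is the assumption the abstract says holds only under certain conditions.

```python
import math
from typing import Callable, List, Optional, Tuple

Seq = List[Optional[str]]  # None marks a masked position

VOCAB = ["a", "b"]  # hypothetical two-token vocabulary


def toy_cond_logprob(partial: Seq, pos: int, tok: str) -> float:
    """Hypothetical stand-in for an MLM conditional log-probability.

    Favors repeating the token immediately to the left; uniform otherwise.
    """
    left = partial[pos - 1] if pos > 0 else None
    if left is None:
        return math.log(0.5)
    return math.log(0.9 if tok == left else 0.1)


def beam_search_infill(
    seq: Seq,
    cond_logprob: Callable[[Seq, int, str], float],
    vocab: List[str],
    beam_width: int,
) -> List[Tuple[float, Seq]]:
    """Fill masked positions left to right, keeping the top `beam_width` fills."""
    masked = [i for i, t in enumerate(seq) if t is None]
    beams: List[Tuple[float, Seq]] = [(0.0, list(seq))]
    for pos in masked:
        candidates: List[Tuple[float, Seq]] = []
        for score, partial in beams:
            for tok in vocab:
                filled = partial.copy()
                filled[pos] = tok
                # Accumulate conditional scores as if they were chain-rule factors.
                candidates.append((score + cond_logprob(partial, pos, tok), filled))
        candidates.sort(key=lambda c: c[0], reverse=True)
        beams = candidates[:beam_width]
    return beams


best_score, best_seq = beam_search_infill(["a", None, None], toy_cond_logprob, VOCAB, beam_width=2)[0]
print(best_seq, best_score)
```

With this toy scorer the top-ranked fill is `a a a`. In practice the scorer would be replaced by a real MLM's output at the masked position, and whether the accumulated score corresponds to a true joint probability depends on the conditions the paper discusses.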

