April 30, 2024, 4:50 a.m. | Jordan Meadows, Tamsin James, Andre Freitas

cs.CL updates on arXiv.org arxiv.org

arXiv:2404.18384v1 Announce Type: new
Abstract: Language models can hallucinate when performing complex and detailed mathematical reasoning. Physics provides a rich domain for assessing mathematical reasoning capabilities where physical context imbues the use of symbols which needs to satisfy complex semantics (\textit{e.g.,} units, tensorial order), leading to instances where inference may be algebraically coherent, yet unphysical. In this work, we assess the ability of Language Models (LMs) to perform fine-grained mathematical and physical reasoning using a curated dataset encompassing multiple notations …

abstract arxiv capabilities context cs.cl domain fine-grained inference instances language language models llm mathematical reasoning physics reasoning semantics type units via

ML/AI Engineer / NLP Expert - Custom LLM Development (x/f/m)

@ HelloBetter | Remote

Doctoral Researcher (m/f/div) in Automated Processing of Bioimages

@ Leibniz Institute for Natural Product Research and Infection Biology (Leibniz-HKI) | Jena

Seeking Developers and Engineers for AI T-Shirt Generator Project

@ Chevon Hicks | Remote

Cloud Data Platform Engineer

@ First Central | Home Office (Remote)

Associate Director, Data Science

@ MSD | USA - New Jersey - Rahway

Data Scientist Sr.

@ MSD | CHL - Santiago - Santiago (Calle Mariano)