Feb. 13, 2024, 5:46 a.m. | Lizhou Fan, Wenyue Hua, Lingyao Li, Haoyang Ling, Yongfeng Zhang

cs.LG updates on arXiv.org

Complex reasoning ability is one of the most important features of current Large Language Models (LLMs), and it has been leveraged to play an integral role in complex decision-making tasks. Investigating the reasoning capabilities of LLMs is therefore critical, and numerous benchmarks have been established to assess them. However, current benchmarks do not offer a rigorous evaluation of the full extent of the reasoning abilities that LLMs are capable of achieving. They are also prone …

cs.ai cs.cc cs.cl cs.lg

