16

SciBench, NLP Research

Benchmarking scientific problem solving abilities of LLMs.

My team and I parsed tens of scientific textbooks to understand mathematical reasoning patterns in LLM problem solving. Do LLMs solve problems without understanding them? Or do they have complex reasoning capabilities?