LLMs Fail Middle School Word Problems, Say Apple Researchers

bankinfosecurity

AI Mimics Reasoning Without Understanding, Struggles With Irrelevant Data

Rashmi Ramesh (rashmiramesh_) • October 14, 2024

Math is tough. Especially when you lack cognition. (Image: Shutterstock)

Cutting-edge large language models would fail eighth-grade math, say artificial intelligence researchers at Apple - likely because the models mimic the process of reasoning rather than actually engaging in it.

Company researchers tested a handful of large models' ability to handle that bane of word problem solvers everywhere: extraneous information meant to throw off the solution.

OpenAI o1-mini and Llama3-8B fell for the misdirection exactly as a perplexed test-taker would.

"Overall, we find that models tend to convert statements to operations without truly understanding their meaning," researchers wrote in a paper submitted earlier this month.

Among the tests designed to probe LLMs' ability to reason, researchers prompted LLMs with the ...

Copyright of this story solely belongs to bankinfosecurity.
