Categories: Apple

Apple’s study proves that LLM-based AI models are flawed because they cannot reason

A new paper from Apple’s artificial intelligence scientists has found that engines based on large language models, such as those from Meta and OpenAI, still lack basic reasoning skills.

Apple plans to introduce its own version of AI starting with iOS 18.1 – image credit Apple

The group has proposed a new benchmark, GSM-Symbolic, to help others measure the reasoning capabilities of various large language models (LLMs). Their initial testing reveals that slight changes in the wording of queries can result in significantly different answers, undermining the reliability of the models.

The group investigated the “fragility” of mathematical reasoning by adding contextual information to their queries that a human could understand, but which should not affect the fundamental mathematics of the solution. This resulted in varying answers, which shouldn’t happen.

Continue Reading on AppleInsider | Discuss on our Forums

Source: AppleInsider News

AddThis Website Tools
WBN

Share
Published by
WBN

Recent Posts

The EU is betraying its citizens and weakening privacy for political gainThe EU is betraying its citizens and weakening privacy for political gain

The EU is betraying its citizens and weakening privacy for political gain

Maybe Apple will never fully walk away from Europe, but the European Commission has just…

14 hours ago
Warner Bros.’ Shelved Coyote vs. Acme Feature May Get a Second ChanceWarner Bros.’ Shelved Coyote vs. Acme Feature May Get a Second Chance

Warner Bros.’ Shelved Coyote vs. Acme Feature May Get a Second Chance

A new report says Ketchup Entertainment, which picked up The Day the Earth Blew Up:…

14 hours ago

Used Car of the Day: 1968 Ford Mustang

We're staying old school today with this 1968 Ford Mustang. This one has had the…

14 hours ago

Trump Threatens to Defund the NYC Subway

Transportation Secretary Sean Duffy is demanding a "safety plan" from the city.

14 hours ago

Apple will launch new ‘homeOS’ this year, here’s what’s coming

Apple has a big software year ahead, with major redesigns coming to iOS 19, macOS…

14 hours ago

Elon Musk Is Joining Microsoft in $30 Billion Data Center Project

The biggest backer of OpenAI, Microsoft is now building its own AI models and teaming…

15 hours ago