New York Times Game Stumps AI Models

OpenAI claims that there are glimmers of AGI in its latest reasoning models. Still, the models currently available on the market, such as o1 from OpenAI, Anthropic’s model (backed by Google and Amazon) and Microsoft’s model, could not solve the New York Times Connections puzzle, which countless people solve every day.

Connections is a deceptively simple word game. You are given 16 terms and have to sort them into groups of four by figuring out what the terms in each group have in common. The commonality could be as simple as book titles or words that start with ‘fire’. In practice, it is a challenging puzzle.

All the models failed to solve the puzzle despite the hype created around them.

o1 at least got some of the groupings right, but its other groupings were bizarre.

It was clear that LLMs work well when regurgitating well-documented information but struggle when facing novel queries.

OpenAI claims that it is close to AGI or has achieved the beginnings of it. Perhaps the company is keeping it under wraps, because this is not a manifestation of AGI at all.
