The Arc Prize is a programming competition to drive progress towards Artificial General Intelligence (AGI). It is now in its second iteration: ARC-AGI-2.
What I find really interesting is that the latest challenge is really easy for humans, but LLMs have 0% success rate, and other AI reasoning systems get less that 5% success!
If there is ANY proof that the hype over LLMs being ‘Artificial Intelligence’ is somewhat misleading … it’s the fact that current LLMs cannot get anywhere close to a decent success rating.
Take the human test for yourself.
Leave a Reply