If 78% sounds like a lackluster C+ grade to you, Bhushan points out that a single person, even an expert, would likely fail the same test, scoring maybe 10% at best. "These are questions that would be complex for a human to even attempt," he tells Upstarts.
As AI has improved, it has become increasingly hard to demonstrate each generational change quickly, since many of the things AI used to be bad at, like math or counting letters in words, are now trivial for it.