underscored

@underscored

2 clips · 1 follower

Follow
Tag:benchmarkingClear

If 78% sounds like a lackluster C+ grade to you, Bhushan explains that a single person, even an expert one, would likely fail the same test, scoring maybe 10% at best. "These are questions that would be complex for a human to even attempt," he tells Upstarts.

Alex Konrad
5d ago

It is increasingly hard to quickly demonstrate each generational change as AI has gotten better, since a lot of the old things AI was bad at, like math or counting letters in words, are now trivial for AI to do.

1mo ago

Underscored — save the words that stop you in your tracks.

Start saving quotes →