underscored — Underscored

@underscored

3 clips · 1 follower

Tag: ai-capabilities

dwarkesh.com

The next big breakthrough will be AIs learning on the job

But one reason that I think is quite underrated, and which also reveals the canyon walls that the river of AI progress will only slowly chip away at, is that it is not enough for a domain to be verifiable. It also has to be very grindable, in the sense that you can run lots of parallel rollouts against a deterministic and replayable simulator.
— Dwarkesh Patel

1mo ago

dwarkesh.com

The sample efficiency black hole

Imagine if it took a couple of decades' worth of courses with hundreds of concurrent professors and millions of practice tasks for you to learn how to polish a Word file. Even the difference in task count understates the gap: the models have to grind each of their far more numerous tasks far harder. Whereas a human student might practice a textbook problem once or twice, GRPO has the model generate hundreds to thousands of rollouts per task.

1mo ago

oneusefulthing.org

Sign of the future: GPT-5.5

It is increasingly hard to quickly demonstrate each generational change as AI has gotten better, since a lot of the old things AI was bad at, like math or counting letters in words, are now trivial for AI to do.

3mo ago
