For starters, it means you should not take the success of the math and ascribe it to an advance in the LLM, or whatever phrase is actually being used to describe the new fancy target of hype and investment.
An LLM is at best, a possible future component of the speculative future being sold today.
How might future generations visualize this? I'm imagining some ancient Greeks, who have invented an inefficient reciprocating pump, which they declare is a heart and that means they've basically built a person. (At the time, many believed the brain was just there to cool the blood.) Look! The fluid being pumped can move a lever: It's waving to us.
Interesting metaphor, but I’m not sure you’re fully appreciating the hypothetical. The agent didn’t just seem like it was going to solve a math problem; it actually did.
Before intuitive computing, the best we could do with word problems was Wolfram-esque regex stuff, which I’m guessing we all know was quite error-prone. Now, we have agents that can take quite vague word problems and use any sequence of KB/web searches, Python programs, and further intuitive reasoning steps to arrive at the requested answer. That’s pretty impressive, and I don’t think “well technically it relies on tools” makes it less impressive! Something that wasn’t possible yesterday is possible today; that alone matters.
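To make the point concrete, here’s a minimal toy sketch of the tool-use loop being described (entirely hypothetical: the tool names, the fact table, and the “reasoning” are stand-ins, not any real product’s API):

```python
# Toy sketch of an agent answering a word problem by chaining tools:
# a knowledge-base lookup followed by a Python computation.

def kb_search(query):
    # Stand-in for a KB/web search tool: a tiny hard-coded fact table.
    facts = {"speed of sound m/s": 343}
    return facts.get(query)

def run_python(expr):
    # Stand-in for a sandboxed Python tool: evaluate an arithmetic expression
    # with builtins stripped to keep the toy example restricted.
    return eval(expr, {"__builtins__": {}}, {})

def solve(problem):
    # Toy "reasoning" step: recognize one problem shape and pick the tools.
    if "sound" in problem and "2 seconds" in problem:
        speed = kb_search("speed of sound m/s")  # tool call 1: look up a fact
        return run_python(f"{speed} * 2")        # tool call 2: compute with it
    return None

print(solve("How far does sound travel in 2 seconds?"))  # 686
```

The real systems obviously replace the hard-coded branch with learned behavior, but the shape of the loop (vague question in, a sequence of tool calls, answer out) is the part that wasn’t possible yesterday.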
Re: general skepticism, I’ve given up on convincing people that AGI is close, so all I’ll say is “hedge your bets” ;)