I just one-shot it with Claude Code (Opus 4.5) using this prompt. It took about 5...

nl · 2025-12-06T03:12:28 1764990748

Programs can solve mazes and LLMs can program. That's a different thing completely.

JamesSwift · 2025-12-06T03:37:58 1764992278

That just seems like an arbitrary limitation. It's like asking someone to do a math calculation but "no thinking allowed". Like, I guess we can gauge if a model just _knows all knowable things in the universe_ using that method... but anything of any value that you are gauging in terms of 'intelligence' is actually going to be validating their ability to go "outside the scope" of what they actually are (an autocomplete on steroids).

flyinglizard · 2025-12-06T04:12:54 1764994374

We know there are very simple maze-solving algorithms you could code in a few lines of Python, but no one could claim that constitutes intelligence. The difference is between applying intuitive logic and using a predetermined tool.
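
To make "a few lines of Python" concrete, here is a minimal sketch of one such algorithm, breadth-first search. The grid encoding (`#` for walls, `S` for start, `E` for exit) and the name `solve_maze` are assumptions for illustration only, not the format of the puzzle discussed in this thread.

```python
from collections import deque


def solve_maze(grid):
    """Return a shortest path from 'S' to 'E' as a list of (row, col), or None.

    `grid` is a list of equal-purpose strings; '#' is a wall and anything
    else is walkable (hypothetical encoding chosen for this sketch).
    """
    start = end = None
    for r, row in enumerate(grid):
        for c, ch in enumerate(row):
            if ch == "S":
                start = (r, c)
            elif ch == "E":
                end = (r, c)
    if start is None or end is None:
        return None

    prev = {start: None}  # maps each visited cell to the cell it was reached from
    queue = deque([start])
    while queue:
        cell = queue.popleft()
        if cell == end:
            # Walk the predecessor links back to the start, then reverse.
            path = []
            while cell is not None:
                path.append(cell)
                cell = prev[cell]
            return path[::-1]
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            inside = 0 <= nr < len(grid) and 0 <= nc < len(grid[nr])
            if inside and grid[nr][nc] != "#" and (nr, nc) not in prev:
                prev[(nr, nc)] = cell
                queue.append((nr, nc))
    return None  # exit is unreachable


if __name__ == "__main__":
    maze = [
        "S.#",
        ".##",
        "..E",
    ]
    print(solve_maze(maze))
```

Nothing here involves reasoning about the particular maze: the search mechanically expands cells in order of distance from the start, which is the sense in which it is a predetermined tool.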

esafak · 2025-12-05T22:42:50 1764974570

If you allow tool use, much simpler models can solve it.