Just dumping the raw DOM into the LLM context is brutal on token usage. We've se...

apitman · 2025-08-27T01:51:46 1756259506

Maybe people will start making simpler/smaller websites in order to work better with AI tools. That would be nice.

pishpash · 2025-08-27T05:28:26 1756272506

You just need to capture the rendering and represent that.

commanderkeen08 · 2025-08-27T01:28:07 1756258087

Playwrights MCP went had a strong idea to default to the accessibility tree instead of DOM. Unfortunately, even that is pretty chonky.

kodefreeze · 2025-08-27T06:26:06 1756275966

This is really interesting. We've been working on a smaller set of this problem space. We've also found in some cases you need to somehow pass to the model the sequence of events that happen (like a video of a transition).

For instance, we were running a test case on a e commerce website and they have a random popup that used to come up after initial Dom was rendered but before action could be taken. This would confuse the LLM for the next action it needed to take because it didn't know the pop-up came up.

edg5000 · 2025-08-27T06:24:08 1756275848

It could work simmilar to Claude Code right? Where it won't ingest the entire codebase, rather search for certain strings or start looking at a directed location and follow references from there. Indeed it seems infeasible to ingest the whole thing.