I totally agree. I remember the June magic as well - almost overnight my abiliti...

thecoppinger · 2025-10-20T23:20:23 1761002423

> trying things that were beyond my ability to implement technically but within the bounds of my conceptual understanding

This is a really neat way of describing the phenomenon I've been experiencing and trying to articulate, cheers!

Arisaka1 · 2025-10-21T09:36:26 1761039386

When I was in high school, I would see the algebra teacher work through expressions and go "ohhh, that makes sense". But when I got back home to work with the homework, I couldn't make the pieces fit.

Isn't that the same? Just because you recognize something someone else wrote and makes you go "ohh, I understand it conceptually" doesn't mean that you can apply that concept in a few days or weeks.

So when the person you responded to says:

>almost overnight *my abilities* and throughput were profoundly increased

I'd argue the throughput did but his abilities really weren't, because without the tool in question you're just as good as before the tool. To truly claim that his abilities were profoundly increased, he has to be able to internalize the pattern, recognize the pattern, and successfully reproduce it across variable contexts.

Another example would be claiming that my painting abilities and throughput were profoundly increased, because I used to draw stick figures and now I can draw Yu-Gi-Oh! cards by using the tool. My throughput was really increased, but my abilities as a painter really haven't.

catigula · 2025-10-20T22:41:23 1761000083

>I think, in most cases, GPT5-Codex finally is as good as a senior engineer for my specific use case.

This is beyond bananas to me given that I regularly see codex high and Gpt-5-high both fail to create basic react code slightly off the normal distribution.

hansvm · 2025-10-20T23:27:21 1761002841

That might say something about the understandability of the react framework/paradigm ;)

Quality varies a lot based on what you're doing, how you prompt it, how you orchestrate it, and how you babysit and correct it. I haven't seen anything I'd call senior, but I have seen it, for some classes of tasks, turn this particular engineer into many seniors. I still have to supply all the heavy lifting (here's the concurrency model, how you'll ensure exactly-once-delivery, particular functions and classes you definitely want, a few common pitfalls to avoid, etc), but then it can flesh out the details extremely well.

aaronblohowiak · 2025-10-20T23:35:23 1761003323

It makes me waaayyyy faster but, like you, that’s because I already know what has to be done.

evilduck · 2025-10-21T13:22:15 1761052935

If you really want to see it fail at something easy, try to have write something that can use JSX but doesn't use React (Bun, Hono, etc). Seems like no amount of context management and detailed instructions will keep it from reaching for React-isms.

catigula · 2025-10-22T00:19:12 1761092352

Bear AI signal whenever we see glimpses that the reasoning is just pattern matching to artifacts of actual human reasoning.

pkreg01 · 2025-10-20T23:21:51 1761002511

Do you mind if I ask what kind of React code you're working on? I've had good success using Codex for my frontend development, especially since all of my projects consistently rely on a pretty widely used and well documented component library. I realize that makes my use case fairly narrow, so I don't think I've discovered the limits you have.

catigula · 2025-10-20T23:26:21 1761002781

Normal legacy react enterprise application.

Today I was trying to get it to temporarily shim in for development and consume the value of a redux store via merely putting a default in the reducer. Depending on that value, the application would present different state.

It failed to accomplish this and added a disgusting amount of defensive nonsense code in my saga, reducer and component to ensure the value was there. It took me a very short time to correct it but just watching it completely fail at this task was borderline absurd.

pkreg01 · 2025-10-20T23:36:35 1761003395

Thanks for the context! I feel the same way. When it fails it fails hard. This is why I'm extremely skeptical of any of the non-cli cloud solutions - as you observed, I think the failures compound and cascade if you don't stop them early, which requires a compelling interface and the ability to manually intervene very fast.