torturing a model with human stupidity probably doesn't align with their position on model welfare; wondering if they tried bullying it into hacking its way out of the slop gulag
So if all the AI code is being reviewed by humans (not sure this is true, but let's assume it is), then why are there 5000+ bugs? Are you blaming the Anthropic developers rather than the AI?
Anthropic needs to show that its models continually get better. If a model showed minimal to no improvement, it would do significant damage to their valuation. We have no way of validating any of this; there are no independent researchers who can back any of the assertions made by Anthropic.
I don’t doubt they have found interesting security holes, the question is how they actually found them.
This System Card is just a sales whitepaper, and it confirms what that “leak” from a week or so ago implied.
I've been increasingly "freaking out" for about 3-4 years now, and it seems the pessimistic scenario is materializing. It looks like it will be over for software engineers in the not-so-distant future. In January 2025 I said I expected software engineers to be replaced in 2 years (pessimistic) to 5 years (optimistic). Right now I'm guessing 1 to 3 years.
> I've been increasingly "freaking out" for about 3-4 years now, and it seems the pessimistic scenario is materializing. It looks like it will be over for software engineers in the not-so-distant future. In January 2025 I said I expected software engineers to be replaced in 2 years (pessimistic) to 5 years (optimistic). Right now I'm guessing 1 to 3 years.
Tell me how this will replace Jira, planning, and convincing PMs about viability. Programming is only part of the job devs do.
AI psychosis is truly next level in these threads.
Have you never filed JIRA tickets, planned, or debated viability with an AI? Which of those do you find an AI absolutely cannot do better than the average developer?
I assure you it will soon become very clear that mass job losses are one of the least concerning side effects of developing the magic "everything that can plausibly be done within the constraints of physics is now possible" machine.
We're opening a can of worms whose horrors I don't think most people have the imagination to grasp.
While I'm definitely concerned that AI is a massive driver of the centralization of power, at least in theory, being able to do far more in the space of "things physics admits to be possible" is massively wealth-enhancing. That is literally how we got from the pre-industrial world to today.
Controversially, I'd argue that there is likely an optimal and stable level of technological advancement which we would be wise not to cross. That said, we are human, so we will; I'd just rather it happened in a couple hundred years rather than a decade or two.
For example, it's hard to imagine an AI that gives us the capability to cure cancer but doesn't give us the capability to create targeted superviruses.
What sources would you even be looking for? I think you're asking the wrong question. I'm not arguing a scientific theory that can be backed by data and experimentation; I can only give you my reasoning for why I believe what I believe.
Firstly, I'd propose that all technological advances are a product of time and intelligence, and that given unlimited time and intelligence, the discovery and application of new technologies is fundamentally only limited by resources and physics.
There are many technologies which might plausibly exist, but which we have not yet discovered because we only have so much intelligence and have only had so much time.
With more intelligence we should assume the discovery of new technologies will be much quicker – perhaps exponentially so, given the current rate of technological discovery and the exponential progression of AI.
There are lots of technologies we have today which would seem like magic to people in the past. Future technologies likely exist which would make us feel this way were they available today.
While it's hard to predict specifically which technologies could exist soon in a world with ASI, if we assume it's within the bounds of available resources and physics, we should assume it's at least plausible.
Examples:
- Mind control – with enough knowledge about how the brain works, you could likely devise sensory or electromagnetic input that would manipulate the functioning of the brain to either strongly influence or effectively dictate its output.
- Mind simulation – again, with enough knowledge of the brain, you could take a snapshot of someone's mind with an advanced electromagnetic device and simulate it, torturing the copies in parallel to reveal any secret, or just because you feel like it.
- Advanced torture – with enough knowledge of human biology, death becomes optional. New methods of torture that would previously have killed the victim become plausible. States like North Korea could force humans to work for hundreds of years in incomprehensible agony for opposing the state.
- Advanced biological weapons – with enough knowledge of virology, sophisticated tailor-made viruses replace nerve agents as Russia's weapon of choice for killing those accused of treason. These viruses remain dormant for months, infecting the host and people genetically similar to them (parents, children, grandchildren). After months, the virus rapidly kills its hosts in horrific ways.
I could go on; you just need to use your imagination. I'm not arguing that any of the above are likely to be discovered, just that it would be very naive to think AI will stop at a cure for cancer. If it gives us a cure for cancer, it will give us lots of things we might wish it didn't.
You are supposing it's possible to know that much about things that may simply not be knowable to us, even with these tools. Life is extremely complex, more so than engineering-minded people typically assume. Let's be humble here and acknowledge it.
Why couldn't it be unknowable? I'm not saying that it is, but it could be. The human brain has its limits, and things could be too complex for us to understand well enough to modify them at will. We could understand a lot, but not enough to manipulate it with certainty. Biology is not physics.
Why not? The human mind has its limits. The complexity of physics is orders of magnitude smaller than that of biology, let alone any kind of social science. Physics is the exception, not the rule. The rest of the sciences are far messier.
Almost anything outside physics is not predictable. Anything that involves human behavior is not understood at all, especially if it involves many humans (economics, sociology...). You could describe it, sure, but that is not the same as understanding it and modifying it at will.
I would acknowledge that. I don't think these things are remotely possible any time soon with current rates of progress.
However, I think people tend to fail to appreciate what exponential trends compound to, so the question in my mind is more whether or not you believe AI will unlock an exponential increase in the rate of progress and understanding. Extremely complex is still finite complexity at the end of the day.
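A toy sketch of that point, with entirely made-up numbers (a hypothetical 25% yearly compounding of the rate of discovery versus a flat rate), just to show how quickly compounding outruns linear intuition:

    # Toy comparison: linear vs. compounding "rate of discovery".
    # The 25% growth figure is invented for illustration; nothing
    # here models real R&D output.
    linear_total, compound_total, compound_rate = 0.0, 0.0, 1.0
    for year in range(1, 31):
        linear_total += 1.0             # flat: one unit of discovery per year
        compound_total += compound_rate
        compound_rate *= 1.25           # hypothetical 25% yearly compounding
        if year % 10 == 0:
            print(f"year {year}: linear={linear_total:.0f}, "
                  f"compounding={compound_total:.0f}")
    # year 10: linear=10, compounding=33
    # year 20: linear=20, compounding=343
    # year 30: linear=30, compounding=3227

Thirty units of flat progress versus thousands of units of compounded progress is the whole disagreement in a nutshell.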
Maybe AI won't significantly increase the rate of progress across all scientific fields. I'm fairly confident it will significantly increase it in at least some, though, and it seems likely to me that biological processes will be much easier for us to model and predict with AI. I'm much less sure about progress in domains like physics and robotics.
On the slightly optimistic side, much more intelligence will be spent in countering these criminal uses than in enabling them. For each of the terrible inventions you mentioned, there are other inventions to counter them.
Freak out about what? I read the announcement and thought "that's a dumb name, they sure are full of themselves" – then I went back to using Claude as a glorified commit message writer. For all its supposed leaps, AI hasn't affected my life much in the real world, except to make HN stories more predictable.
I can think of several possible messy outcomes that would be able to directly affect me, not all mutually exclusive:
- Job loss by me being replaced by an AI or by somebody using an AI. Or by an AI using an AI.
- Resulting societal instability once blue collar jobs get fully automated at scale, and there is no plan in place to replace this loss of peoples' livelihoods.
- People turning to AI models instead of friends for emotional support, loss of human connection.
- Erosion of democracy by making authoritarianism and control very scalable: broad, detailed population surveillance and automated investigation using LLMs, which were previously bounded by manpower.
- Autonomous weapons, "Slaughterbots" as in the short film from 2017.
- Biorisk, through dangerous biological capabilities that enable a small team of less-skilled terrorists to use a jailbroken LLM to create something dangerous.
- Other powers in the world deciding that this technology is too powerful in the hands of the US, or too dangerous to be built at all and has to be stopped by all means.
- Loss of/voluntary ceding of control over something much smarter than us. "If Anyone Builds It, Everyone Dies".
The only thing preventing this today is cost, not capability. As costs come down over the next 5 years, the idea that the internet was once dominated by people will seem quaint.
Until recently I would have described myself as an AI skeptic. HN has been a great source for cope on the AI subject over the years. You can find nitpicks, caveats, all sorts of reasons to believe things aren’t as significant as they seem. For me Opus 4.5 was the inflection point where I started to think “maybe this isn’t a bubble.” The figures in this report, if accurate, are terrifying.