More

ankurdhama · 2026-01-22T03:47:38 1769053658

WSLg also works on Windows 10.

ankurdhama · 2026-01-07T03:34:21 1767756861

So the "hardware failure" happening exactly at the same time the Windows update installation failed are not related? That sounds like a one in a billion kind of coincident.

eli · 2026-01-07T03:39:36 1767757176

An upgrade process involves heavy CPU use, disk read/writes, and at least a few power cycles in short time period. Depending what OP was doing on it otherwise, it could've been the highest temperature the device had ever seen. It's not so crazy.

My guess would've been SSD failure, which would make sense to seem to appear after lots of writes. In the olden days I used to cross my fingers when rebooting spinning disk servers with very long uptimes because it was known there was a chance they wouldn't come back up even though they were running fine.

jonathanlydall · 2026-01-07T06:16:19 1767766579

Not for a server, but many years ago my brother had his work desktop fail after he let it cold boot for the first time in a very long time.

Normally he would leave his work machine turned on but locked when leaving the office.

Office was having electrical work done and asked that all employees unplug their machines over the weekend just in case of a surge or something.

On the Monday my brother plugged in machine and it wouldn’t turn on. Initially the IT guy remarked that my brother didn’t follow the instructions to unplug it.

He later retracted the comment after it was determined the power supply capacitors had gone bad a while back, but the issue with them was not apparent until they had a chance to cool down.

GCUMstlyHarmls · 2026-01-07T05:28:54 1767763734

> In the olden days I used to cross my fingers when rebooting spinning disk servers with very long uptimes because it was known there was a chance they wouldn't come back up even though they were running fine.

HA! Not just me then!

I still have an uneasy feeling in my guts doing reboots, especially on AM5 where the initial memory timing can take 30s or so.

I think most of my "huh, its broken now?" experiences as a youth were probably the actual install getting wonky though, rather than the few rare "it exploded" hardware failures after reboot, though that definitely happened.

zelon88 · 2026-01-07T05:19:09 1767763149

This, 100%.

I'd like to add my reasoning for a similar failure of an HP Proliant server I encountered.

Sometimes hardware can fail during long uptime and not become a problem until the next reboot. Consider a piece of hardware with 100 features. During typical use, the hardware may only use 50 of those features. Imagine one of the unused features has failed. This would not cause a catastrophic failure during typical use, but on startup (which rarely occurs) that feature is necessary and the system will not boot without it. If it could, it could still perform it's task... because the damaged feature is not needed. But it can't get past the boot phase, where the feature is required.

Tl;dr the system actually failed months ago and the user didn't notice because the missing feature was not needed again until the next reboot.

startupsfail · 2026-01-07T05:54:10 1767765250

Is there a good reason why upgrades need to stress-test the whole system? Can't they go slowly, throttling resource usage to background levels?

They involve heavy CPU use, stress the whole system completely unnecessary, the system easily sees the highest temperature the device had ever seen during these stress tests. If during that strain something fails or gets corrupted, it's a system-level corruption...

Incidentally, Linux kernel upgrades are not better. During DKMS updates the CPU load skyrockets and then a reboot is always sketchy. There's no guarantee that something would not go wrong, a secure boot issue after a kernel upgrade in particular could be a nightmare.

zelon88 · 2026-01-07T06:28:47 1767767327

To answer your question; it helps to explain what the upgrade process entails.

In the case of Linux DKMS updates: DKMS is re-compiling your installed kernel modules to match the new kernel. Sometimes a kernel update will also update the system compiler. In that instance it can be beneficial for performance or stability to have all your existing modules recompiled with the new version of the compiler. The new kernel comes with a new build environment, which DKMS uses to recompile existing kernel modules to ensure stability and consistency with that new kernel and build system.

Also, kernel modules and drivers may have many code paths that should only be run on specific kernel versions. This is called 'conditional compilation' and it is a technique programmers use to develop cross platform software. Think of this as one set of source code files that generates wildly different binaries depending on the machine that compiled it. By recompiling the source code after the new kernel is installed, the resulting binary may be drastically different than the one compiled by the previous kernel. Source code compiled on a 10 year old kernel might contain different code paths and routines than the same source code that was compiled on the latest kernel.

Compiling source code is incredibly taxing on the CPU and takes significantly longer when CPU usage is throttled. Compiling large modules on extremely slow systems could take hours. Managing hardware health and temperatures is mostly a hardware level decision controlled by firmware on the hardware itself. That is usually abstracted away from software developers who need to be able to be certain that the machine running their code is functional and stable enough to run it. This is why we have "minimum hardware requirements."

Imagine if every piece of software contained code to monitor and manage CPU cooling. You would have software fighting each other over hardware priorities. You would have different systems for control, with some more effective and secure than others. Instead the hardware is designed to do this job intrinsically, and developers are free to focus on the output of their code on a healthy, stable system. If a particular system is not stable, that falls on the administrator of that system. By separating the responsibility between software, hardware, and implementation we have clear boundaries between who cares about what, and a cohesive operating environment.

startupsfail · 2026-01-08T05:56:03 1767851763

The default could be that a background upgrade should not be a foreground stress test.

Imagine you are driving a car and from time ro time, without any warning, it suddenly starts accelerating and decelerating aggressively. Your powertrain, engine, breaks are getting tear and wear, oh and at random that car also spins out and rolls, killing everyone inside (data loss).

This is roughly how current unattended upgrades work.

SecretDreams · 2026-01-07T04:35:03 1767760503

> Depending what OP was doing on it otherwise, it could've been the highest temperature the device had ever seen. It's not so crazy.

Kind of big doubt. This was probably not slamming the hardware.

refulgentis · 2026-01-07T05:08:15 1767762495

That was absolutely slamming the hardware. (source: worked on Android, and GPs comments re: this are 100% correct. I’d need a bit more, well anything, to even come around to the idea the opposite is even plausible. Best steelman is naïvete, like “aren’t updates are just a few mvs and a reboot?”)

tobyjsullivan · 2026-01-07T03:56:07 1767758167

Over my 35 years of computer use, most hardware failures (very, very rare) happen during a reboot or power-on. And most of my reboots happen when installing updates. It actually seems like a very high probability in my limited experience.

Of course, it’s possible that the windows update was a factor, when combined with other conditions.

fc417fc802 · 2026-01-07T04:17:20 1767759440

There's also the case where the hardware has failed but the system is already up so it just keeps running. It's when you finally go to reboot that everything falls apart in a visible manner.

da_chicken · 2026-01-07T08:02:30 1767772950

This is one of the reasons I am not a fan of uptime worship. It's not a stable system until it's able to cold boot.

Say you have a system that has been online for 5 years continuously until a power outage knocks it out. When power is restored, the system doesn't boot to a working system. How far back do you have to go to in your backups to find a known good system? And this isn't just about hardware failure, it's an issue of configuration changes, too.

phire · 2026-01-07T05:15:32 1767762932

I also notice that people with lots of experience with computers will automatically reboot when they encounter minor issues (have you tried turning it off and on again?).

When it then completely falls apart on reboot, they spend several hours trying to fix it and completely forget the "early warning signs" that motivated them to reboot in the first place.

I've think the same applies to updates. I know the time I'm most likely to think about installing updates is when my computer is playing up.

ssl-3 · 2026-01-07T07:34:30 1767771270

I try to do the opposite, and reboot only as a last resort.

If I reboot it and it starts working again, then I haven't fixed it at all.

Whatever the initial problem was is likely to still present after reboot -- and it will tend will pop up again later even if things temporarily seem to be working OK.

close04 · 2026-01-07T10:41:21 1767782481

> Whatever the initial problem was is likely to still present after reboot

You only know this after the reboot. Reboot to fix the issue and if it comes back then you know you have to dig deeper. Why sink hours of effort into fixing a random bit flip? I'll take the opposite position and say that especially for consumer devices most issues are caused by some random event resulting in a soft error. They're very common and if they happen you don't "troubleshoot" that.

ssl-3 · 2026-01-07T19:38:27 1767814707

With any system: When I can find and correct the problem out of the gate, then it remains corrected the issue does not recur.

fc417fc802 · 2026-01-07T10:01:15 1767780075

How do you avoid sinking time into chasing illusory bugs?

GranPC · 2026-01-07T03:43:01 1767757381

For all we know, this thing was on its last legs (these machines do run very hot!) and the update process might have been the final nail in the coffin. That doesn't mean Microsoft set out to kill OP's machine... Same thing could have happened if OP ran make -j8 -- we wouldn't blame GNU make.

wnevets · 2026-01-07T05:04:34 1767762274

This reminds me of the 3090 hardware problems being exposed by Amazons New World [1]. Everyone really wanted to blame the software.

https://www.pcgamer.com/amazon-new-world-killing-rtx-3090-gp...

Graziano_M · 2026-01-07T03:47:41 1767757661

I had a friend's dad's computer's HDD fail while I was installing Linux on it to show him it. That was terrifying. I still remember the error, and I just left with it (and Windows) unable to boot. Later my friend told me that the drive was toast.

Come to think of it, maybe it was me. I might have trashed the MBR? I remember the error, though, "Non system disk or disk error".

toast0 · 2026-01-07T06:40:08 1767768008

IIRC, that error text comes from the mbr. You may have trashed the partition table?

Graziano_M · 2026-01-08T01:30:36 1767835836

Yeah, I think so. It's been ~25 years, and only while typing out that comment did I remember the error message and realize that's probably what I had done.

If I recall correctly, he ended up scrapping the drive.

justinclift · 2026-01-07T15:44:22 1767800662

Yeah, sounds like the drive was still physically detected but that the expected boot loader wasn't present any more.

wvenable · 2026-01-07T04:09:42 1767758982

If had happened any other time, there wouldn't be a blog post about it and we wouldn't be reading about it.

olyjohn · 2026-01-07T03:39:50 1767757190

I've fixed thousands of PCs and Macs over my career. Coincidences like this happen all the time. I mean, have you seen the frequency of updates these days? There are always some kind of updates happening. So the chances of your system breaking during an update is not actually that slim.

pdpi · 2026-01-07T04:00:37 1767758437

I think it's fair to say they're related, yes. But causality can well be the other way around — that Windows upgrade failed because of flaky hardware.

santoshalper · 2026-01-07T03:41:27 1767757287

Two bugs occurring at the same time is definitely not one in a billion, and with billions of computers in the world, weird shit is going to happen.

Aurornis · 2026-01-07T04:23:56 1767759836

> That sounds like a one in a billion kind of coincident

Hardware is more likely to fail under load than at idle.

Blaming the last thing that was happening before hardware failed isn't a good conclusion, especially when the failure mode manifests as random startup failures instead of a predictable stop at some software stage.

nightfly · 2026-01-07T03:42:30 1767757350

windows update just doing a normal write causing the active chunk of flash memory being used to hold something in the boot loader to a different failed/failing section

taneq · 2026-01-07T04:05:26 1767758726

A software update can absolutely trigger or unmask a hardware bug. It’s not an either/or thing, it’s usually (if a hardware issue is actually present) both in tandem:

ezfe · 2026-01-07T04:22:59 1767759779

This happens all the time, people always doubt it - but the patterns are always consistent: large updates kill hardware that's in progress of failing

justsomehnguy · 2026-01-07T03:43:48 1767757428

"Hardware failure" => "WinUpdate failure" => "Corrupted system" conforms the Occam's razor.

croes · 2026-01-07T03:51:51 1767757911

Like winning the lottery?

Happens quite often

ankurdhama · 2025-12-31T05:35:53 1767159353

Have you tried any other desktop environment like Gnome or KDE?

nickjj · 2025-12-31T15:27:52 1767194872

I haven't tried a full desktop environment that's not niri yet.

With the direction of KDE, I don't know if it would change things because the latest upcoming version will be Wayland with no X11 support[0]. It sounds like it would still support XWayland in the same way niri supports it too, but the potential root cause (Wayland based compositor) would still be there? I'd also be using the same NVIDIA drivers.

I'm not deep into the woods yet in Linux on the desktop so I'm speculating a lot here.

Certainly worth a test before I go crawling back to Windows tho, however 95% of the reason I switched is because of niri, it's that good. With that said if Plasma ends up working as well as Windows of course I'd still choose that over my old Windows 10 set up.

[0]: https://www.phoronix.com/news/KDE-Plasma-2025-Wayland-Succes...

ankurdhama · 2025-12-11T06:43:49 1765435429

> The TaskBar at the bottom was a great innovation that persists to this day

The taskbar was supposed to be at the top of screen but some apps would put window at 0,0 and will be below taskbar, to fix they moved the taskbar to bottom (which I think is bad from ergonomic point of view)[1]

[1] https://devblogs.microsoft.com/oldnewthing/20030912-00/?p=42...

ankurdhama · 2025-11-16T05:48:47 1763272127

There are many MDM solutions for macOS that business use.

ankurdhama · 2025-10-05T04:17:55 1759637875

What's worse then copilot on task bar is the copilot key on keyboard. This key doesn't even have its own scan code, instead it send something like Win+Shift+F23.

1718627440 · 2025-10-05T20:23:57 1759695837

Sounds not to bad for me, or have you bound anything to Super-Shift-F23?

ankurdhama · 2025-09-27T05:28:25 1758950905

You mean the designers who did something like this: https://www.reddit.com/r/gnome/comments/1nrcvok/i_dont_see_t...

deaddodo · 2025-09-27T07:47:53 1758959273

https://news.ycombinator.com/item?id=45391654

ankurdhama · 2025-09-26T02:46:00 1758854760

Please fine $1B more for providing only SD stream on Linux.

ankurdhama · 2025-09-24T04:54:44 1758689684

Exactly, so many hoops to jump over on macos.

ankurdhama · 2025-09-24T02:48:17 1758682097

The title should be "Can life be modeled as computation". It doesn't mean life is computation.