Nothing will come close to Opus 4.6 here. You will be able to fit a distilled 20B to 30B model on your GPU.
Gpt-oss-20B is quite good in my local testing on a MacBook Pro (M2 Pro, 32GB).
The bigger downside, when you compare it to Opus or any other hosted model, is the limited context. You might be able to achieve around 30k.
Hosted models often have 128k or more. Opus 4.6 has 200k as standard and 1M in API beta mode.
There are local models with larger context, but the memory requirements explode pretty quickly, so you need to lower the parameter count or resort to heavy quantization. Some local inference platforms allow you to place the KV cache in system memory (while still otherwise using the GPU). Then you can just use swap to allow even very long contexts, but this slows inference down quite a bit. (The write load on the KV cache is just appending a KV vector per inferred token, so it's quite compatible with swap. You won't be wearing out the underlying storage all that much.)
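To make the "memory explodes with context" point concrete, here is a back-of-the-envelope sketch. The layer/head/precision numbers are assumptions for an illustrative 20B-class model with grouped-query attention, not the actual config of gpt-oss-20b or any specific model:

```python
# Rough KV-cache sizing sketch. All model parameters are illustrative
# assumptions, not taken from a real model card.

def kv_cache_bytes(context_len, n_layers=48, n_kv_heads=8,
                   head_dim=128, bytes_per_elem=2):
    """Bytes needed to hold keys and values for `context_len` tokens."""
    per_token = 2 * n_layers * n_kv_heads * head_dim * bytes_per_elem  # K + V
    return context_len * per_token

for ctx in (8_192, 32_768, 131_072, 1_048_576):
    gib = kv_cache_bytes(ctx) / 2**30
    print(f"{ctx:>9} tokens -> ~{gib:6.1f} GiB of KV cache")

# With these assumed numbers: ~1.5 GiB at 8k, ~6 GiB at 32k,
# ~24 GiB at 128k and ~192 GiB at 1M tokens. The growth is linear in
# context length, but it quickly dwarfs the VRAM left over after the
# weights, which is why spilling the KV cache to system RAM (or swap)
# becomes attractive.
```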
Regarding the $200 subscription.
For Claude Code with Opus (and also Sonnet) you need that, yes.
I had ChatGPT Codex (GPT5.2, high reasoning) running on my side project for multiple hours over the last few nights.
It created a server deployment for QA and PROD + client builds.
It waited for the builds to complete, got the logs from GitHub Actions and fixed problems.
Only after 4 days of this (around 2-4 hours of active coding) did I reach the weekly limit of the ChatGPT Plus plan (23€).
Far better value so far.
To be fully honest, it fucked up one Flyway script. I have to fix this myself now :D. Will write a note in the Agent.md to never alter existing scripts.
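For what it's worth, such a note could be as simple as this (hypothetical wording and file name; Flyway by default validates checksums of already-applied migrations, so editing an old script breaks the next deploy):

```markdown
## Database migrations (Flyway)
- Never edit an existing migration script once it has been applied;
  Flyway's checksum validation will fail on the next deploy.
- For any schema change, add a new versioned migration file instead
  (e.g. a hypothetical V42__fix_previous_change.sql).
```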
But the work otherwise was quite solid and now my server is properly deployed.
If I switched between high reasoning for planning and medium reasoning for coding, I would get even more usage.
But seriously, I can't help but think that this proliferation of model iterations and productizations is an indication that their owners have no idea what they are doing with any of it. They're making variations and throwing them against the wall to see what sticks.
Don't get me wrong, but somebody has to operate an exit node and somehow there needs to be a consensus on the protocol + routing.
If the network is only earth-bound fixed wireless, the distance might be small enough that the state comes for the operators themselves...
This raises the cost of running such a network from just money to a threat to your life.
Getting many open source satellites up in orbit might not be feasible.
Agreed that nothing is fully trustless on Earth. The point isn’t eliminating operators, it’s avoiding single points of coercion and failure. One exit can be shut down, but many exits and many types of networks across jurisdictions (including more alternative infra like Guifi.net’s mesh networks in Spain, for example) raise the cost from “call a CEO” to sustained political pressure. Compare that to a single CEO who controls an entire network and is also a billionaire with a messiah complex, far-right leanings and a tendency to drug abuse.
Absolute decentralization is impossible. Reducing capture and increasing resilience is not. That’s a meaningful difference.
That said, I’m happy with Starlink as an extra actor in a healthy mix of ISPs and networks that brings resilience.
Got to say, I like the current Android versions.
In the early days I flashed my Motorola Defy every second month with some cool new ROM.
Always rooted and Xposed, always enabling something new.
Now I run a S23 Ultra and after two years it still does everything I need.
OneUI 8.0 and Android 16.
For work (app dev) I also have a Pixel 7a, always with the newest Android Beta.
Also works well.
Even the entry level phones work OK to pretty good now.
My Samsung A16 5G (also for work) functions surprisingly well for 150€.
> Now I run a S23 Ultra and after two years it still does everything I need.
Maybe, but it is fully under Google and Samsung's control, and is chock-full of spyware. You couldn't pay me to use a stock (Googled) Android phone for this reason alone.
Back when I used Android phones, tweaking was pretty important to me too. I still remember when I installed CyanogenMod on a Motorola XT1565, those were the days... Eventually, LineageOS, and then some new phones happened, not all of which were rootable, though I eventually ended up with a OnePlus 7 Pro which was pretty tweakable and even opened the possibility of bootloader re-locking, until a TWRP bug wiped my device and I pretty much stopped tweaking. Was never quite able to get EdXposed working right again...
How well is rooting supported on these newer Android versions/devices? If I install LineageOS on my device, for example, I can be reasonably sure that Magisk will work fine. But how well does it work on a stock, locked-down ROM?
Most devices don't have unlockable bootloaders now, so you can't even root them unless it's a popular device and a temporary/finicky hack was found.
Fully agree.
ChatGPT is often very confident and tells me that X and Y are absolutely wrong in the code.
It then answers with something worse...
It also rarely says "sorry, I was wrong" when the previous output was just plain lies. You really need to verify every answer because it is so confident.
I fully switched to Gemini 3 Pro.
Looking into an Opus 4.5 subscription too.
My GF, on the other hand, quite prefers ChatGPT for writing tasks (she's a school teacher, grades 1-4).
When you have enough experience and the project fits, this is the way to go.
They don't pay for your time.
They pay for your output and you can bill them on the output.
Not quickly, but if somebody puts enough money on the table, the fabs change too.
All about cost and return.
Micron just axed their brand Crucial (consumer RAM and SSDs) because they will only sell to data centers from now on.
Let's see if this demand truly holds.
I'm still unsure.
Currently nobody makes money with AI.
There will be a correction (if I can trust my magic crystal ball ;) )
Got to say, those prices were quite cheap.
I also upgraded my home server to 32GB RAM and paid something like 55€.
Now we can just wait for some bubble to pop...
Sadly, everything in the general direction of RAM or SSD chips is getting more expensive because a lot of production capacity is being redistributed to serve AI chips and everything around them.
Even lower-end GPUs are getting more expensive, even if they are not really useful for AI.
But they still contain some chips and RAM which are in high demand.
So yes, Apple will likely also have to pay higher prices when they renew their contracts.