Been disappointed to see Ollama list models that are supported by the cloud prod...

mchiang · 2025-10-16T07:15:28 1760598928

Qwen3-coder:30b is in the blog post. This is one that most users will be able to run locally.

We are in this together! Hoping for more models to come from the labs in varying sizes that will fit on devices.

bigyabai · 2025-10-16T07:18:58 1760599138

I'm looking forward to future ollama releases that might attempt parity with the cloud offerings. I've since moved onto the Ollama compatibility API on KoboldCPP since they don't have any such limits with their inference server.

mchiang · 2025-10-16T07:26:54 1760599614

I am super hopeful! Hardware is improving, inference costs will continue to decrease, models will only improve...

Balinares · 2025-10-16T08:49:14 1760604554

How does Qwen3-Coder:30B compare to Instruct-2507 as a coding agent backend? I was under the impression that Instruct was intended to supersede Coder?

hephaes7us · 2025-10-16T18:38:05 1760639885

In this case, it's not about whether it fits on my physical hardware or not. It's about what seems like an arbitrary restriction designed to start pushing users to their cloud offering.

zozbot234 · 2025-10-16T07:59:27 1760601567

Aren't these models consistently quite large and hard to run locally? It's possible that future Ollama releases will allow you to dynamically manage VRAM memory in a way that enables these models to run with acceleration on even modest GPU hardware (such as by dynamically loading layers for a single 'expert' into VRAM, and opportunistically batching computations that happen to rely on the same 'expert' parameters - essentially doing manually what mmap does for you in CPU-only inference) but these 'tricks' will nonetheless come at non-trivial cost in performance.

colesantiago · 2025-10-16T07:28:04 1760599684

I know this is disappointing, but what business model would be best here for ollama?

1. Donationware - Let's be real, tokens are expensive and if they ask for everyone to chip in voluntarily people wouldn't do that and Ollama would go bust quickly.

2. Subscriptions (bootstrapped and no VCs) again like 1. people would have to pay for the cloud service as a subscription to be sustainable (would you?) or go bust.

3. Ads - Ollama could put ads in the free version but to remove them the users can pay for a higher tier, a somewhat good compromise, except developers don't like ads and don't like pay for their tools unless their company does it for them. No users = Ollama goes bust.

4. VCs - This is the current model which is why they have a cloud product and it keeps the main product free (for now). Again, if they cannot make money or sell to another company Ollama goes bust.

5. Fully Open Source (and 100% free) with Linux Foundation funding - Ollama could also go this route, but this means they wouldn't be a business anymore for investors and rely on the Linux Foundation's sponsors (Google, IBM, etc) for funding the LF to stay sustainable. The cloud product may stay for enterprises.

Ollama has already taken money from investors so they need to produce a return for them so 5. isn't an option in the long term.

6. Acquisition by another company - Ollama could get acquired and the product wouldn't change* (until the acquirer jacks up prices or messes with the product) which ultimately kills it anyway as the community moves on.

I don't see any other way that Ollama can not be enshittified without making a quick buck.

You just need to avoid VC backed tools and pay for bootstrapped ones without any ties to investors.

hephaes7us · 2025-10-16T19:07:29 1760641649

Tokens are expensive, sure, but I don't even _want_ Ollama to run inference for me.

Ollama gives me, essentially, a wrapper for llama.cpp and convenient hosting where I can download models.

I'm happy to pay for the bandwidth, plus a premium to cover their running this service.

I'm furthermore happy to pay a small charge to cover the development that they've done and continue to do to make local-inference easy for me.

CaptainOfCoit · 2025-10-16T10:12:39 1760609559

> I don't see any other way that Ollama can not be enshittified without making a quick buck.

Me neither. The mistake they did was getting outside investments, as now they're no longer in full control and eventually are gonna have to at least give the impression they give a shit about the investors, and it'll come at the cost of the users one way or another.

Please pay for your tools that are independently developed, we really need more community funding of projects so we can avoid this never-ending spiral of VC-fueled+killed tools.

sanex · 2025-10-16T12:46:05 1760618765

They got the investments before the company was even ollama. They exist because their VC was ok with them pivoting to build the current product. It's likely it wouldn't exist without the funding.

CaptainOfCoit · 2025-10-16T13:24:33 1760621073

I dunno, the founders could have also changed their frame of mind, and started the project without VC investments. AFAIK, the founding team worked at Docker before, a company that doesn't pay peanuts, so I'm sure they could have scraped together enough to bootstrap it initially.

But I understand the added zeros to the (maybe) future payout when you take VC funds is hard to ignore, I'm not blaming them for anything really.

vladsanchez · 2025-10-16T13:36:43 1760621803

Ok, so that glm-4.6 doesn't/can't run locally? That's quite a disappointment