Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It feels like a bit of history is missing... If ollama was founded 3 years before llama.cpp was released, what engine did they use then? When did they transition?


I don't think that is the case. Llama.cpp appeared within weeks after meta released llama to select researchers (which then made it out to the public). 3 years before that nobody knew of the name llama. I'm sure that llama.cpp existed first


> within weeks

One week, really, if we consider the "public" availability.

Llama announced: February 24, 2023

Weights leaked: March 3, 2023

Llama.cpp: March 10, 2023

(Ollama 0.0.1: Jul 8, 2023)


They spent several years in stealth mode but the initial release was llama.cpp.

Ollama v0.0.1 "Fast inference server written in Go, powered by llama.cpp" https://github.com/ollama/ollama/tree/v0.0.1


They spent several years in stealth mode

doing what?

trying to build themselves what llama.cpp ended up doing for them?


I asked myself the same question. Some other commenter mentioned above they started with some Kubernetes infrastructure thing and they pivoted later.




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: