There are two huge things your 5 minute setup is missing which are very hard tec...

etiennedi · on Dec 14, 2021

Spot on! Both of those were motivating factors when building Weaviate (Open Source Vector Search Engine). We really wanted it to feel like a full database or search engine. You should be able to do anything you would do with Elasticsearch, etc. There should be no waiting time between creating an object and searching. Incremental Updates and Deletes supported, etc.

On your second point about efficient filtering, check out this article I wrote outlining how filtered vector search works in Weaviate: https://towardsdatascience.com/effects-of-filtered-hnsw-sear...

For even more details on filtering, check the documentation: https://www.semi.technology/developers/weaviate/current/arch...

freediver · on Dec 14, 2021

Correct, that would take more than 5 minutes, although still possible to do with Faiss (and not that hard relatively speaking - in the Teclis demo, I indeed did your second point - combine results with a keyword search engine and there are many simple solutions you can use out there like Meilisearch, Sonic etc.e). If you were to try using an external API for vector search, you would still need to build keyword based search separately (and then combining/ranking logic) so then you may be better off just building the entire stack anyway.

Anyway, for me, the number one priority was latency and it is hard to beat on-premise search for that.

Even then, a vector search API is just one component you will need in your stack. You need to pick the right model, create vectors (GPU intensive), then possibly combine search results with keyword based search (say BM25) to improve accuracy etc. I am still waiting to see an end-to-end API doing all this.

jkb79 · on Dec 15, 2021

>then possibly combine search results with keyword based search (say BM25) to >improve accuracy etc. I am still waiting to see an end-to-end API doing all this >

Vespa.ai supports combining dense vector search with keyword search and ranking, see https://docs.google.com/presentation/d/1vWKhSvFH-4MFcs4aNa9C...

There is also a Vespa sample application (open source, Apache 2) demonstrating multiple different retrieval and ranking strategies over at https://github.com/vespa-engine/sample-apps/blob/master/msma...

thirdtrigger · on Dec 15, 2021

> I am still waiting to see an end to end API doing all this

That’s kinda the idea of Weaviate. You might like the Wikipedia demo dataset that contains all this. You indeed need to run this demo on your own infra but the whole setup (from vector DB to ML models) is containerized https://github.com/semi-technologies/semantic-search-through...

dontreact · on Dec 14, 2021

Interesting. Did you also tackle the incremental update problem with FAISS?

leobg · on Dec 15, 2021

hnswlib[1] allows for incremental updates. And I believe in terms of accuracy it stacks up fairly well against alternatives like FAISS or ScaNN.

[1]: https://github.com/nmslib/hnswlib/

freediver · on Dec 14, 2021

No, I didn't have a need for it in this demo (but is certainly possible with Faiss).

dontreact · on Dec 15, 2021

Thanks for sharing all that you have so far. I’d be super curious to learn more about how to best do incremental updates with FAISS!

pierrefermat1 · on Dec 15, 2021

Likewise here, had a look around on the Weaviate website too and couldn't find much on how they are internally handling incremental updates

thirdtrigger · on Dec 15, 2021

Did you see this blog? https://db-engines.com/en/blog_post/87

It's an interesting point tho. Maybe it's good to add this to the docs as well

gk1 · on Dec 14, 2021

Exactly right. Things like data freshness (live index updates), CRUD operations, metadata filtering, and horizontal scaling are all “extras” that don’t come with Faiss. Hence the need for solutions like Google Matching Engine and Pinecone.io.

And even if you do just want ANN and nothing else, some people just want to make API calls to a live service and not worry about anything else.

moab · on Dec 14, 2021

Can you expand more or provide a concrete example for the second point? What kind of database-like searches are you thinking about for spatial data? Things like range-queries can already be (approximately) done. Or are you thinking about relational style queries on data associated with each point?

dontreact · on Dec 14, 2021

Yes exactly, relational style queries with each data point. Maybe you have some metadata about your images and maybe you need to join against another table to properly query them. But at the same time you want to only grab the first k nearest neighbors according to vector similarity.

gk1 · on Dec 14, 2021

Pinecone does this, at least if I’m understanding your use case right: https://www.pinecone.io/docs/metadata-filtering/

And you’re right, it wasn’t easy to build.