Milvus (vector database)


Milvus is a distributed vector database developed by Zilliz. It is available as both open-source software and a cloud service called Zilliz Cloud.
Milvus is an open-source project under the LF AI & Data Foundation and is distributed under the Apache License 2.0.

History

Milvus has been developed by Zilliz since 2017.
Milvus joined Linux Foundation as an incubation project in and became a graduate in. The details about its architecture and possible applications were presented at ACM SIGMOD Conference in 2021.
Milvus 2.0, a major redesign of the whole product with a new architecture, was released in.

Features

Similarity search

Various similarity search-related features are available in Milvus:
Milvus' similarity search engine relies on heavily-modified forks of third-party open-source similarity search libraries, such as Faiss, DiskANN and hnswlib.
Milvus includes optimizations for I/O data layout, specific to graph search indices.

Database

As a database, Milvus provides the following features:
Milvus can be deployed as an embedded database, standalone server, or distributed cluster. Zilliz Cloud offers a fully managed version.

GPU support

Milvus provides GPU accelerated index building and search using Nvidia CUDA technology via the Nvidia cuVS library, including a recent GPU-based graph indexing algorithm known as CAGRA.

Integration

Milvus provides official SDK clients for Java, NodeJS, Python and Go. An additional C# SDK client was contributed by Microsoft. The database can integrate with DataDog, Prometheus and Grafana for monitoring and alerts, as well as generative AI frameworks Haystack, LangChain, IBM Watsonx, and those provided by OpenAI.
Several storage providers have built integrations with Milvus to support AI workloads and large-scale vector search. These integrations aim to optimize performance, simplify inferencing workflows, and enhance data management capabilities:
  • Pure Storage
  • Cloudian
  • Weka.io
  • '''DDN'''