Tag Archive for vector-database, milvus

Fit token to dimension

I’m using Python with LangChain to transform .txt files into chunks, where each chunk contains 512 tokens.
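For readers who want to see what fixed-size chunking amounts to, here is a minimal sketch in plain Python. It does not use LangChain; the helper name `chunk_tokens` is hypothetical, and whitespace splitting is only an assumed stand-in for whatever tokenizer the poster actually uses.

```python
from typing import List


def chunk_tokens(tokens: List[str], chunk_size: int = 512) -> List[List[str]]:
    """Split a token list into consecutive chunks of at most chunk_size tokens."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    # Step through the list in strides of chunk_size; the last chunk may be shorter.
    return [tokens[i:i + chunk_size] for i in range(0, len(tokens), chunk_size)]


# Assumption: whitespace splitting stands in for the real tokenizer.
text = "word " * 1200
tokens = text.split()
chunks = chunk_tokens(tokens)
print([len(c) for c in chunks])  # [512, 512, 176]
```

Note that the chunk length in tokens is independent of the embedding dimension: an embedding model typically maps each chunk, whatever its length within the model's limit, to a vector of one fixed size.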

QueryNode Chunk Cache

We have a cluster deployment of Milvus on k8s, and our dataset sizes are on the order of 150 million. The querynodes are distributed across 16 replicas with 40G of memory each. The current bottleneck for us is the partition load times, and we are exploring ways to improve them. We were referring to https://milvus.io/docs/chunk_cache.md and tried setting this up in our deployment. However, we do not see any noticeable improvement in the load times, and from the logs there is no indication that the chunk cache is being loaded. There are a couple of questions we have in this area:
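One way to check whether a configuration change affects load times is to time the load call several times before and after the change. The sketch below is a generic timing helper, not part of Milvus; `time_calls` and `fake_load` are hypothetical names, and `fake_load` merely simulates a load with a short sleep.

```python
import time
from statistics import mean
from typing import Callable, List


def time_calls(fn: Callable[[], object], repeats: int = 3) -> List[float]:
    """Call fn `repeats` times and return the wall-clock duration of each call in seconds."""
    durations = []
    for _ in range(repeats):
        start = time.perf_counter()
        fn()
        durations.append(time.perf_counter() - start)
    return durations


def fake_load() -> None:
    # Assumption: stand-in for the real partition load; it only sleeps briefly.
    time.sleep(0.01)


timings = time_calls(fake_load, repeats=3)
print(f"mean load time: {mean(timings):.3f}s over {len(timings)} runs")
```

In a real deployment the callable would presumably release and then reload the partition through the client library, so that each run measures a full load rather than a no-op on already-loaded data.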

Milvus Cluster not Running with Milvus Operator

I am trying to install Milvus with the Milvus Operator on my EKS cluster. I have a running Milvus Operator pod, and its logs do not show any errors. But when I try to create a Milvus standalone cluster using this command: kubectl apply -f https://raw.githubusercontent.com/zilliztech/milvus-operator/main/config/samples/milvus_default.yaml