I would like your opinion on storing vector IDs together with their metadata. I want to use multiple processes to add vectors to a FAISS index in real time, and I need a store that maps each FAISS vector ID to its corresponding UUID. One UUID may have several distinct vectors. Insertion has to happen in real time, and looking up the UUID for a given FAISS vector ID must be as fast as possible. Previously, when the data was static and there were about 2M vectors, I used NumPy arrays to map the IDs to their UUIDs. This time, because of the real-time insertion and because there could be billions of vectors, I am looking for a more practical way to store the mapping. What do you think would be the best way to store this data? For example, the data might look like this:
- NZr9xeI0gu - [524.6, 5.42, 7452.1, ..., 124.6]
- NZr9xeI0gu - [10.8, 7.02, 300.6, ..., 785.0]
- NZr9xeI0gu - [485.0, 504.0, 243.0, ..., 5.09]
- GrM4dtQykW - [894.0, 444.0, 0.00, ..., 411.00]
- GrM4dtQykW - [9.0, 845.0, 243.0, ..., 850.79]
- VsgCjTNHxm - [0, 174.0, 6.0, ..., 954.55]
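
To make the requirement concrete, here is a minimal sketch of the kind of ID-to-UUID mapping described above. It is only one possible shape, not a recommendation: it uses Python's built-in `sqlite3` as a stand-in store, and the class `UuidStore`, the table `id_map`, and the method names are all hypothetical. The assumption is that the store hands out the integer IDs and that those IDs are then passed to FAISS explicitly (for example through `add_with_ids` on an index that supports it), so that several writer processes never depend on insertion order. Whether SQLite itself would hold up with billions of rows and many concurrent writers is not something this sketch tries to answer.

```python
import sqlite3


class UuidStore:
    """Hypothetical helper: maps integer vector IDs to UUID strings."""

    def __init__(self, path=":memory:"):
        self.conn = sqlite3.connect(path)
        # vector_id is allocated by the database, so each ID is unique.
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS id_map ("
            "vector_id INTEGER PRIMARY KEY AUTOINCREMENT, "
            "uuid TEXT NOT NULL)"
        )
        # Secondary index for the reverse lookup (UUID -> all of its vector IDs).
        self.conn.execute("CREATE INDEX IF NOT EXISTS idx_uuid ON id_map(uuid)")

    def reserve_ids(self, uuid, count):
        """Allocate `count` new vector IDs for one UUID in a single transaction."""
        ids = []
        with self.conn:  # commits on success, rolls back on error
            for _ in range(count):
                cur = self.conn.execute(
                    "INSERT INTO id_map (uuid) VALUES (?)", (uuid,)
                )
                ids.append(cur.lastrowid)
        return ids

    def uuid_for(self, vector_id):
        """Return the UUID for a vector ID, or None if the ID is unknown."""
        row = self.conn.execute(
            "SELECT uuid FROM id_map WHERE vector_id = ?", (vector_id,)
        ).fetchone()
        return row[0] if row else None

    def ids_for(self, uuid):
        """Return every vector ID stored for a UUID, in ascending order."""
        rows = self.conn.execute(
            "SELECT vector_id FROM id_map WHERE uuid = ? ORDER BY vector_id",
            (uuid,),
        )
        return [r[0] for r in rows]


store = UuidStore()
first = store.reserve_ids("NZr9xeI0gu", 3)   # three vectors for one UUID
second = store.reserve_ids("GrM4dtQykW", 2)  # two vectors for another
# The reserved IDs would then be given to FAISS together with the vectors.
print(store.uuid_for(second[0]))
```

With the sample data above, the three vectors of `NZr9xeI0gu` would each get their own integer ID, and a search result returning any of those IDs resolves back to the same UUID through `uuid_for`.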