Questions tagged [chromadb]
69 questions
8
votes
4 answers
LangChain Chroma - load data from Vector Database
I have written LangChain code that uses Chroma DB as a vector store for data from a website URL. It currently fetches the data from the URL, stores it in the project folder, and then uses that data to respond to a user prompt. I figured out how to…

max choate
- 81
- 1
- 3
4
votes
2 answers
ChromaDb add single document, only if it doesn't exist
I'm working with LangChain and ChromaDb using Python.
Now, I know how to use document loaders. For instance, the below loads a bunch of documents into ChromaDb:
from langchain.embeddings.openai import OpenAIEmbeddings
embeddings =…

user791793
- 413
- 1
- 6
- 19
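A minimal sketch of one way to add documents only when their ids are not already stored. It assumes a Chroma collection object whose `get(ids=...)` returns a dict with an `"ids"` key listing only the ids that were found, and whose `add(ids=..., documents=...)` stores new entries. The helper name `add_missing_documents` is hypothetical, not part of chromadb or LangChain.

```python
def add_missing_documents(collection, ids, documents):
    """Add only documents whose ids are not already in the collection.

    `collection` is assumed to behave like a chromadb collection:
    get(ids=[...]) -> {"ids": [...], ...} and add(ids=[...], documents=[...]).
    Returns the list of ids that were actually added.
    """
    # Ask the store which of the requested ids it already holds.
    seen = set(collection.get(ids=list(ids))["ids"])

    new_ids, new_docs = [], []
    for doc_id, doc in zip(ids, documents):
        if doc_id in seen:
            continue  # already stored, or repeated earlier in this call
        seen.add(doc_id)
        new_ids.append(doc_id)
        new_docs.append(doc)

    # Skip the add() call entirely when there is nothing new.
    if new_ids:
        collection.add(ids=new_ids, documents=new_docs)
    return new_ids
```

Filtering before the write avoids relying on a duplicate-id exception, at the cost of one extra lookup per call.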
4
votes
1 answer
Get all documents from ChromaDb using Python and langchain
I'm using LangChain to process a whole bunch of documents which are in a Mongo database.
I can load all documents fine into the chromadb vector storage using langchain. Nothing fancy being done here. This is my code:
from…

user791793
- 413
- 1
- 6
- 19
3
votes
1 answer
streamlit: Your system has an unsupported version of sqlite3. Chroma requires sqlite3 >= 3.35.0
I previously deployed an app on Streamlit Cloud that utilized chromadb.
The app worked fine in the past. However, today I encountered a new error (as indicated in the title) and the app has stopped functioning.
I attempted to troubleshoot based on…

Xiang
- 230
- 2
- 10
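A quick check, using only the standard library, of whether the SQLite build linked into the running Python interpreter meets the 3.35.0 minimum quoted in the error message. The helper name `sqlite_is_new_enough` is made up for illustration.

```python
import sqlite3

# Minimum version quoted in the error message from the question title.
REQUIRED = (3, 35, 0)


def sqlite_is_new_enough(version_info=None, required=REQUIRED):
    """Return True if the given (or linked) SQLite version meets `required`."""
    if version_info is None:
        # Version of the SQLite library the interpreter is linked against,
        # not the version of the Python sqlite3 module itself.
        version_info = sqlite3.sqlite_version_info
    return tuple(version_info) >= tuple(required)


print("linked SQLite:", sqlite3.sqlite_version)
print("meets the 3.35.0 requirement:", sqlite_is_new_enough())
```

Running this in the deployment environment shows whether the hosted runtime, rather than the app code, is the source of the error.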
3
votes
0 answers
Dynamically add more embedding of new document in chroma DB - Langchain
I have created a retrieval QA chain which uses chromadb as the vector DB for storing embeddings of the "abc.txt" file. What if I want to dynamically add embeddings for another document, say "def.txt"? How do I do that? I don't want to reload the…

Jason
- 676
- 1
- 12
- 34
3
votes
1 answer
How to deploy chroma database (vector database) in production
I am working on a project where I want to save the embeddings in a vector database.
I need some help or resources for deploying Chroma DB for production use.

ankitmhjn5
- 89
- 5
3
votes
5 answers
hnswlib package issue while installing chromadb in ubuntu
I am using Ubuntu 20.04 (Focal) and trying to install chromadb using 'pip install chromadb', but I am getting the following error:
Building wheel for hnswlib (pyproject.toml) ... error
error: subprocess-exited-with-error
× Building wheel for…

Sinister _
- 31
- 1
- 3
3
votes
1 answer
Losing data of vector store Chromadb using Langchain
I am writing a question-answering bot using langchain. For storing my data in a database, I have chosen Chromadb. I have written the code below and it works fine
persist_directory = "./db"
embeddings = OpenAIEmbeddings()
vectordb =…

kheirq
- 31
- 2
3
votes
1 answer
LangChain, chromaDB Chroma.fromDocuments returns TypeError: Cannot read properties of undefined (reading 'data')
I am running a LangChain process on a local Node server.
In my code:
// Create docs with a loader
const loader = new TextLoader("Documentation/hello.txt");
const docs = await loader.load();
// Create vector store and index the docs
const…

qYUUU
- 33
- 1
- 5
2
votes
1 answer
AttributeError: module 'chromadb' has no attribute 'config'
I recently started working with chromadb and I am facing this error:
"module 'chromadb' has no attribute 'config'"
here is my code:
from langchain.vectorstores import Chroma
from sentence_transformers import SentenceTransformer
model…

Fahad Ali Yousaf
- 21
- 1
2
votes
1 answer
How to run multiprocess Chroma.from_documents() in Langchain
Can we somehow pass an option to run multiple threads/processes when we call Chroma.from_documents() in Langchain?
I am trying to embed 980 documents (embedding model is mpnet on CUDA), and it takes forever.
Specs:
Software: Ubuntu 20.4 (on Win11…

Paris Char
- 477
- 4
- 17
2
votes
1 answer
Getting hnswlib error: 'Index' has no attribute 'file_handle_count'
I am trying to use chromadb, which is giving me this error:
c:\Users\satya\AppData\Local\Programs\Python\Python39\lib\site-packages\chromadb\config.py in instance(self, type)
hnswlib_count = hnswlib.Index.file_handle_count
hnswlib_count = cast(int,…

Satya
- 8,693
- 5
- 34
- 55
2
votes
1 answer
Limit tokens per minute in LangChain, using OpenAI-embeddings and Chroma vector store
I am looking for a way to limit the tokens per minute when saving embeddings in a Chroma vector store. Here is my code:
[...]
# split the documents into chunks
text_splitter = CharacterTextSplitter(chunk_size=1500, chunk_overlap=0)
texts =…

Heka
- 73
- 1
- 8
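One possible approach to a tokens-per-minute limit, sketched with the standard library only: group the chunks into batches that stay under a token budget, then pause between batches. The 4-characters-per-token estimate is a rough assumption, and `add_batch` is a placeholder for whatever call writes a list of texts to the store (for example a vector store's `add_texts` method). All function names here are hypothetical.

```python
import time


def estimate_tokens(text):
    # Rough heuristic (assumption): about 4 characters per token.
    # Swap in a real tokenizer for accurate counts.
    return max(1, len(text) // 4)


def batch_by_token_budget(texts, max_tokens_per_batch, count_tokens=estimate_tokens):
    """Yield lists of texts whose estimated token total fits the budget.

    A single text larger than the budget is still yielded alone.
    """
    batch, used = [], 0
    for text in texts:
        cost = count_tokens(text)
        if batch and used + cost > max_tokens_per_batch:
            yield batch
            batch, used = [], 0
        batch.append(text)
        used += cost
    if batch:
        yield batch


def add_with_rate_limit(texts, add_batch, tokens_per_minute, sleep=time.sleep):
    """Call add_batch once per batch, waiting 60 seconds between batches."""
    for i, batch in enumerate(batch_by_token_budget(texts, tokens_per_minute)):
        if i > 0:
            sleep(60)  # wait out the current one-minute window
        add_batch(batch)
```

Sleeping a fixed 60 seconds is conservative. A tighter version would track elapsed time per window instead.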
2
votes
1 answer
How do you catch the duplicate id error when using langchain.vectorstores.Chroma.from_documents()
I use the following line to add LangChain documents to a Chroma database: Chroma.from_documents(docs, embeddings, ids=ids, persist_directory='db')
When ids are duplicated, I get this error: chromadb.errors.IDAlreadyExistsError
How do I catch the…

Suibhne
- 21
- 2
1
vote
0 answers
HuggingFace Space with Langchain and Gradio SDK: Why am I getting a FAISS-related error when I'm using Chroma, not FAISS?
I am building a HuggingFace Space with LangChain (Gradio SDK) to chat with my data, cloning from Harrison Chase's Chat Your Data space and going from there. Fixed a deprecation issue (see Discussion), switched to a DirectoryLoader so I can ingest…

Leanna
- 107
- 9