Questions tagged [chromadb]

69 questions
8
votes
4 answers

LangChain Chroma - load data from Vector Database

I have written LangChain code that uses Chroma DB to store data from a website URL in a vector store. It currently fetches the data from the URL, stores it in the project folder, and then uses that data to respond to a user prompt. I figured out how to…
max choate
  • 81
  • 1
  • 3
4
votes
2 answers

ChromaDb add single document, only if it doesn't exist

I'm working with langchain and ChromaDb using python. Now, I know how to use document loaders. For instance, the below loads a bunch of documents into ChromaDb: from langchain.embeddings.openai import OpenAIEmbeddings embeddings =…
user791793
  • 413
  • 1
  • 6
  • 19
4
votes
1 answer

Get all documents from ChromaDb using Python and langchain

I'm using langchain to process a whole bunch of documents which are in a Mongo database. I can load all documents fine into the chromadb vector storage using langchain. Nothing fancy being done here. This is my code: from…
user791793
  • 413
  • 1
  • 6
  • 19
3
votes
1 answer

streamlit: Your system has an unsupported version of sqlite3. Chroma requires sqlite3 >= 3.35.0

I previously deployed an app on Streamlit Cloud that utilized chromadb. The app worked fine in the past. However, today I encountered a new error (as indicated in the title) and the app has stopped functioning. I attempted to troubleshoot based on…
Xiang
  • 230
  • 2
  • 10
3
votes
0 answers

Dynamically add more embedding of new document in chroma DB - Langchain

I have created a retrieval QA chain which uses chromadb as the vector DB for storing embeddings of the "abc.txt" file. What if I want to dynamically add more document embeddings from, let's say, another file "def.txt"? How do I do that? I don't want to reload the…
Jason
  • 676
  • 1
  • 12
  • 34
3
votes
1 answer

How to deploy chroma database (vector database) in production

I am working on a project where I want to save the embeddings in a vector database. I need some help or resources to deploy Chroma DB for production use.
ankitmhjn5
  • 89
  • 5
3
votes
5 answers

hnswlib package issue while installing chromadb in ubuntu

I am using Ubuntu 20.04 (focal) and trying to install chromadb by using 'pip install chromdb', but I am getting the following error: Building wheel for hnswlib (pyproject.toml) ... error error: subprocess-exited-with-error × Building wheel for…
Sinister _
  • 31
  • 1
  • 3
3
votes
1 answer

Losing data of vector store Chromadb using Langchain

I am writing a question-answering bot using langchain. For storing my data in a database, I have chosen Chromadb. I have written the code below and it works fine: persist_directory = "./db" embeddings = OpenAIEmbeddings() vectordb =…
kheirq
  • 31
  • 2
3
votes
1 answer

LangChain, chromaDB Chroma.fromDocuments returns TypeError: Cannot read properties of undefined (reading 'data')

I am running a LangChain process on a local Node server. In my code: // Create docs with a loader const loader = new TextLoader("Documentation/hello.txt"); const docs = await loader.load(); // Create vector store and index the docs const…
qYUUU
  • 33
  • 1
  • 5
2
votes
1 answer

AttributeError: module 'chromadb' has no attribute 'config'

I recently started to work with chromadb and I am facing this error: "module 'chromadb' has no attribute 'config'". Here is my code: from langchain.vectorstores import Chroma from sentence_transformers import SentenceTransformer model…
2
votes
1 answer

How to run multiprocess Chroma.from_documents() in Langchain

Can we somehow pass an option to run multiple threads/processes when we call Chroma.from_documents() in Langchain? I am trying to embed 980 documents (embedding model is mpnet on CUDA), and it takes forever. Specs: Software: Ubuntu 20.4 (on Win11…
Paris Char
  • 477
  • 4
  • 17
2
votes
1 answer

Getting hnswlib error: 'Index' has no attribute 'file_handle_count'

I am trying to use chromadb, which is giving me this error: c:\Users\satya\AppData\Local\Programs\Python\Python39\lib\site-packages\chromadb\config.py in instance(self, type) hnswlib_count = hnswlib.Index.file_handle_count hnswlib_count = cast(int,…
Satya
  • 8,693
  • 5
  • 34
  • 55
2
votes
1 answer

Limit tokens per minute in LangChain, using OpenAI-embeddings and Chroma vector store

I am looking for a way to limit the tokens per minute when saving embeddings in a Chroma vector store. Here is my code: [...] # split the documents into chunks text_splitter = CharacterTextSplitter(chunk_size=1500, chunk_overlap=0) texts =…
Heka
  • 73
  • 1
  • 8
2
votes
1 answer

How do you catch the duplicate id error when using langchain.vectorstores.Chroma.from_documents()

I use the following line to add langchain documents to a chroma database: Chroma.from_documents(docs, embeddings, ids=ids, persist_directory='db'). When ids are duplicates, I get this error: chromadb.errors.IDAlreadyExistsError. How do I catch the…
Suibhne
  • 21
  • 2
1
vote
0 answers

HuggingFace Space with Langchain and Gradio SDK: Why am I getting a FAISS-related error when I'm using Chroma, not FAISS?

I am building a HuggingFace Space with Langchain (Gradio SDK) to chat with my data, cloning from Harrison Chase's Chat Your Data space and going from there. Fixed a deprecation issue (see Discussion), switched to a DirectoryLoader so I can ingest…
Leanna
  • 107
  • 9