I am using "llama-2-7b-chat.ggmlv3.q2_K.bin" (downloaded from Hugging Face) with "LlamaCpp()" in LangChain. The log line "Llama.generate: prefix-match hit" repeats many times, and the model keeps answering itself, generating follow-up questions and answers on its own. I want it to produce a single answer and then stop. How can I configure it to generate the answer only once?
I am using "LlamaCpp()" to load the model and "RetrievalQA" to retrieve answers.
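For context, here is a minimal sketch of the kind of setup I mean (the model path, the stop sequences, and the retriever wiring are placeholders, not my exact code), together with a pure-Python illustration of what a stop-sequence list would do to the generated text:

```python
# Hypothetical setup sketch (placeholders, not exact code); the LangChain
# calls are shown as comments because they need the model file to run:
#
#   from langchain.llms import LlamaCpp
#   from langchain.chains import RetrievalQA
#
#   llm = LlamaCpp(
#       model_path="llama-2-7b-chat.ggmlv3.q2_K.bin",
#       # Stop sequences are meant to end generation after the first answer,
#       # instead of the model continuing the dialogue with itself:
#       stop=["Human:", "Question:"],
#       max_tokens=256,
#   )
#   qa = RetrievalQA.from_chain_type(llm=llm, retriever=retriever)

# What a stop list does, in pure Python: generation is cut at the first
# occurrence of any stop sequence, so only the first answer survives.
def cut_at_stop(text: str, stops: list[str]) -> str:
    cut = len(text)
    for s in stops:
        i = text.find(s)
        if i != -1:
            cut = min(cut, i)
    return text[:cut].rstrip()

raw = "The capital is Paris.\nHuman: Are you sure?\nAssistant: Yes."
print(cut_at_stop(raw, ["Human:", "Question:"]))  # -> "The capital is Paris."
```

Is passing stop sequences like this the right way to make the chain answer only once, or is there another setting I am missing?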