0

I'm making some progress:) developing my litle OCR Project. I was wondering if my idea is possible in this case!

After extracting the Text from a Images (ocr), I use nlp (spacy) to identify two Entities (LOCation and PERson). I write to a Dictionary and later in a JSON Data. That works good.

Now I'm wondering if I can improve my identified Entities. One way I can imagine is to use the right Language Model for the text. I have varies Texts in German, English,Spanish and French. At the moment I'm using the But now I have no idea how to put langdetect into this

Have a great week! Greets

General Grievance
  • 4,555
  • 31
  • 31
  • 45

1 Answers1

0

Here is a link that you might find useful when it comes to detecting a language (There are multiple options including langdetect) - How to detect language

You can create a dictionary with the languages you plan to detect and match it with langdetect's output. I guess you have the rest sorted out.