Questions tagged [cloud-document-ai]

This tag is for the Document AI product within Google Cloud Platform.

200 questions
7
votes
1 answer

Document AI: google.api_core.exceptions.InvalidArgument: 400 Request contains an invalid argument

I am getting this error when trying to implement Document OCR from Google Cloud in Python, as explained here: https://cloud.google.com/document-ai/docs/ocr. When I run result = client.process_document(request=request) I get this…
MegaSpeed45
5
votes
4 answers

Google Document AI training fails due to an error that is already addressed

I am training a model using Google's Document AI. The training fails with the following error (I have included only a part of the JSON file for simplicity but the error is identical for all documents in my dataset): "trainingDatasetValidation": { …
Aventinus
5
votes
1 answer

Google DocumentAI Java example fails with io.grpc.StatusRuntimeException: INVALID_ARGUMENT: Request contains an invalid argument

I wasted hours trying the Google Document AI Java example from https://cloud.google.com/document-ai/docs/quickstart-client-libraries. If you enter your values for projectId, location and processorId like this: String projectId = "6493xxxxxxxx"; …
schube
5
votes
1 answer

Google Document AI giving different outputs for the same file

I was using the Document OCR API to extract text from a PDF file, but part of the output is not accurate. I found that the reason may be the presence of some Chinese characters. The following is a made-up example in which I cropped part of the region…
3
votes
2 answers

Batch job submission error "Failed to process all documents", URIs seem correct?

I've been trying to get Document AI batch submission working and having some difficulty. I have single file submission working using RawDocument and suppose I could just iterate over my data set (27k images) but chose batch since it seems like the…
Stephen
3
votes
2 answers

Firebase Deployment failure - Could not find functions.yaml. Must use http discovery

I'm trying to deploy a Firebase cloud function, but I keep getting this error. The weirdest part is that I had it working fine but switched from Firebase talking to Cloud Vision over to Firebase talking to Google documentation. And suddenly, this…
3
votes
1 answer

What argument is invalid for Google Document AI client library for Node.js?

I'm trying to run Google's Document OCR from a Node.js app, so I used the Node.js client library @google-cloud/documentai. I did everything as in the documentation sample. Here is my code: const projectId = '*******'; const location =…
3
votes
2 answers

How to read one column texts with Google Cloud Vision API

I have the following document image: When I try to convert the image to text, the result is this: Top Text Ref: Rad: Dte: Ddo: Ejecutivo 76520400300 Banco de Bogotá Luz Adriana Bottom Text. The problem is that the Google API recognizes it as two columns, so…
2
votes
1 answer

No output when running jumpstart Document OCR Processor script

I'm a beginner playing around with the Doc AI Cloud library and I was trying to run the program below. However, even after diligently following the instructions, I get no output, no error report, or anything, just another line to try again like I…
2
votes
1 answer

Removing a Document AI operation

Using the Node client of Google Document AI we want to be able to check which review operations got completed after a certain time. However, it seems it's only possible to filter on CreateTime and not on UpdateTime:…
dndr
2
votes
2 answers

Specifying the document language in Google Document AI API

I'm trying to parse a handwritten document with Google Cloud Document AI. The document contains Cyrillic characters, but Document AI occasionally detects words with Latin characters. Is there a way to specify the language of the document, so it…
Yuriy Chachora
2
votes
1 answer

Labelling multi-page documents with Google Doc AI Workbench

I'm trying to label a multi-page document to train a custom processor for Google Doc AI. The one issue that I cannot seem to figure out is how to handle places where a single field, let's say a detailed description of an item, happens to cross 2…
2
votes
2 answers

Japanese OCR for GCP Document AI custom processor

I am training the GCP Document AI custom processor for my project. It seems the processor does not recognize Japanese text at all. Is there an option to enable Japanese language support?
anonaka
2
votes
2 answers

Document AI Contract Processor - batchProcessDocuments ignores fieldMask

My aim is to reduce the size of the JSON file, which contains the base64 image sections of the documents by default. I am using the Document AI Contract Processor in the US region with the Node.js SDK. It is my understanding that setting the fieldMask attribute in…
redvivi
2
votes
1 answer

Is it possible to create labels/entity types for a Custom Document Extractor using .NET client libraries?

I have successfully set up a Custom Document Extractor processor in the Google Cloud Console as described here and was able to familiarize myself with Google Document AI workflow. I was also able to create a Document Schema with custom labels and…
Inubot