This tag is for the Document AI product within Google Cloud Platform.
Questions tagged [cloud-document-ai]
200 questions
7
votes
1 answer
Document AI: google.api_core.exceptions.InvalidArgument: 400 Request contains an invalid argument
I am getting this error when trying to implement the Document OCR from google cloud in python as explained here: https://cloud.google.com/document-ai/docs/ocr
When I run
result = client.process_document(request=request)
I get this…

MegaSpeed45
- 85
- 1
- 1
- 6
5
votes
4 answers
Google Document AI training fails due to an error that is already addressed
I am training a model using Google's Document AI. The training fails with the following error (I have included only a part of the JSON file for simplicity but the error is identical for all documents in my dataset):
"trainingDatasetValidation": {
…

Aventinus
- 1,322
- 2
- 15
- 33
5
votes
1 answer
Google DocumentAI Java example fails with io.grpc.StatusRuntimeException: INVALID_ARGUMENT: Request contains an invalid argument
I wasted hours trying the Google Document AI java example from https://cloud.google.com/document-ai/docs/quickstart-client-libraries
If you enter your for projectId, location and processorId like this
String projectId = "6493xxxxxxxx";
…

schube
- 652
- 5
- 18
5
votes
1 answer
Google Document Ai giving different outputs for the same file
I was using Document OCR API to extract text from a pdf file, but part of it is not accurate. I found that the reason may be due to the existence of some Chinese characters.
The following is a made-up example in which I cropped part of the region…

iter07
- 61
- 7
3
votes
2 answers
Batch job submission error "Failed to process all documents", uris seem correct?
I've been trying to get Document AI batch submission working and having some difficulty. I have single file submission working using RawDocument and suppose I could just iterate over my data set (27k images) but chose batch since it seems like the…

Stephen
- 1,607
- 2
- 18
- 40
3
votes
2 answers
Firebase Deployment failure - Could not find functions.yaml. Must use http discovery
I'm trying to deploy a firebase cloud function, but I keep getting this error. The weirdest part is that I had it working fine but switched from firebase talking to cloud vision over to firebase talking to google documentation. And suddenly, this…

cristopher blunt
- 37
- 4
3
votes
1 answer
What argument is invalid for Google Document AI client library for Node.js?
I'm trying to run Document OCR from Google with a Node.js app.
So I used the client library for Node JavaScript @google-cloud/documentai
I did everything like in documentation sample
There is my code
const projectId = '*******';
const location =…

nikigan
- 41
- 3
3
votes
2 answers
How to read one column texts with Google Cloud Vision API
I have the next document image:
When I try to convert the image to text, the result is this:
Top Text
Ref: Rad: Dte: Ddo:
Ejecutivo 76520400300 Banco de Bogotá Luz Adriana
Bottom Text
The problem is Google API recongnize it like two columns so,…

Julian Solarte
- 555
- 6
- 29
2
votes
1 answer
No output when running jumpstart Document OCR Processor script
I'm a beginner playing around with the Doc AI Cloud library and I was trying the run the program below.
However, even after diligently following the instructions I get no output, no error report, or anything, just another line to try again like I…

Tyrone
- 21
- 3
2
votes
1 answer
Removing a Document AI operation
Using the Node client of Google Document AI we want to be able to check which review operations got completed after a certain time. However, it seems it's only possible to filter on CreateTime and not on UpdateTime:…

dndr
- 2,319
- 5
- 18
- 28
2
votes
2 answers
Specifying the document language in Google Document AI API
I'm trying to parse a handwritten document with Google Cloud Document AI. The document contains Cyrillic characters, however Document AI occasionally detects words with Latin characters. Is there way to specify the language of the document, so it…

Yuriy Chachora
- 739
- 6
- 18
2
votes
1 answer
Labelling multi page documents with Google Doc AI Workbench
I'm trying to label multi page document to train a custom processor for Google Doc AI. The one issue that I cannot seem to figure out, is how to handle places where a single field, let's say a detailed description of an item, happens to cross 2…

Jerry Kaiser
- 23
- 2
2
votes
2 answers
Japanese OCR for GCP Document AI custom processor
I am training the GCP Document AI custom processor for my project. It seems the processor does not recognize Japanese text at all. Is there an option to enable Japanese language support?

anonaka
- 85
- 8
2
votes
2 answers
Document AI Contract Processor - batchProcessDocuments ignores fieldMask
My aim is to reduce the json file size, which contains the base64 image sections of the documents by default.
I am using the Document AI - Contract Processor in US region, nodejs SDK.
It is my understanding that setting fieldMask attribute in…

redvivi
- 83
- 8
2
votes
1 answer
Is it possible to create labels/entity types for a Custom Document Extractor using .NET client libraries?
I have successfully set up a Custom Document Extractor processor in the Google Cloud Console as described here and was able to familiarize myself with Google Document AI workflow. I was also able to create a Document Schema with custom labels and…

Inubot
- 23
- 2