Skip to content

Google Vision OCR

The Google Vision OCR node allows you to perform Optical Character Recognition (OCR) on an image file using the Google Cloud Vision API. This task extracts text from a local image file and stores the result in a variable for use in subsequent workflow steps.


🧩 Node Type

Task


🏷️ TaskType

GoogleVisionOCR


🔧 Required Inputs

The following inputs must be provided by the user in the node configuration modal:

LabelInputTypeDescription
Local Full PathtextFull path to the local image file to be processed with OCR.
Output Variable NametextName of the variable that will store the recognized text from the image.

⚠️ Note: Ensure the file path points to a valid image file that exists on the server or accessible file system. Supported formats typically include .png, .jpg, and .pdf.


📤 Output

  • The recognized text content is returned from the Google Cloud Vision API and stored in the variable name specified in Output Variable Name.
  • This variable can then be referenced in downstream workflow tasks.

💡 Example Use Case

You want to extract text from a scanned invoice image to be used in a data extraction or validation task downstream.

FieldValue
Local Full PathC:\Automation\Input\InvoiceImage.png
Output Variable NameExtractedText

After execution, the text content of the image is available in the ExtractedText variable.


🛑 Errors and Troubleshooting

Error MessageMeaning
Failed to get serviceGoogle Cloud service account token retrieval failed.
Failed to get OCROCR API call failed to return a valid result.
Google Failure: [Exception Message]An unexpected error occurred during the task execution.

Ensure the file path is valid and accessible and that the Google Cloud service account is correctly configured with access to the Vision API.


🔐 Service Account

This task uses a service account labeled Google Cloud Platform to authenticate with the Google Cloud Vision API.


📦 Dependencies

This node depends on the following service library components:

  • ImageDetection.ImageOCRDetection() from MinuteView.Google.RestService.Services
  • Valid Google Cloud Vision credentials with OCR access

✅ Success Criteria

  • The image is successfully processed by the Vision API.
  • The OCR text is retrieved and stored under the provided output variable name.
  • The task returns "Completed" status with "taskSuccess = true".

Tentech 2024