Google Vision OCR

The Google Vision OCR node allows you to perform Optical Character Recognition (OCR) on an image file using the Google Cloud Vision API. This task extracts text from a local image file and stores the result in a variable for use in subsequent workflow steps.

🧩 Node Type

Task

🏷️ TaskType

GoogleVisionOCR

🔧 Required Inputs

The following inputs must be provided by the user in the node configuration modal:

Label	InputType	Description
Local Full Path	text	Full path to the local image file to be processed with OCR.
Output Variable Name	text	Name of the variable that will store the recognized text from the image.

⚠️ Note: Ensure the file path points to a valid image file that exists on the server or accessible file system. Supported formats typically include .png, .jpg, and .pdf.

📤 Output

The recognized text content is returned from the Google Cloud Vision API and stored in the variable name specified in Output Variable Name.
This variable can then be referenced in downstream workflow tasks.

💡 Example Use Case

You want to extract text from a scanned invoice image to be used in a data extraction or validation task downstream.

Field	Value
Local Full Path	`C:\Automation\Input\InvoiceImage.png`
Output Variable Name	`ExtractedText`

After execution, the text content of the image is available in the ExtractedText variable.

🛑 Errors and Troubleshooting

Error Message	Meaning
Failed to get service	Google Cloud service account token retrieval failed.
Failed to get OCR	OCR API call failed to return a valid result.
Google Failure: [Exception Message]	An unexpected error occurred during the task execution.

Ensure the file path is valid and accessible and that the Google Cloud service account is correctly configured with access to the Vision API.

🔐 Service Account

This task uses a service account labeled Google Cloud Platform to authenticate with the Google Cloud Vision API.

📦 Dependencies

This node depends on the following service library components:

ImageDetection.ImageOCRDetection() from MinuteView.Google.RestService.Services
Valid Google Cloud Vision credentials with OCR access

✅ Success Criteria

The image is successfully processed by the Vision API.
The OCR text is retrieved and stored under the provided output variable name.
The task returns "Completed" status with "taskSuccess = true".

Tasks

ACC

Autodesk Vault

Azure

BlueBeam

Google Cloud

Mesh

Monday

Sharepoint

Completions

Document

General

MinuteView

Open AI

Publishers

User Management

Google Vision OCR

🧩 Node Type

🏷️ TaskType

🔧 Required Inputs

📤 Output

💡 Example Use Case

🛑 Errors and Troubleshooting

🔐 Service Account

📦 Dependencies

✅ Success Criteria

Tasks

ACC

Autodesk Vault

Azure

BlueBeam

Google Cloud

Mesh

Monday

Sharepoint

Completions

Document

General

MinuteView

Open AI

Publishers

Google Vision OCR ​

🧩 Node Type ​

🏷️ TaskType ​

🔧 Required Inputs ​

📤 Output ​

💡 Example Use Case ​

🛑 Errors and Troubleshooting ​

🔐 Service Account ​

📦 Dependencies ​

✅ Success Criteria ​

Google Vision OCR

🧩 Node Type

🏷️ TaskType

🔧 Required Inputs

📤 Output

💡 Example Use Case

🛑 Errors and Troubleshooting

🔐 Service Account

📦 Dependencies

✅ Success Criteria