The GoogleVision node connects to the Google Cloud Vision API to perform image and document analysis tasks such as Optical Character Recognition (OCR) and Object Detection.
It supports both local file OCR and Google Cloud Storage (GCS)-based document and image operations.
This node allows workflows to automatically extract text from images or PDFs, detect objects within images, and feed these results into downstream nodes like AI analyzers, Excel writers, or document classifiers.
The node performs one of three actions (Image OCR, Document OCR, or Object Detection).

| Action | Description |
|---|---|
| Image OCR (local file) | Extracts text from a local image (PNG, JPG, BMP, etc.) using Google Vision OCR. |
| Document OCR (GCS pdf/tif) | Processes multi-page PDF or TIFF documents stored in Google Cloud Storage and exports text to a GCS bucket. |
| Object Detection (GCS image) | Detects and labels objects within a GCS-hosted image, including confidence scores. |

Common fields:

| Field | Type | Description | Required |
|---|---|---|---|
| ThirdParty - Google Cloud | Third-Party Token | Reference to a valid Google Cloud OAuth credential stored in MinuteView. | ✅ |
| Action | Picklist | The analysis type to perform: Image OCR (local file), Document OCR (GCS pdf/tif), or Object Detection (GCS image). | ✅ |

Fields for Image OCR (local file):

| Field | Type | Description | Required |
|---|---|---|---|
| Local Image Path | Text | Full path to the image file on the automation server. | ✅ |

Fields for Document OCR (GCS pdf/tif):

| Field | Type | Description | Required |
|---|---|---|---|
| GCS Source URI | Text | Source file URI (e.g. gs://bucket/path/document.pdf). | ✅ |
| GCS Destination Bucket | Text | Destination bucket where OCR JSON will be written. | ✅ |
| GCS Destination Prefix | Text | Destination prefix (folder) for OCR results. | ✅ |
| MIME Type Override | Text | (Optional) Custom MIME type for special formats. | ❌ |
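
The Document OCR fields above can be sanity-checked before a run. The following is a minimal sketch, not the node's own code: it assumes source URIs follow the gs://bucket/path form shown in the table and that results are written under gs://<bucket>/<prefix>/. The helper names `parse_gcs_uri` and `build_destination_uri` are hypothetical.

```python
from typing import Tuple


def parse_gcs_uri(uri: str) -> Tuple[str, str]:
    """Split a gs://bucket/object URI into (bucket, object_path)."""
    scheme = "gs://"
    if not uri.startswith(scheme):
        raise ValueError(f"not a GCS URI: {uri!r}")
    bucket, _, object_path = uri[len(scheme):].partition("/")
    if not bucket or not object_path:
        raise ValueError(f"GCS URI needs a bucket and an object path: {uri!r}")
    return bucket, object_path


def build_destination_uri(bucket: str, prefix: str) -> str:
    """Combine a destination bucket and prefix into a gs:// folder URI.

    Assumption: the prefix is treated as a folder, so a trailing slash is added.
    """
    cleaned = prefix.strip("/")
    return f"gs://{bucket}/{cleaned}/" if cleaned else f"gs://{bucket}/"
```

For example, `build_destination_uri("engineering-docs", "ocr-results/invoice123")` yields `gs://engineering-docs/ocr-results/invoice123/`.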

Fields for Object Detection (GCS image):

| Field | Type | Description | Required |
|---|---|---|---|
| GCS Image URI | Text | URI of the image stored in GCS (e.g. gs://bucket/images/photo.jpg). | ✅ |
| Confidence Threshold | Number | Minimum detection confidence between 0.0 and 1.0. Default = 0.0. | ❌ |
| Max Results | Number | Maximum number of detected objects to return. Default = all. | ❌ |
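
To illustrate how Confidence Threshold and Max Results might interact, here is a minimal sketch. The function `filter_objects` is hypothetical. Sorting by score before truncating is an assumption, since the node's actual ordering is not documented here.

```python
from typing import Dict, List, Optional, Union

Detection = Dict[str, Union[str, float]]


def filter_objects(
    objects: List[Detection],
    min_score: float = 0.0,
    max_results: Optional[int] = None,
) -> List[Detection]:
    """Keep detections scoring at least min_score, highest score first."""
    if not 0.0 <= min_score <= 1.0:
        raise ValueError("min_score must be between 0.0 and 1.0")
    kept = [obj for obj in objects if obj["score"] >= min_score]
    # Assumption: the most confident detections are the ones worth keeping.
    kept.sort(key=lambda obj: obj["score"], reverse=True)
    # max_results=None mirrors the documented default of returning all objects.
    return kept if max_results is None else kept[:max_results]
```

With the defaults (threshold 0.0, no maximum), every detection is returned, which matches the defaults listed in the table.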

| Output Variable | Type | Description |
|---|---|---|
| out | Object / String | The raw or structured output of the selected action. |
| taskMessage | String | Message describing the outcome. |
| statusReturn | String | Completed on success or Fail on error. |
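
A downstream step might consume these variables along the following lines. This is a minimal sketch, assuming the three variables are delivered together as a single JSON object. `read_result` is a hypothetical helper, not part of MinuteView.

```python
import json
from typing import Any


def read_result(payload: str) -> Any:
    """Return the 'out' value, or raise if the node reported a failure."""
    result = json.loads(payload)
    if result.get("statusReturn") != "Completed":
        # taskMessage carries the description of what went wrong.
        raise RuntimeError(result.get("taskMessage", "GoogleVision task failed"))
    return result["out"]


def describe(out: Any) -> str:
    """'out' may be plain text (Image OCR) or a structured object."""
    if isinstance(out, str):
        return f"text with {len(out.splitlines())} line(s)"
    return f"object with keys: {', '.join(sorted(out))}"
```

Checking `statusReturn` before touching `out` keeps a failed run from feeding empty or partial data to later nodes.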

Example output for Image OCR (local file):

{
  "out": "Valve No. 204-B\nPressure Rating: 25 bar\nLast Service: 2024-08-05",
  "taskMessage": "Image OCR (local file) completed successfully",
  "statusReturn": "Completed"
}

Example output for Document OCR (GCS pdf/tif):

{
  "out": {
    "Source": "gs://engineering-docs/invoices/invoice123.pdf",
    "Output": "gs://engineering-docs/ocr-results/invoice123/",
    "Status": "QueuedOrCompleted",
    "Result": "Operation-123456789"
  },
  "taskMessage": "Document OCR request submitted successfully",
  "statusReturn": "Completed"
}

Example output for Object Detection (GCS image):

{
  "out": {
    "Image": "gs://project-assets/inspection/site_photo.jpg",
    "MinScore": 0.6,
    "MaxResults": 10,
    "Objects": [
      { "name": "Hardhat", "score": 0.92 },
      { "name": "Person", "score": 0.87 },
      { "name": "Excavator", "score": 0.85 }
    ]
  },
  "taskMessage": "Object Detection completed successfully",
  "statusReturn": "Completed"
}

| Scenario | Description |
|---|---|
| 🔎 Drawing OCR | Extract text and dimensions from scanned PDFs or TIFF drawings. |
| 📄 Document Digitization | Read legacy engineering documents and export OCR text into databases. |
| 🧰 Object Recognition | Automatically tag and classify images (e.g., identify equipment, safety gear, or site conditions). |
| 🧾 Invoice OCR Pipeline | Read invoice PDFs from a GCS bucket, parse text via OCR, and load results into SharePoint or SQL. |

| Step | Action |
|---|---|
| 1 | Validates the selected action type. |
| 2 | Retrieves Google Cloud third-party credentials. |
| 3 | Executes the appropriate Vision API function. |
| 4 | Processes and filters the response. |
| 5 | Returns structured results or GCS reference URIs. |
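
The five steps above can be sketched as a small dispatcher. This is an illustration under stated assumptions, not the node's implementation: `run_google_vision`, the `handlers` mapping, the failure message wording, and `out` being `None` on failure are all hypothetical. The Vision API call and response filtering (steps 3 and 4) are delegated to whatever handler is supplied.

```python
from typing import Any, Callable, Dict

# Action names as listed in the Action picklist.
ACTIONS = (
    "Image OCR (local file)",
    "Document OCR (GCS pdf/tif)",
    "Object Detection (GCS image)",
)

Handler = Callable[[str, Dict[str, Any]], Any]


def _fail(message: str) -> Dict[str, Any]:
    # Assumption: 'out' is empty when the task fails.
    return {"out": None, "taskMessage": message, "statusReturn": "Fail"}


def run_google_vision(
    action: str,
    credential: str,
    params: Dict[str, Any],
    handlers: Dict[str, Handler],
) -> Dict[str, Any]:
    # Step 1: validate the selected action type.
    if action not in ACTIONS:
        return _fail(f"Unknown action: {action}")
    # Step 2: make sure a Google Cloud credential is available.
    if not credential:
        return _fail("Google Cloud credential is missing")
    # Steps 3-4: call the matching handler, which runs and filters the request.
    try:
        out = handlers[action](credential, params)
    except Exception as exc:  # any handler error becomes a Fail result
        return _fail(f"{action} failed: {exc}")
    # Step 5: return structured results (or GCS reference URIs).
    return {
        "out": out,
        "taskMessage": f"{action} completed successfully",
        "statusReturn": "Completed",
    }
```

Returning a result object on every path, instead of raising, mirrors the documented behaviour of reporting failures through `taskMessage` and `statusReturn`.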

Object Detection results are filtered by Confidence Threshold and Max Results.

If the task fails, a clear error message is logged and returned. Common error causes include:

graph LR
  A[Get File From SharePoint] --> B["GoogleVision (Image OCR)"]
  B --> C[Extract Keywords]
  C --> D[Add Metadata to Vault]

Category: AI & Google Cloud
Task Name: GoogleVision