Search K
Appearance
Appearance
The Mesh Indexer Sharepoint node indexes documents and metadata from a Microsoft SharePoint Document Library into the Mesh platform using a configurable indexer pipeline. This task allows selective or full indexing of SharePoint documents, including support for full content extraction, text vectorization (embeddings), OCR, and thumbnails.
This node enables users to connect to a SharePoint Online source and index records into an ElasticSearch-backed Mesh instance. It supports full indexing, incremental updates based on a date filter, or indexing of a single specified document.
The modal form includes the following input fields to configure the node:
Label | Description |
---|---|
Select Index | The name of the index to target (e.g., documents-sharepoint ). |
Index Action | Choose the indexing mode: - Full Index - Update From Date - Update Single Record |
ServiceAccount-OpenAI | The OpenAI service account token (used for embeddings). |
ServiceAccount-Elastic Search | The ElasticSearch service account token. |
ServiceAccount-Microsoft Azure | The Microsoft Azure (SharePoint) service account token. |
Label | Description |
---|---|
Include Text Vectorization | Enable text vector embedding using OpenAI. |
Include Full Text Content | Extract and include full content from documents. |
Include Thumbnail | Include a thumbnail preview of the document (if available). |
Include OCR | Apply Optical Character Recognition (OCR) to scanned documents. |
Document ID | (Used in Update Single Record mode) Specify the SharePoint document ID to reindex. |
Start Date | (Used in Update From Date mode) Only index documents modified after this date. |
White List Folders | Optional list of folder paths to restrict indexing scope. |
Record Title | Optional format string for the record title (e.g., "{FileName} - {Modified}" ). |
Sub Title | Optional format string for the subtitle/description. |
Domain | SharePoint tenant domain (e.g., yourtenant.sharepoint.com ). |
Site Name | The name of the SharePoint Site (e.g., project-site ). |
Document Library | The specific SharePoint library name (e.g., Documents , Shared Documents ). |
SPDataSource
object for SharePoint access.ItemQuery
to filter the SharePoint content based on mode and inputs.ItemIndexer
, processes the SharePoint items, and pushes the data into the Mesh index.On success:
statusReturn
: Completed
taskMessage
: Indexing Completed Successfully
taskSuccess
: true
On failure:
statusReturn
: Fail
taskMessage
: Includes error messagetaskSuccess
: false
Ensure the configured service accounts have appropriate API permissions:
This node is optimized for SharePoint Online, not on-premises deployments.
OCR may significantly increase processing time for image-based or scanned documents.