Inter-Cloud Data Monitor
Disclaimer
Your use of this download is governed by Stonebranch’s Terms of Use, which are available at https://www.stonebranch.com/integration-hub/Terms-and-Privacy/Terms-of-Use/
Overview
This Universal Extension is an interface to Rclone that provides the capability to monitor files/directories from across different cloud storages, as well as local or distributed file systems. Additionally and upon successful monitor, this extension publishes Local Universal Events. To properly handle these events and take action on them, this Universal Extension optionally can be attached as a Publisher to a Universal Monitor Task (see more details in section 150079617).
Rclone, is the open source command line program, that is utilized to accomplish all the actions supported in this Extension.
Version Information
Template Name | Extension Name | Extension Version |
---|---|---|
Cloud Data Monitor | ue-cloud-dm | 1.0.1 |
Refer to Changelog for version history information.
Software Requirements
This integration requires a Universal Agent, a Python runtime to execute the Universal Task, and a Rclone executable.
Software Requirements for Universal Template and Universal Task
Requires Python 3.7.0 or higher. Tested with the Universal Agent bundled Python distribution.
Software Requirements for Universal Agent
Both Windows and Linux agents are supported.
- Universal Agent for Windows x64 Version 7.2.0.0 and later with python options installed.
- Universal Agent for Linux Version 7.2.0.0 and later with python options installed.
Software Requirements for Universal Controller
Universal Controller Version 7.2.0.0 and later.
Software Requirements for Rclone
Rclone needs to be installed on the same server where the Universal Agent is installed. Rclone binary should be stored on a location where Universal Agent has access and permissions to execute it.
This Universal Extension has been tested with Rclone v1.58.1. It should be working with later Rclone versions, as long as these are backwards compatible.
Network and Connectivity Requirements
Universal Agent should be able to establish connection with:
- The remote storages that are provided as input for the Universal Task.
- The Universal Controller host. This is required only in case Refresh Storage Credentials field is used.
Key Features
This Universal Extension supports the following key features:
- Actions
- Monitor objects creation
- Monitor objects change - change based on modification time
- Monitor objects deletion
- Options
- Trigger On Existence - available for Action
Monitor On Create
. - Advanced filtering capability for objects to be monitored.
- Support for providing additional Rclone options according to the user needs.
- Trigger On Existence - available for Action
- Universal Events
- Event published upon single object creation, holding information about the new object.
- Event published upon single object change, holding information about the updated file including the latest object size and modification time.
- Event published upon single object deletion, holding information about the deleted object.
- Event published upon single object existence, holding information about the object.
Integrated Storage Systems
Rclone is integrated with multiple storage systems. This Universal Extension has been tested against the following Storage Systems.
- Amazon S3
- Google Cloud Storage
- Microsoft OneDrive Business
- Local file system (Linux, Windows)
This integration should work properly against other Storage Systems, as long as the integration interface (see chapter Configure Universal Task) and functionalities listed in this document fulfil the needs of the Storage System.
For functionalities required for specific Storage Systems, users and customers are encouraged to open a Feature Request in our Customer Support Portal.
Import Universal Template
To use the Universal Template, you first must perform the following steps.
This Universal Task requires the Resolvable Credentials feature. Check that the Resolvable Credentials Permitted system property has been set to true.
To import the Universal Template into your Controller, follow these instructions.
When the files have been imported successfully, refresh the Universal Templates list; the Universal Template will appear on the list.
Modifications of this integration, applied by users or customers, before or after import, might affect the supportability of this integration. For more information refer to Integration Modifications.
Configure Universal Task
To configure a new Universal Task, there are three steps required:
- Create required Resolvable Credentials. Required as Input Fields on the Universal Task Configuration.
- Create a new Script of type Data, for the Configuration File and populate it according to the following section of Setup Rclone Configuration File. Required as Input Fields on the Universal Task Configuration.
- Create a new task, and enter the task-specific details that were created in the Universal Template.
Setup Rclone Configuration File
The configuration file is following the INI format, and contains all required parameters and credentials to connect to a Storage System. All configuration options per Storage System can be found in the respective "Config Rclone documentation" .
It is advised that account credentials, tokens, or any other essential information to be passed as an encrypted Resolvable Credential field. To set a value with the respective Credential field, please make sure to use the appropriate Credential function inside the configuration file.
'Runtime Password' Credentials field supports values with up to 512 characters.
Hard coded values can be provided in the script file, as well. However, it is not advised for security reasons.
It is recommended to configure a storage system section, one to one with the Credential fields in this Universal Extension (please refer to the Input Fields section below).
For local file system Storages, no credentials are required in the configuration file. The selected action will be executed with the Credentials that are provided in the "Agent Details" of the Universal Task. If no Credentials are provided for the used Agent, then the same user that runs ubroker daemon will be used.
Script Type should be Data, and option Resolve UAC Variables should be enabled.
The indicative Configuration File below, contains the basic connection parameters (flags):
Utilize the built-in Credential Functions, in order to properly resolve the associated Universal Task Credentials.
[ue_aws_s3]
type = s3
provider = AWS
access_key_id = ${_credentialUser('${ops_ue_cloud_dm_storage_credentials}')}
secret_access_key = ${_credentialPwd('${ops_ue_cloud_dm_storage_credentials}')}
region = us-east-2
[google_cloud_storage]
type = google cloud storage
service_account_file = ${_credentialPwd('${ops_ue_cloud_dm_storage_credentials}')}
object_acl = bucketOwnerFullControl
project_number = johnprojectno
location = europe-west3
[one_drive]
type = onedrive
token = ${_credentialToken('${ops_ue_cloud_dm_storage_credentials}')}
drive_id = ${_credentialUser('${ops_ue_cloud_dm_storage_credentials}')}
drive_type = business
auth_url = https://login.microsoftonline.com/<tenant id>/oauth2/v2.0/authorize
token_url = https://login.microsoftonline.com/<tenant id>/oauth2/v2.0/token
[local_filesystem]
type = local
Input Fields
The input fields for this Universal Extension are described below.
Field | Input type | Default value | Type | Description |
---|---|---|---|---|
Action | Required | Monitor On Create | Choice | Action to be performed. Available actions:
|
Trigger On Existence | Optional | False | Boolean | When this flag is enabled and objects already exist on the given Storage Filepath , an event will be published. Otherwise, Universal Task will keep monitoring for new objects creation. Optional when Action Monitor On Create is selected. |
Configuration File | Required | - | Script | Rclone Configuration file, that contains all required parameters to connect to the Storage System. |
Storage System | Optional | - | Dynamic Choice Field | Remote or Local storage to execute the selected Action. Execute the Dynamic Choice Field and retrieve all the available Storages from Configuration File. This field is included as an Event Attribute. |
Storage Credentials | Optional | - | Credential | Credentials needed to connect to the selected Storage System. For Storage System of type local , the standard Agent's Credentials field should be provided. |
Storage Filepath | Optional | - | Text | Path to monitor objects for the selected Action. This field is included as an Event Attribute. |
Update Credentials | Optional | False | Boolean | This field needs to be used when Rclone executable updates Fields on its Configuration File during execution, that need to be stored as Credential Fields on the Universal Controller. For example: when OneDrive Storage type is used. In this case, the existing configured token is exchanged by Rclone with a new one to establish the connection. This new token should be stored on Universal Controller for the next task execution. Please read Rclone limitations on OneDrive token refresh here. This field is used in conjunction with fields Controller URL, Controller Credentials, Refresh Storage Credentials. |
Controller URL | Optional | - | Text | Controller URL where Storage Credentials are stored. Required when Update Credentials is checked. |
Controller Credentials | Optional | - | Credentials | Controller user's credentials, used for logging and updating Storage Credentials will be updated. The Credentials definition should be as follows:
Required when Update Credentials is checked. |
Refresh Storage Credentials | Optional | - | Array | This Array field is available when Update Credentials is set to True and used as a mapping table that indicates which field from the Configuration File of Rclone is used to update which Credential entry field on Universal Controller. Populate this array with the following format:
Note: Refresh of Storage Credentials is performed via Controller's REST API. See Network and Connectivity Requirements |
Monitor Time Interval | Optional | 300 | Integer | Interval period in seconds used to monitor the given Storage Filepath. |
Use Filter | Optional | -- None -- | Choice | The filter type that is applied on the Action. Available options:
|
Filter | Optional | - | Text | Filter based on Rclone filtering rules that is applied on the Action. Required when Use Filter is checked. |
Files Only | Optional | False | Boolean | When enabled, directories will be ignored and only files will be monitored. |
Use Storage System Modified Time | Optional | True | Boolean | When enabled, the time of the Storage System will be observed as the object's modification time, instead of the original object's modified time stored as metadata. Note: Having this option enabled, increases performance. |
Recursion Depth | Optional | 1 | Integer | Recursion depth that is applied. Default value 1 means that no recursion will be applied. |
Additional Options | Optional | - | Text | Space separated Rclone options that applied to the selected Action. |
Correlation ID | Optional | - | Text | Use this field optionally to pass extra custom information to the published event (for example: provide the parent Workflow task instance ID). This field is included as an Event Attribute. |
Cloud Data Monitor Events
This Universal Extension publishes Universal Events when objects are monitored succesfully, based on the selected Action. The suggested implementation to accomplish this behavior is to associate a Cloud Data Monitor Task(event publisher) with a Universal Monitor Task(event consumer).
The Universal Events for this Universal Extension are described below.
Event Monitor On Create
Attribute Description Type Storage System The Storage System where the monitored object is stored. This is the input field Storage System
.Text Storage Filepath The Storage System where the monitored object is stored. This is the input field Storage Filepath
.Text Filename The name of the monitored object. Text Correlation ID Custom information field. This is the input field Correlation ID
.Text Event Monitor On Change
Attribute Description Type Storage System The Storage System where the monitored object is stored. This is the input field Storage System
.Text Storage Filepath The Storage System where the monitored object is stored. This is the input field Storage Filepath
.Text Filename The name of the monitored object. Text Filesize The updated size of the monitored object. Text File Modified Time The time when the object last changed. Text Correlation ID Custom information field. This is the input field Correlation ID
.Text Event Monitor On Delete
Attribute Description Type Storage System The Storage System where the monitored object is stored. This is the input field Storage System
.Text Storage Filepath The Storage System where the monitored object is stored. This is the input field Storage Filepath
.Text Filename The name of the monitored object. Text Correlation ID Custom information field. This is the input field Correlation ID
.