Polling event

Overview

Version 5.4 of the Cinchy platform introduced data polling, a source option which uses the Cinchy Event Listener to continuously monitor and sync data entries from your Oracle, SQL Server, or DB2 server into your Cinchy table. This capability makes data polling a much easier, effective, and streamlined process and avoids implementing the complex orchestration logic that was previous necessary.

The Polling Event source supports real-time syncs.

The Polling Event Source supports Oracle, DB2 and SQL Server databases.

Info tab

You can find the parameters in the Info tab below (Image 1).

Values

Parameter

Description

Example

Title

Mandatory. Input a name for your data sync

Polling Event Sync

Variables

Optional. Review our documentation on Variables here for more information about this field.

Permissions

Data syncs are role based access systems where you can give specific groups read, write, execute, and/or all of the above with admin access. Inputting at least an Admin Group is mandatory.

Source tab

The following table outlines the mandatory and optional parameters you will find on the Source tab (Image 2).

The following parameters will help to define your data sync source and how it functions.

Parameter

Description

Example

Source

Mandatory. Select your source from the drop down menu.

Polling Event

To set up a real-time sync, you must configure your Listener values. You can do so through the Connections UI.

Note that If there is more than one listener associated with your data sync, you will need to configure the addition listeners via the Listener Configuration table.

Reset behaviour

Parameter

Description

Example

Auto Offset Reset

Earliest, Latest or None. In the case where the listener is started and either there is no last message ID, or when the last message ID is invalid (due to it being deleted or it's just a new listener), it will use this column as a fallback to determine where to start reading events from.

Earliest will start reading from the beginning on the queue (when the CDC was enabled on the table). This might be a suggested configuration if your use case is recoverable or re-runnable and if you need to reprocess all events to ensure accuracy. Latest will fetch the last value after whatever was last processed. This is the typical configuration. None won't read or start reading any events. You are able to switch between Auto Offset Reset types after your initial configuration through the process outlined here.

None

Optional AppSettings configurations

DataPollingConcurrencyIndex: This property allows only a certain number of threads to run queries against the source database, which works to reduce the load against the database.
- The default number of threads is set to 12.
- To configure this property, navigate to your appSettings.json deployment file > "DataPollingConcurrencyIndex": <numberOfThreads>
QueueWriteConcurrencyIndex: This property allows only a certain number of threads to be concurrently sending messages to the queue. This works to provide a more consistent batching by the worker and reduce your batching errors. run queries against the source database, which works to reduce the load against the database.
- The default number of threads is set to 12.
- To configure this property, navigate to your appSettings.json deployment file > "QueueWriteConcurrencyIndex": <numberOfThreads>.
- Note that this index is shared across all listener configs, meaning that if it's set to 1 only one listener config will be pushing the messages to the queue at a single moment in time.

// App Settings JSON Example
// Example of the configurable propeties: DataPollingConcurrencyIndex (set to "1" and QueueWriteConcurrencyIndex (set to "1")
"AppSettings": {
    "GetNewListenerConfigsInterval": "",
    "StateFileWriteDelaySeconds": "",
    "KafkaClientConfig": {
      "BootstrapServers": ""
    },
    "KafkaRealtimeDatasyncTopic": "",
    "KafkaJobCancellationTopic": "",
    "DataPollingConcurrencyIndex":  1,
    "QueueWriteConcurrencyIndex":  1
  }

Topic JSON

The below table can be used to help create your Topic JSON needed to set up a real-time sync.

Parameter

Description

Example

CursorConfiguration

Mandatory. The parameters here are used in a basic query which searches for all records in a particular table.

Note that in our example we need to use a sub-query to prevent an infinite loop if the "CursorColumn" parameter isn't unique.

Example basic query:

SELECT Id, Name
FROM [SourceTable]
WHERE Id IN (SELECT TOP (100) Id
    FROM [SourceTable]
    WHERE Id > 0 AND Name IS NOT NULL
    ORDER BY Id)
 AND Id > 0 AND Name IS NOT NULL

FromClause

Mandatory. This must contain at least the table name but can also contain Joined tables as written in SQL language.

Example: [Source Table]

CursorColumn

Mandatory. Column name that's used in any 'WHERE' condition(s) and for ordering the result of a query

Example: [Id]

BatchSize

Mandatory. Minimum size of a batch of data per query. This can be larger to prevent infinite loops if the CursorColumn isn't unique.

Example: 100

FilterCondition

All filtering options used in any 'WHERE' condition(s) of the query

Example: Name IS NOT NULL

Columns

Mandatory. A list of columns that we want to show in a result.

Example:Id, Name

ReturnDataConfiguration

The parameters here are used in more complex queries. This example has 2 related tables, but want to show the contents of one of them based on the 'CursorColumn' from a second table. Since Timestamp values aren't unique, we need to find all combinations of Id, Timestamp that match the filter condition in a subquery, and then join this result with the outer-query to get the final result. In `ReturnDataConfiguration`, our parameters area of concern is everything outside of first open parenthesis `(` and last closing parenthesis `)`. For example:

FROM
(
...
) AS t INNER JOIN [Table1] ts ON ts.[Id] = t.[Id]
WHERE ts.[Id] > 0
ORDER BY Id

Example complex query:

SELECT ts.[Id],ts.[Name] FROM
(
SELECT Id,Timestamp
FROM [Table2]
WHERE Timestamp IN (SELECT TOP (2)
Timestamp
FROM [Table2]
WHERE Timestamp > '2022-11-18 11:34:09 AM'
AND Timestamp <= '2022-11-19 11:34:09 AM'
AND 1=1
ORDER BY Timestamp)
AND Timestamp > '2022-11-18 11:34:09 AM'
AND Timestamp <= '2022-11-19 11:34:09 AM'
AND 1=1
) AS t
INNER JOIN [Table1] ts ON ts.[Id] = t.[Id]
WHERE ts.[Id] > 0
ORDER BY Id

CursorAlias

Mandatory. This is the alias for a subquery result table. It's used in 'JoinClause', and can be used in 'Columns' if we want to return values from a subquery table.

Example: "t"

JoinClause

Mandatory. Our result table to which we join the subquery result, plus the condition of the join.

Example: [Table1] ts ON ts.[Id] = t.[Id]

FilterCondition

All filtering options used in any 'WHERE' conditions.

Example: "ts.[Id] > 0"

OrderByClause

Mandatory. This is the column(s) that we want to order our final result by.

Example: "Id"

Columns

Mandatory. A list of columns that we want to show in the final result.

Example: "ts.[Id]" "ts.[name]"

Delay

Mandatory. This represents the delay, in second, between data sync cycles once it no longer finds any new data.

Example: 10

messageKeyExpresssion

Optional, but recommended to mitigate data loss. See Appendix A for more information on this parameter.

CursorConfiguration.CursorColumnDataType

Mandatory. This property works in tandem with an update that ensures that the database query always moves the offset, regardless of if the query returned the records or not—this helps to ensure that the performance of the source database isn't being weighed down by constantly running heavy queries on a wide range of records when the queries returned no data. This value of this mandatory property must match the column type of the source database system for proper casting of parameters.

int

CursorConfiguration. Distinct

Mandatory. This property is a true/false Boolean type that, when set to true, applies a distinct clause on your query to avoid any duplicate records.

true

**Example Topic JSON**

{
  "CursorConfiguration": {
    "FromClause": "[Source Table]",
    "CursorColumn": "Id",
    "BatchSize": 100,
    "FilterCondition": "Name IS NOT NULL",
    "Columns": [
      "Id", "Name"
            "Distinct": "true"
            "CursorColumnDataType" : "int"
    ]
  },
  "ReturnDataConfiguration": {
    "CursorAlias": "t",
    "JoinClause": "[Table1] ts ON ts.[id] = t.[id]",
    "FilterCondition": "ts.[id] > 0",
    "OrderByClause": "id",
    "Columns": [
      "ts.[Id]"
      "ts.[Name]",
    ]

  },
  "Delay": 10
}

Connection Attributes

The below table can be used to help create your Connection Attributes JSON needed to set up a real-time sync.

Parameter

Description

Example

databaseType

Mandatory. TSQL, Oracle, or DB2

TSQL

connectionString

Mandatory. This should be the connection string for your data source.

{
  "databaseType": "TSQL",
  "connectionString": "Server=;Database=;User ID=cinchy;password=example;Trusted_Connection=False;Connection Timeout=30;Min Pool Size=10;"
}

Next steps

Configure your Destination
Define your Sync Actions.
Add in your Post Sync Scripts, if required.
If more than one listener is needed for a real-time sync, configure it/them via the Listener Config table.
To run a real-time sync, enable your Listener from the Execution tab.

Appendix A

messageKeyExpression

The messageKeyExpression parameter is an optional, but recommended, parameter that can be used to ensure that you aren't faced with a unique constraint violation during your data sync. This violation could occur if both an insert and an update statement happened at nearly the same time. If you choose not to use the messageKeyExpression parameter, you could face data loss in your sync.

This parameter was added to the Data Polling event stream in Cinchy v5.6.

Each of your Event Listener message keys a message key. By default, this key is unique for every message in the queue.

When the worker processes your Event Listener messages it does so in batches and, for efficiency and to guarantee order, messages that contain the same key won't be processed in the same batch.

The messageKeyExpression property allows you to change the default message key to something else.

Example:

  "MessageKeyExpression": "id"

PreviousOracle (query and table)NextPolling event example config

Last updated 1 year ago