Data Freshness Custom Configuration

For information about how to use the Freshness feature in Data Observability, see Data Observability.

This is an experimental feature and as such upgradability cannot be guaranteed. Connection methods for unsupported data sources and custom configurations might change with later releases.

For the sources that support freshness by default (Snowflake, PostgreSQL, BiqQuery, and Oracle), it is not necessary to modify connection settings unless you want to: for example, if you have a custom definition for freshness within the database.

If you do have a custom freshness definition, or want to check freshness on items in databases which are not supported, it is possible to add additional configuration by selecting Modify connections settings.

Freshness checks are implemented using SQL to query database metadata tables or user tables. When database metadata tables are being queried, the required info is already exposed by database engines, as is the case with the data sources that support data freshness by default. Querying user tables means users can mimic freshness arbitrarily, using queries which produce predefined results.

Both options share the same configuration structure. The default per database engine is configured in driver properties and is used as fallback when user configuration is not supplied with the job data.

Freshness configuration

Driver level configuration is present for database engines supporting table modification timestamp natively where supported drivers are:

PostgreSQL
MS SQL
Oracle
Snowflake
BigQuery
Azure Synapse

Remaining drivers do not support it natively.

PostgreSQL implementation is based on commit timestamps which is row level data. Thus deleting rows in fact doesn’t contribute to freshness, it might even distort it if applied on most recently changed rows.

Driver level

This configuration is part of DPE properties and available for customization in the DPM Admin Console.

It is expressed with the following properties:

Property Description

plugin.jdbcdatasource.ataccama.one.driver.<driver>.freshness.enabled

plugin.jdbcdatasource.ataccama.one.driver.<driver>.freshness.chunkSize

Size of catalog item batch sent in single freshness query.

Can be set as low as 1 if you want to send individual queries for each catalog item.

Default value: 1000.

plugin.jdbcdatasource.ataccama.one.driver.<driver>.freshness.supported-query

SQL query returning supported as Boolean (true).

When true is not present, freshness is considered as implicitly supported.
When false is returned, the job fails.

plugin.jdbcdatasource.ataccama.one.driver.<driver>.freshness.prepare-query

SQL statement without result, prepares the database session for freshness query.

plugin.jdbcdatasource.ataccama.one.driver.<driver>.freshness.clean-query

SQL statement without result, counterpart for actions in prepare-query.

plugin.jdbcdatasource.ataccama.one.driver.<driver>.freshness.check-query.pattern