ONE Data Setup for Hybrid Deployments
In environments running on hybrid Data Processing Engines (purely hybrid DPEs or in combination with cloud DPEs), ONE Data and all its features are turned off by default.
To turn on ONE Data, follow the instructions in Enable ONE Data.
Before making ONE Data available, review the Data location and security section.
Depending on your deployment type Ataccama Professional Services or your system administrator need to perform the technical setup. |
Data location and security
The data that you load from catalog items to ONE Data is sent to and stored in a ONE Data database hosted on Ataccama Cloud. There are no alternative storage options available.
The connection to Ataccama Cloud is secured using HTTPS/TLS 1.3.
Enable ONE Data
To make ONE Data available, you need to:
-
Reach out to Ataccama Professional Services to perform the necessary setup on Ataccama side.
-
If using a combination of cloud and hybrid DPEs, make sure your data sources are configured to use the correct DPEs. See Source configuration in mixed environments.
Enable ONE Data plugin
Turn on the ONE Data plugin (dmm-datasource
) in the hybrid DPE configuration.
DMM is a service that lets ONE Data store, update, delete, and access data.
To enable the dmm-datasource
plugin in your DPE, the plugin.dmm-datasource.enabled
property must be set to true
in the DPE application.properties
.
Modify the property in your Ansible or Helm configuration accordingly.
Set ONE Data feature flags
Configure the ONE Data using available feature flags.
Default settings
The default setup after the deployment is:
-
ONE Data: Enabled.
-
ONE Data Load: Enabled.
-
ONE Data Load limited to Reference data extraction: Disabled.
In this setup all ONE Data features are available including loading records to ONE Data on Ataccama Cloud from Data Catalog.
Change the settings
You can change the default settings to turn off loading data from Data Catalog to Ataccama Cloud, or to limit the imported data to reference data only.
For more information, see Limit or Turn Off ONE Data Capabilities.
Source configuration in mixed environments
When using a combination of cloud and hybrid DPEs, you can predefine at the source level which DPE should be used for processing:
-
Cloud-accessible sources: Data sources like Snowflake, Databricks, or other cloud databases that are accessible via the internet can use either cloud or hybrid DPEs.
-
Private network sources: Data sources located in your private network can only be accessed through hybrid DPEs. If you configure a cloud DPE for such sources, the data loading job fails.
Was this page useful?