Catalog Items
What are catalog items?
Catalog items represent the actual data that you plan to analyze. They are structured into attributes. You can think of catalog items as database tables and attributes as columns in a table.
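The table-and-column analogy can be sketched as a small data model. This is an illustration only: the `CatalogItem` and `Attribute` classes and their fields are hypothetical names chosen for this example, not types exposed by ONE.

```python
from dataclasses import dataclass, field


@dataclass
class Attribute:
    """One column-like part of a catalog item (hypothetical model, not a ONE type)."""
    name: str
    data_type: str


@dataclass
class CatalogItem:
    """Metadata describing one table-like data asset (hypothetical model)."""
    name: str
    attributes: list[Attribute] = field(default_factory=list)

    def attribute_names(self) -> list[str]:
        # Attributes play the role of columns in the table analogy.
        return [attribute.name for attribute in self.attributes]


customers = CatalogItem(
    name="customers",
    attributes=[Attribute("id", "INTEGER"), Attribute("email", "VARCHAR")],
)
print(customers.attribute_names())  # prints ['id', 'email']
```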
Catalog items are created automatically by importing metadata from various data sources.
ONE does not import full database tables, only the metadata and a small sample of the actual data.
In addition to import, you can create catalog items in the following ways:
-
Virtual catalog items (VCIs) - Created in ONE Desktop. In ONE, these catalog items are labeled as Other. VCIs are useful when, instead of working with raw data, you want to apply ONE Desktop or in-source (SQL) transformation logic first, or when you want to integrate non-natively supported sources using APIs and other connectors. See Virtual Catalog Items.
Where can I find catalog items?
All catalog items can be found and managed in the Data Catalog section of the platform (Data Catalog > Catalog Items).
Here you can see available information about catalog items, such as their attributes, added rules and terms, history, as well as a sample of records. For more information, see Catalog items: Deep dive.
You can search within this list using filters or full-text search. Additionally, the listing is sorted by the catalog item state: Published, Unpublished, All.
Data Catalog is also where you find the key tools for data management, including importing metadata from data sources, viewing, discovering, and profiling data.
To find out more about a specific attribute, select the attribute from the catalog item Overview tab. Or, use the three dots menu and select Show details.
Catalog item tabs
For easier access, catalog items are further grouped into the following tabs:
- Anomaly Overview
-
Filters catalog items with detected anomalies.
- Domain Suggestions
-
Shows all catalog items with suggested terms that you can accept or reject.
- Master Data
-
Shows catalog items imported from ONE MDM, if your version of ONE includes MDM.
- Reference Data
-
Shows catalog items imported from ONE RDM, if your version of ONE includes RDM.
- Not Monitored
-
Lists catalog items that are not currently included in any monitoring projects. See Monitoring Projects.
How are catalog items analyzed?
The process of analyzing your data involves these steps:
-
Connect to a data source.
-
Import metadata from the data source. This creates catalog items in Data Catalog.
-
Analyze the data. Run either profiling or automatic documentation flow, with or without data quality evaluation, on records in catalog items. See Profiling and documentation flows.
-
View the results. See View profiling results and View results of DQ evaluation.
Profiling and documentation flows
Profiling refers to the process of examining and analyzing data in order to collect statistics, discover patterns and distributions, and provide other useful metrics about data. There are two types of profiling:
-
Sample profiling: This type of profiling retrieves a small sample of records from your catalog item, profiles it, and then applies detection rules to find relevant business domains. As a result, the catalog item metadata is enriched with information about detected business terms.
-
Full profiling: Full profiling works on all records from the catalog item. In addition to profiling and term detection, it also includes anomaly detection, which attempts to discover irregularities in the data.
For more information, see Run Profiling.
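To make the difference between the two profiling types more concrete, here is a minimal sketch that computes the same statistics once on a sample and once on all records. It is not how ONE implements profiling: the `profile`, `sample_profile`, and `full_profile` functions, the chosen statistics, and the "first N records" sampling strategy are all assumptions made for illustration, and term detection and anomaly detection are left out.

```python
from collections import Counter
from itertools import islice


def profile(values):
    """Collect simple statistics for one attribute (illustrative metrics only)."""
    values = list(values)
    non_null = [value for value in values if value is not None]
    frequencies = Counter(non_null)
    return {
        "count": len(values),
        "nulls": len(values) - len(non_null),
        "distinct": len(frequencies),
        "most_common": frequencies.most_common(1)[0][0] if frequencies else None,
    }


def sample_profile(values, sample_size):
    """Profile only the first `sample_size` records (assumed sampling strategy)."""
    return profile(islice(values, sample_size))


def full_profile(values):
    """Profile every record."""
    return profile(values)


country = ["CZ", "CZ", None, "US", "CZ", "DE", None, "US"]
sample_result = sample_profile(country, sample_size=4)
full_result = full_profile(country)
print(sample_result["distinct"], full_result["distinct"])  # prints 2 3
```

In this example, the sample misses the value "DE", which is the kind of detail that only profiling all records can reveal.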
Documentation flow is profiling applied to a whole data source, which allows you to document the entire source in a single click. By default, three options are available:
-
Import: Loads source metadata to ONE.
-
Discover: Runs sample profiling on the source.
-
Document: Runs full profiling on the source, including DQ evaluation on selected items.
For more information, see Run Documentation Flow.
In addition to out-of-the-box options, both profiling and documentation flows can use a custom configuration. For more information, see Configure Profiling and Configure Documentation Flow.
View profiling results
To view profiling results, navigate to the catalog item or attribute you’re interested in and switch to the Profile tab. For more information, see Understand Profiling Results.
From this tab, you can also run Sample or Full profiling to recalculate the results.
View results of DQ evaluation
To view results of data quality evaluation, navigate to the catalog item or attribute you’re interested in and switch to the Data Quality tab. From this tab, you can also run DQ evaluation to recalculate the results.
For more information about data quality in ONE, see Data Quality.
Edit catalog item metadata
To change the catalog item name or description, in Data Catalog > Catalog Items, open the catalog item you want to modify and in the three dots menu select Edit.
After you’re finished, publish your changes.
Delete catalog item
This action cannot be reversed.
To delete a catalog item, in Data Catalog > Catalog Items, select the catalog item you want to delete and then Delete. Confirm your choice.
Or, open the catalog item and in the three dots menu select Delete.
Depending on your access rights, a deleted catalog item might not be removed immediately. In that case, the change must first be approved by a user with the corresponding access rights. See Publish Changes.
Add attribute
To add an attribute, on the catalog item Overview tab, select Add Attribute.
Fill in the information as needed and select Save. Different fields are available for different attribute types.
Finally, publish your changes.
Edit attribute
To edit an attribute, on the catalog item Overview tab, select the three dots menu and then Edit. Or, select the attribute first and then from the attribute Overview tab select Edit.
Fill in the new details for the editable fields and select Save. Publish your changes.
Delete attribute
To delete an attribute, on the catalog item Overview tab, select the three dots menu and then Delete.
Or, select the attribute first and on the attribute Overview tab, select the three dots menu and then Delete.
Manage access
You can view permissions settings for the users in the Access tab.
Access is managed by users with Owner rights. For more information about editing access roles, see Manage Users and Roles.
To manage access:
-
From the three dots menu for the catalog item, select Manage access.
-
In the dialog that opens, select the role:
-
Owner - Has full permissions, including publishing changes and managing access.
-
Editor - Has viewing and editing permissions, but cannot publish changes.
-
Viewer - Has view-only permissions.
-
In Users or Group, select one or more users or groups of users to whom the selected role applies.
-
Save the changes.
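The three roles described above can be summarized as a permission matrix. The following sketch is one way to model it, assuming that managing access is reserved for the Owner role as stated earlier in this section. The `Permission` flags, the `ROLE_PERMISSIONS` mapping, and the `can` helper are hypothetical names for this example and do not correspond to a ONE API.

```python
from enum import Flag, auto


class Permission(Flag):
    """Actions a role can perform on a catalog item (hypothetical names)."""
    VIEW = auto()
    EDIT = auto()
    PUBLISH = auto()
    MANAGE_ACCESS = auto()


# Role-to-permission mapping derived from the role descriptions in this section.
ROLE_PERMISSIONS = {
    "Owner": Permission.VIEW | Permission.EDIT | Permission.PUBLISH | Permission.MANAGE_ACCESS,
    "Editor": Permission.VIEW | Permission.EDIT,
    "Viewer": Permission.VIEW,
}


def can(role: str, permission: Permission) -> bool:
    """Return True if the given role includes the given permission."""
    return permission in ROLE_PERMISSIONS[role]


print(can("Editor", Permission.EDIT))     # prints True
print(can("Editor", Permission.PUBLISH))  # prints False
```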
Catalog items: Deep dive
To find out more about a particular catalog item, select it from the Catalog Items screen.
Depending on your global role, access level, and whether the asset was shared with you or not, you might not see all the information described here. For more information, see user-access-management:governance-roles.adoc. If you do not see something you are looking for, you can request access by following the instructions in tasks-and-workflows:workflows.adoc. However, keep in mind that some tabs or options might not be available for the particular catalog item type.
On the details view (Overview tab), you can see a summary of some key metadata, such as:
-
Catalog item attributes, their data types, and business terms.
If term detection is enabled, this also includes suggested business terms. They are displayed with a confidence score and the option to approve or reject them.
-
The number of attributes and records.
-
Data quality results. See Run DQ Evaluation.
-
Detected anomalies. See Anomaly Detection in Catalog Items.
-
Business terms added to the catalog item.
-
Catalog item type (file, table, view, or other), description, data source, and purpose.
The purpose is a custom tag that is helpful when you intend to use the catalog item in a particular monitoring project stage. To keep track of multiple instances of the same catalog item that have a different purpose, use Linked instances.
-
Related assets, such as monitoring projects, reports, or other catalog items.
View live data
Open the catalog item Data tab to view a sample of the first 50 lines of live data. Live means that all the changes are displayed regardless of whether they’re published or in draft.
For ONE Data catalog items, a link is shown to easily navigate to the corresponding ONE Data table.
View catalog item structure
For a more detailed look into the catalog item attributes, open the Data Structure tab. Here you can find information such as:
-
Original type: The data type of the attribute in the origin source.
-
Order: Attribute order in the catalog item.
-
Constraints: Whether all attribute records meet the SQL constraints defined for the attribute.
If the catalog item contains partitions, the latest three partitions are displayed as well.
Here, you can also create an index for the catalog item: select Create, then provide a name for the column and set the uniqueness constraint. Make sure to save your changes.
View related assets
To see connected assets represented on a graph, open the Relationships tab. This includes the catalog item attributes, other catalog items, reports, and data sources.
Initially, only the catalog item and its attributes are shown. To enrich the graph and discover more relationships, run full profiling or a documentation flow. Or, define relationships manually from the Relations section on the Overview tab.
For custom-created entities, the Relationships tab is only available if it is explicitly configured for your entity. For more details, see Traits, section Enable the Relationships tab.
Relationships graph: Tips & tricks
-
Use the plus icon to expand the connections to the level of detail required. The number next to the connection (for example, 2/14) shows how many of the available connections are currently shown on the graph.
-
Use the dropdown arrow next to the catalog item name to hide or display the list of attributes.
-
To highlight a connection, hover over the connection and select Highlight.
-
Customize the graph to better suit your needs. You can choose:
-
Which information about the catalog item should be shown: terms, data validity, anomalies. Select the settings icon and then select all options that apply.
-
Which entities should be displayed. Select Assets Configuration and then select the relevant Assets. Confirm with Redraw graph.
-
The style of the relationship view. Select Assets Configuration and then select from the list of Relationships: either (1) arrow from source to target or (2) boxed target inside the source. Confirm with Redraw graph.
View lineage
The Lineage tab shows a graph of the connections between the catalog item and other entities from different data sources. The graph is generated as a result of full profiling or a documentation flow. It can be visualized through Manta Flow, which automatically completes the physical lineage.
If no profiling was run on the item yet, there are no connections visualized, only the catalog item and its attributes.
Manta integrated Lineage is only available for catalog items from MS SQL and PostgreSQL sources. For more information, see Relational Database Connection.
Lineage graph: Tips & tricks
You can expand the graph to the required details level:
-
Use the plus icon to expand the connections to the level of detail required.
-
Use the dropdown arrow next to the catalog item name to hide or display the list of attributes.
-
To highlight a connection, hover over the connection and select Highlight.
Customize the graph to better suit your needs. You can choose:
-
Which information about the catalog item should be shown: terms, data validity, anomalies. Select the settings icon and then select all options that apply.
-
Which entities should be displayed. Select Assets Configuration and then select the relevant Assets. Confirm with Redraw graph.
-
The style of the relationship view. Select Assets Configuration and then select from the list of Relationships: either (1) arrow from source to target or (2) boxed target inside the source. Confirm with Redraw graph.
To view details, select View detailed lineage.
To see the detailed lineage graph, a power user must first set up the connection with Manta by configuring the corresponding properties. For more information, see Lineage Configuration.