Dremio Lineage Scanner
The Dremio lineage scanner supports metadata ingestion using combination of JDBC and REST API:
Running the scanner
To run the scanner, use the following command in the directory containing the scanner’s application .jar
file:
java -jar --add-opens=java.base/java.nio=org.apache.arrow.memory.core,ALL-UNNAMED mde-lineage-toolkit-X.Y.Z.jar dremio.json
Replace X.Y.Z with the version number of your downloaded toolkit.
|
Supported Sources
-
Cross lineage is supported for the following Dremio Sources:
-
Oracle
-
MS SQL
-
Snowflake
-
Supported statement types and SQL syntax
-
Supported Statements Types:
-
CREATE VIEW
-
-
Supported Database Objects for Design Time Lineage:
-
Views
-
-
Not yet supported SQL constructs:
-
Statements containing these SQL constructs will not be included in the lineage diagram:
-
Table Functions
-
UNNEST Clause
-
AT SNAPSHOT Clause
-
-
Note: Most of the unsupported syntax is planned to be supported in the upcoming versions of the scanner, based on prioritization.
Scanner configuration
Online (JDBC) based extraction connects to Dremio and extracts the necessary metadata.
Scope: Design time views and stored procedures.
Limitations: Currently, only username/password authentication is supported (other types are planned).
Property | Description |
---|---|
|
Unique name of the scanner job |
|
Must contain DREMIO |
|
Human readable description |
|
Arrow Flight JDBC url |
|
Dremio username |
|
Dremio password, can be encrypted |
|
Dremio host - required for REST API calls |
|
Dremio port - required for REST API calls |
|
By default, is set to true. For development/docker Dremio can be set to false disable SSL API requests. |
|
By default, is set to false. For development/docker Dremio can be set to true to accept self-signed SSL certificate. Used only, when useSSLApiCalls is set to true. |
|
List of spaces to include in lineage extraction. The filter is case-insensitive and supports SQL wildcards: '%' and '_'. I.e.: to include all schemas ending with "PROD", add filter "%PROD". |
|
List of spaces to exclude from lineage extraction. Similarly to includeSpaces, this filter is case-insensitive and supports SQL wildcards |
Legend: *mandatory
{
"scannerConfigs": [
{
"name": "dremio-localhost",
"sourceType": "DREMIO",
"description": "Scan dremio export files",
"includeSpaces": [],
"excludeSpaces": ["@admin", "DevSpace%"],
"connection": {
"arrowFlightJdbcUrl": "jdbc:arrow-flight-sql://localhost:32010/?useEncryption=false",
"username": "admin",
"password": "some_password",
"apiHost": "localhost",
"apiPort": 9047
}
}
]
}
Scanner special features
Dremio scanner supports the same SQL "anomaly" detection checks as the Snowflake scanner. See the SQL "DQ" (Anomaly) detections for more details.
Was this page useful?