MS SSIS Lineage Scanner
Scanned and supported objects
|
The scanner automatically generates lineage metadata for all currently supported sources and connections. However, lineage coverage might vary depending on system capabilities, dynamic transformations, or access permissions. While we strive for comprehensive lineage capture, certain dataflows and transformations might be incomplete or unavailable due to technical constraints. We continuously work to expand coverage and accuracy. |
The MS SSIS Lineage Scanner supports SQL Server Integration Services (SSIS) 2012 and later.
Scanned resources
The extracted metadata includes the following objects:
-
Folders
-
Projects
-
Packages and connection management files
Supported components
-
CharacterMap -
ConditionalSplit -
DerivedColumn -
FlatFileSource -
Inserter -
Lookup -
ManagedComponentHost -
Merge -
MergeJoin -
OLEDBDestination -
OLEDBSource -
RowCount -
Sort -
UnionAll
SQL queries extraction
SQL query extraction is supported only within the Data Flow Task component, and only for the following SSIS components:
-
OLEDBSource -
OLEDBDestination -
Lookup
Note:
-
Only single-statement SQL queries are supported for extraction.
-
Queries containing multiple statements (for example, using
DECLAREbeforeSELECT, or any batch execution) are not processed. -
The
EXECstatement is not supported. -
The scanner builds SQL lineage without using any database dictionary metadata. Therefore, SQL statements that require access to the underlying database object definitions (tables or views) to resolve lineage are not supported and will not produce lineage.
Variable-based table definition limitations
When a variable is used instead of a literal table name, the variable value must be defined in one of the following formats:
-
[schema].[table] -
'schema'.'table' -
schema.table
Any other formats or dynamic concatenations of schema and table names are not supported for SQL extraction.
Currently, the scanner supports only variables that are defined within SSIS packages. Variables defined externally or through project parameters are currently not supported.
Variable-defined SQL queries
If an SQL query is defined through a variable that contains the full SQL statement (for example, when the component references a variable holding the query text), it is also supported for extraction. The scanner detects and parses SQL statements stored in variable definitions used by supported components.
Supported connectivity
-
Connector type: JDBC.
-
Authentication method: SQL Server Authentication using SQL login and password.
Supported connection types
The scanner currently supports the following SSIS connection types:
| Connection type | Description |
|---|---|
|
OLE DB connections to MS SQL (other technologies do not provide any guarantee of data lineage quality). |
|
Connections to flat files such as |
|
Connections to Microsoft Excel workbooks. |
|
Connections to SSIS cache files used by components. |
|
Connections using ADO.NET with the |
Connection processing
The scanner identifies and categorizes connections by type to standardize lineage extraction and reduce duplication:
-
OLE DB / ADO.NET: Extracts host, database, and schema information.
-
Excel: Captures file paths and sheet names.
-
Flat File: Records file paths and basic structure (columns and delimiters).
-
Cache: Groups all cache connections under a unified logical connection.
Connections with identical or equivalent properties are consolidated automatically to prevent duplicates in the extracted metadata.
SSIS permissions
The user used for metadata extraction must have the following permissions:
-
SELECTon tables and views of thessisdb.catalogschema. -
EXECUTEonssisdb.internal.get_project_internal. -
READon every extracted folder. -
READon every extracted folder’s project. -
READon every extracted project’s package.
Scanner configuration
All fields marked with an asterisk (*) are mandatory.
| Property | Description |
|---|---|
|
Unique name for the scanner job. |
|
Specifies the source type to be scanned.
Must contain |
|
A human-readable description of the scan. |
|
List of Ataccama ONE connection names for future automatic pairing. |
|
The full JDBC connection string used to connect to SQL Server. |
|
The SQL Server user. |
|
SQL server user password. |
{
"scannerConfigs": [
{
"name": "SSIS Scan Connection",
"sourceType": "SSIS",
"description": "Scan SSIS project",
"oneConnections": [],
"jdbcUrl": "jdbc:sqlserver://mqlserver.database.windows.net;encrypt=true;trustServerCertificate=true",
"username": "admin",
"password": "@@ref:ata:[SSIS_USER_PASSWD]"
}
]
}
Was this page useful?