Objective :
Let's think of a large scale enterprise where we have a heterogeneous data stores such as SQL servers, No-SQL stores, Big data stores like ADL, ADF..etc spreads across different business groups.
Our objective is to build a lineage service at an enterprise level, which will provide insights on the data lineage between different data store instances within an enterprise. This will help us in gaining insights on the data proliferation within the enterprise.
For this, as a phase 1 we want to take SQL server and want to build the lineage within and across SQL servers and It's databases.
Is there a way we can build / extract (if available) the data lineage (table & Column level) from a SQL server by leveraging SQL profiling or transaction logs within or across instances.
I looked at Determining Impact and Data Lineage and looks like the SQL server should have the Dependency Services enabled.
As a enterprise level service, I may have access to the SQL Servers and may not have control over the sql server instances on enabling any services.
Could anyone share any insights or experiences with respect to building or extracting Data Lineage within or across SQL Servers.
Thanks in Advance !
Mydeen