I have several tables that have different column names which are mapped through ETL. There are a total of around 200 tables and 500 attributes, so the set is not massive.
Some column mappings are as follows:
startDate EFT_DATE
startDate START_DATE
startDate entryDate
As you can see the same column name can be mapped to different names across different tables.
I'm trying to solve the following problem :
Given two schemas I want to find matches between attribute names.
I was wondering if there is a way to leverage gensim to solve this problem similar to source-words from Google example. The challenge I'm facing is which dataset to use to train the model. Also I am wondering if there is another approach to solve the problem.