I have a generic requirement to group data and get the common string out of it. I am not sure which technology will work for this specific requirement . Hence mentioned it as generic. I currently have access to SQL,Unix commands, SAS , MSTR, Informatica, and Python.
Actual Data:
Field A| Field B | Field c
A | 001 | Alan J
A | 001 | Alan Walker
A | 001 | Walker Alan
A | 001 | Alexander AlanJoseph
Required Output:
Field A | Field B | Field C
A | 001 | Alan
Note: Based on grouping of Field A and Field B, need to find out the common pattern among the data in Field C
Could someone please help me out on this ?