I have a csv file that contains data about clients on separate lines that I want to convert.
The current layout is:
Client_Name,Client_ID,Client_Group,Attribute_Name,Date,Attribute_Value
Each row contains info about a client e.g.:
Acme,0001,Marketing,Sales_Amt,2010-10-01,100
Acme,0001,Marketing,Queries,2010-10-01,3
Smiths,0002,Retail,Sales_Amt,2010-10-01,1200
Smiths,0002,Retail,Queries,2010-10-01,11
What I want to do is transform it to the following time series layout:
Date,Client_Name,Sales_Amt,Queries
So that each row reads:
2010-10-01,Acme,100,3
2010-10-01,Smiths,1200,11
So I can see per day, per client attributes. Is this a job for ETL or can I do this easily just with file manipulation tools like sed & awk?