I have two files:
f1.csv - it contains 800 rows with unique id values:
id
1
2
3
4
5
f2.xlsx (Sheet1) - it contains 20 columns and many rows (200 MB):
typeID col2 col3 ...
1
1
1
2
2
2
2
2
3
4
10
10
...
I want to reduce the size of f2.xlsx so that I can open the data in Jupyter Notebook (Python) and analyze it with pandas. In particular, I want to keep only the rows whose typeID value matches an id in f1.csv.
Is there a way to do this filtering with terminal commands and then save the filtered result as a CSV file?
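
For reference, here is a minimal sketch of the kind of thing I have in mind: a small Python script run from the terminal that streams the workbook row by row instead of loading all of it at once. It is only a sketch under some assumptions: typeID is the first column of Sheet1, f1.csv has a header row named id, and the third-party openpyxl package is installed. The script name filter_xlsx.py and the output name filtered.csv are made up for illustration.

```python
import csv
import sys


def load_ids(csv_path, column="id"):
    """Read the id column of f1.csv into a set of strings."""
    with open(csv_path, newline="", encoding="utf-8-sig") as fh:
        return {row[column].strip() for row in csv.DictReader(fh)}


def normalize(value):
    """Render a cell as text so that 1, 1.0 and "1" compare equal."""
    if isinstance(value, float) and value.is_integer():
        value = int(value)
    return "" if value is None else str(value).strip()


def filter_rows(rows, ids, key_index=0):
    """Yield the header row, then only rows whose key cell is in ids.

    key_index=0 assumes typeID is the first column (an assumption).
    """
    rows = iter(rows)
    header = next(rows, None)
    if header is None:
        return
    yield header
    for row in rows:
        if normalize(row[key_index]) in ids:
            yield row


def xlsx_rows(xlsx_path, sheet="Sheet1"):
    """Stream rows from the workbook without loading it all into memory."""
    from openpyxl import load_workbook  # third-party: pip install openpyxl

    wb = load_workbook(xlsx_path, read_only=True)
    try:
        yield from wb[sheet].iter_rows(values_only=True)
    finally:
        wb.close()


def main(ids_csv, xlsx_path, out_csv):
    """Write the header plus all matching rows of the workbook to out_csv."""
    ids = load_ids(ids_csv)
    with open(out_csv, "w", newline="") as fh:
        csv.writer(fh).writerows(filter_rows(xlsx_rows(xlsx_path), ids))


if __name__ == "__main__" and len(sys.argv) == 4:
    main(*sys.argv[1:4])
```

It would be run as: python filter_xlsx.py f1.csv f2.xlsx filtered.csv. The resulting filtered.csv should then be small enough to load with pandas.read_csv in the notebook.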