I have a data set with 10000 rows and 32 columns. Is it possible to select the rows that share the same values for some of the features?
Here is an example to make my question clearer.
col1 col2 col3 col4 col5
1 2 3 4 5
3 4 3 6 8
2 2 5 4 5
4 2 7 4 5
5 4 8 6 8
2 3 1 0 9
3 4 1 5 2
In this data set there are 5 columns. Suppose I want to select the rows that have the same values in columns 2, 4, and 5.
As can be seen, the first, third, and fourth rows have the same values in col2, col4, and col5, and the second and fifth rows also share the same values in those columns. So I would pick these rows, and the new data set would be:
col1 col2 col3 col4 col5
1 2 3 4 5
3 4 3 6 8
2 2 5 4 5
4 2 7 4 5
5 4 8 6 8
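
To make the selection rule concrete, here is a minimal sketch in plain Python of what I am after. It assumes the rows are held as tuples, and the helper name `rows_sharing_values` is made up for illustration. A row is kept only if its (col2, col4, col5) combination appears in at least one other row.

```python
from collections import Counter

# Example data from the question: one tuple per row (col1..col5).
rows = [
    (1, 2, 3, 4, 5),
    (3, 4, 3, 6, 8),
    (2, 2, 5, 4, 5),
    (4, 2, 7, 4, 5),
    (5, 4, 8, 6, 8),
    (2, 3, 1, 0, 9),
    (3, 4, 1, 5, 2),
]


def rows_sharing_values(rows, key_indices):
    """Keep rows whose values at key_indices also occur in another row."""
    def key(row):
        return tuple(row[i] for i in key_indices)

    # Count how often each combination of key values occurs.
    counts = Counter(key(row) for row in rows)
    # Keep only rows whose combination occurs more than once.
    return [row for row in rows if counts[key(row)] > 1]


# col2, col4, col5 correspond to zero-based positions 1, 3, 4.
selected = rows_sharing_values(rows, key_indices=(1, 3, 4))
for row in selected:
    print(*row)
```

If the data were in a pandas DataFrame `df` (an assumption, since I have not said which tool I am using), I believe `df[df.duplicated(subset=["col2", "col4", "col5"], keep=False)]` would express the same selection.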