I have a data frame which is in .csv format. This data frame includes 34500 rows. In this file, list of a RNAseq analysis result is present. Here the problem is some genes have multiple results and I should pick 1 entry for each gene and this entry should have the most p value. I edited my data and I have just "Gene symbol" and "p value" information.
How can i remove/eliminate rows which includes genes that should be eliminated according to my rule. I will add a screenshot which shows my problem.
Thanks in advance.