I'm looking for help with a problem I'm trying to solve in R.
I have DNA alignment results in a large dataset (more than 200,000 rows and 20 columns). I want to clean it by deleting the non-specific sequences, so that at the end I have just one DNA sequence name for one species.
I've tried the unique(), duplicated() and distinct() functions, but they always keep the first of the duplicated rows, and I don't want that: I would like to delete ALL the duplicated rows, including the first occurrence.
Do you have an idea how to solve my problem?
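
To make the goal concrete, here is a minimal sketch on a made-up toy data frame. The column names seq_name and species are assumptions for illustration, not the real columns. It shows one possible base R approach: combining duplicated() with a second pass using fromLast = TRUE flags every copy of a repeated value, rather than only the later ones.

```r
# Toy data; seq_name and species are hypothetical column names.
aln <- data.frame(
  seq_name = c("seqA", "seqB", "seqA", "seqC", "seqB", "seqD"),
  species  = c("sp1", "sp1", "sp2", "sp3", "sp2", "sp4"),
  stringsAsFactors = FALSE
)

# duplicated() alone marks only the 2nd, 3rd, ... occurrences,
# so the first copy of each repeated name survives.
keep_first <- aln[!duplicated(aln$seq_name), ]

# Scanning from both ends marks every occurrence of a repeated name.
is_dup <- duplicated(aln$seq_name) |
  duplicated(aln$seq_name, fromLast = TRUE)

# Keep only the rows whose sequence name appears exactly once.
specific <- aln[!is_dup, ]

print(specific)
```

In this toy case only seqC and seqD remain, because seqA and seqB each match two species. On the real data the key would presumably be whichever column identifies the sequence, and passing a data frame of several columns to duplicated() would compare whole rows instead.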