The idea is not to remove just one of the duplicate rows, but to drop every row that is repeated.
Data that I have now
Name | Score |
---|---|
AB | 75 |
AB | 75 |
BC | 50 |
CD | 70 |
Expected Result
Name | Score |
---|---|
BC | 50 |
CD | 70 |
One option is to use group_by() to identify the combinations of variables that are duplicated and then filter on group size, keeping only groups that contain a single row.
library(tidyverse)
withdups <- data.frame(
  stringsAsFactors = FALSE,
  Name = c("AB", "AB", "BC", "CD"),
  Score = c(75L, 75L, 50L, 70L)
)

withdups %>%
  group_by(Name, Score) %>%  # group identical Name/Score combinations together
  filter(n() == 1)           # keep only groups with exactly one row
#> # A tibble: 2 x 2
#> # Groups: Name, Score [2]
#> Name Score
#> <chr> <int>
#> 1 BC 50
#> 2 CD 70
Created on 2022-04-12 by the reprex package (v2.0.1)
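For reference, the same result can also be obtained in base R by flagging duplicates from both directions with duplicated(). This is a minimal sketch that assumes the same withdups data frame as above:

# duplicated() marks later copies; fromLast = TRUE marks earlier ones.
# The union covers every row that appears more than once, so negating it
# keeps only the rows that occur exactly once (the BC and CD rows here).
withdups[!(duplicated(withdups) | duplicated(withdups, fromLast = TRUE)), ]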