This is what I have:
df <- structure(list(Sample = structure(c(1L, 1L, 2L, 2L, 3L, 3L, 4L,
4L), .Label = c("19-0001", "19-0002", "19-0003", "19-0004"), class = "factor"),
Replicate = c(1L, 2L, 1L, 2L, 1L, 2L, 1L, 2L), X24854000 = structure(c(1L,
2L, 2L, 1L, 2L, 2L, 1L, 1L), .Label = c("", "CC"), class = "factor"),
X24854056 = structure(c(3L, 3L, 2L, 1L, 1L, 1L, 1L, 1L), .Label = c("",
"AA", "GG"), class = "factor"), X24854764 = structure(c(1L,
1L, 1L, 1L, 1L, 1L, 1L, 1L), .Label = "TA", class = "factor"),
X24854903 = structure(c(1L, 1L, 1L, 1L, 2L, 2L, 1L, 1L), .Label = c("",
"CT"), class = "factor"), X24855066 = structure(c(1L, 1L,
3L, 3L, 2L, 2L, 2L, 2L), .Label = c("", "CA", "CC"), class = "factor"),
X24855114 = structure(c(2L, 1L, 3L, 3L, 2L, 2L, 2L, 2L), .Label = c("",
"GA", "GG"), class = "factor"), X24855316 = structure(c(2L,
2L, 1L, 1L, 2L, 2L, 2L, 1L), .Label = c("", "TC"), class = "factor"),
X24855449 = structure(c(1L, 2L, 2L, 2L, 1L, 1L, 1L, 1L), .Label = c("CC",
"GG"), class = "factor"), X24855925 = structure(c(2L, 1L,
1L, 3L, 2L, 2L, 1L, 1L), .Label = c("", "GA", "GG"), class = "factor"),
X24856070 = structure(c(2L, 1L, 1L, 1L, 1L, 1L, 1L, 1L), .Label = c("CC",
"CT"), class = "factor"), X24856086 = structure(c(2L, 1L,
2L, 2L, 2L, 2L, 2L, 2L), .Label = c("CC", "CT"), class = "factor"),
X24856329 = structure(c(2L, 1L, 2L, 2L, 2L, 2L, 2L, 2L), .Label = c("",
"AG"), class = "factor"), X24856389 = structure(c(2L, 1L,
1L, 1L, 2L, 2L, 2L, 2L), .Label = c("", "GG"), class = "factor"),
X24857235 = structure(c(1L, 1L, 1L, 1L, 2L, 2L, 2L, 1L), .Label = c("",
"CT"), class = "factor"), X24857350 = structure(c(3L, 3L,
1L, 1L, 2L, 2L, 1L, 1L), .Label = c("", "GA", "GG"), class = "factor"),
X24857404 = structure(c(1L, 3L, 1L, 1L, 2L, 2L, 1L, 1L), .Label = c("",
"AT", "TT"), class = "factor")), class = "data.frame", row.names = c(NA,
-8L))
This generates this table
Sample Replicate X24854000 X24854056 X24854764 X24854903 X24855066 X24855114 X24855316 X24855449 X24855925 X24856070 X24856086 X24856329 X24856389 X24857235 X24857350 X24857404
19-0001 1 GG TA GA TC CC GA CT CT AG GG GG
19-0001 2 CC GG TA TC GG CC CC GG TT
19-0002 1 CC AA TA CC GG GG CC CT AG
19-0002 2 TA CC GG GG GG CC CT AG
19-0003 1 CC TA CT CA GA TC CC GA CC CT AG GG CT GA AT
19-0003 2 CC TA CT CA GA TC CC GA CC CT AG GG CT GA AT
19-0004 1 TA CA GA TC CC CC CT AG GG CT
19-0004 2 TA CA GA CC CC CT AG GG
This is what I want:
Sample Replicate X24854000 X24854056 X24854764 X24854903 X24855066 X24855114 X24855316 X24855449 X24855925 X24856070 X24856086 X24856329 X24856389 X24857235 X24857350 X24857404
19-0001 1 CC GG TA GA TC 99 GA 99 99 AG GG GG TT
19-0002 1 CC AA TA CC GG GG GG CC CT AG
19-0003 1 CC TA CT CA GA TC CC GA CC CT AG GG CT GA AT
19-0004 1 TA CA GA TC CC CC CT AG GG CT
Merging of replicate 1 and 2 under the same sample name. Missing or same score can be replaced by the other but any mismatches should be replaced by "99" so they can be removed later.
I tried:
data_merge <- data %>%
group_by(Sample) %>%
summarise_all(ifelse(statement), (if_true), (if_false))
I only subset the data, the real data have 44 of X numbers.