Gene manipulation in R

Asked Jul 24 '22 at 05:00

Active Jul 24 '22 at 11:15

Viewed 48 times

I want to add the frequency columns shown below to my data frame (`clean_df_mut_counts`), and then get the allele with the second-highest frequency for each patient × gene combination.

patient gene A_count C_count G_count T_count A_freq C_freq G_freq T_freq

"ptp_1" "BRCA1" values values values values

"ptp_1" "BRCA2" values

"ptp_2" "BRCA1"

"ptp_2" "BRCA2"

I tried the following. I was able to get `pivot_wider` to work, which gave the result below:

clean_df_mut_counts_wide <- clean_df_mut_counts %>%  pivot_wider(
  names_from = base,
  values_from = count
)

Result of the above code (shared as an image):

clean_df_mut_counts_wide %>%
  group_by(A, T, C, G) %>%
  summarise(n = n()) %>%
  mutate(freq = n / sum(n)) %>%
  top_n(n = 2)

Result of the above code (shared as an image):
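For reference, the target described at the top can be sketched roughly as follows. This is only a minimal sketch, not a verified solution for the real data: it assumes `clean_df_mut_counts` is in long format with the columns `patient`, `gene`, `base` and `count` (the names implied by the `pivot_wider` call above), and the toy values and the names `freq_long`, `second_allele`, `second_base`, `second_freq` and `wide` are made up for illustration.

```r
library(dplyr)
library(tidyr)

# Hypothetical toy data in the assumed long format:
# one row per patient / gene / base.
clean_df_mut_counts <- tibble(
  patient = rep(c("ptp_1", "ptp_2"), each = 4),
  gene    = "BRCA1",
  base    = rep(c("A", "C", "G", "T"), times = 2),
  count   = c(50, 30, 15, 5, 10, 20, 60, 10)
)

# Frequency of each base within its patient / gene group.
freq_long <- clean_df_mut_counts %>%
  group_by(patient, gene) %>%
  mutate(freq = count / sum(count)) %>%
  ungroup()

# Allele with the second-highest frequency per patient / gene.
# Ties are broken by row order here, which is an arbitrary choice.
second_allele <- freq_long %>%
  group_by(patient, gene) %>%
  arrange(desc(freq), .by_group = TRUE) %>%
  slice(2) %>%
  ungroup() %>%
  select(patient, gene, second_base = base, second_freq = freq)

# Wide layout with A_count ... T_count followed by A_freq ... T_freq.
wide <- freq_long %>%
  pivot_wider(
    names_from  = base,
    values_from = c(count, freq),
    names_glue  = "{base}_{.value}"
  )
```

Computing the frequencies in long format before widening keeps the grouping on `patient` and `gene`, rather than on the count columns themselves as in the `group_by(A, T, C, G)` attempt.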


analog_kid


[Please provide a reproducible example of your data](https://stackoverflow.com/questions/5963269/how-to-make-a-great-r-reproducible-example). – user438383 Jul 24 '22 at 08:11
Please check now. – analog_kid Jul 24 '22 at 11:16
Please use ``dput()`` to share data and not as an image. – user438383 Jul 24 '22 at 11:29
https://reprex.tidyverse.org/ – Baraliuh Jul 24 '22 at 16:39

0 Answers