Finding the row names of my P values in R

Question

I have hundreds of P values, each of which corresponds to a row name in my data frame. I put those values into a new column of the original table:

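df1$Pvalues <- sapply(seq_len(nrow(data1)), function(i) {
    # Wilcoxon rank-sum test comparing row i of data1 with row i of data2
    wilcox.test(as.numeric(data1[i, ]), as.numeric(data2[i, ]))$p.value
})  # sapply returns a numeric vector (lapply would return a list)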

I found the top 20 most significant P values and now need to find out which column name they correspond to. I have tried:

which(rownames(df1) %in% c("1.136925e-12"))

But the result is integer(0).

Another way would be to print the top 20 most significant P values along with column names straight away but all I have is the actual P values. In this command wilcoxon is the name of the dataframe where I have subset the P values:

head(sort(wilcoxon),20)

I'm a beginner, any help would be appreciated!

Can you make your example [reproducible](http://stackoverflow.com/questions/5963269/how-to-make-a-great-r-reproducible-example)? Generally, it might be easier for you to separate input from output. — Heroka, Nov 02 '15 at 15:29
If you want to find which column they belong to, why you are comparing with `row.names`? — akrun, Nov 02 '15 at 15:32

zielinskipp · Answer 1 · 2015-11-05T19:23:43.427

0

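First you need to find the 20 smallest values, e.g. by sorting the vector of P values and taking the first 20 elements. Once you know which values you are looking for, you can test which entries of the P value column are among them; that test gives a logical vector (TRUE for the matching rows), and you can use it to index row.names.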

x <- sort(df1$Pvalues)[1:20]

row.names(df1)[df1$Pvalues %in% x]
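
To see why this works, here is a minimal, self-contained sketch on made-up data. The names `gene1` ... `gene30`, the group sizes, and the simulated values are all assumptions for illustration, not the asker's data. The attempt in the question returned `integer(0)` because row names are labels, not P values, so none of them equals the string `"1.136925e-12"`. A printed P value is also rounded, so matching on it would be unreliable even against the right column. The sketch matches on the stored values themselves and also shows `order()`, which returns row positions directly.

```r
# Hypothetical toy data: 30 features (rows) by 8 samples (columns) per group.
set.seed(1)
feature_names <- paste0("gene", 1:30)
data1 <- matrix(rnorm(240), nrow = 30, dimnames = list(feature_names, NULL))
data2 <- matrix(rnorm(240, mean = 0.5), nrow = 30,
                dimnames = list(feature_names, NULL))

df1 <- as.data.frame(data1)  # row names are carried over from the matrix

# vapply returns a plain numeric vector, so the new column can be sorted.
df1$Pvalues <- vapply(seq_len(nrow(data1)), function(i) {
  wilcox.test(data1[i, ], data2[i, ])$p.value
}, numeric(1))

# Approach from the answer: take the 20 smallest values, then match them back.
x <- sort(df1$Pvalues)[1:20]
top_names <- row.names(df1)[df1$Pvalues %in% x]

# Alternative: order() gives the row positions sorted by P value, so the
# row names and P values stay together and come out most significant first.
top_idx <- order(df1$Pvalues)[1:20]
top_table <- df1[top_idx, "Pvalues", drop = FALSE]
print(top_table)
```

Two caveats on the `%in%` approach: it returns the names in their original row order rather than sorted by significance, and if several rows share the 20th smallest P value it can return more than 20 names. The `order()` variant always returns exactly 20 rows.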

edited Nov 05 '15 at 19:23

answered Nov 02 '15 at 21:04

zielinskipp


1

try to explain why that would work, and not just give a one-liner. – BobbyTables Nov 03 '15 at 07:34
