I have one continuous variable with 76 observations and 29877 more continuous variables i would like to run linear regressions against. The code for one might look something like:
lm(data=spread_gene_exp, spread_gene_exp[,i] ~ spread_gene_exp[,2])
where i is one of the 29877 Y variables.
I would then like to extract the coefficient estimate and P-values from the results and add these to some kind of data frame where i can easily identify which variables were the most significant. This would probably need some kind of loop but i'm not sure where to start.
I would love to run this off of my own desktop but it will most likely take quite some time and so will probably have to run it on the university server, any estimation on how long it will take on a midrange/semi-powerful desktop would be helpful.