Calculate the percentage by column total and not by the grouped variable

Asked Mar 20 '17 at 15:30

Active Mar 20 '17 at 15:30

Viewed 470 times

As a follow up to this question, what is the best way if you would like to calculate the percentage not as a share inside of the group, but rather as a share of the column total.

d <- read.table(text="Fuel     Year   Region   Count
Gasoline 2013       GE  169600
                Diesel   2013       GE   46790
                Hybrid   2013       GE    2268
                Electric 2013       GE      85
                Other    2013       GE     532
                Gasoline 2013       VS  149232
                Diesel   2013       VS   50591
                Hybrid   2013       VS    1028
                Electric 2013       VS     268
                Other    2013       VS     261", header = TRUE)


d <- data.table(d)

I would then like to calculate the share of fuels irrespective of the regions. So in a first step I would like to have this:

d[, .(Car.Total = sum(Count)), by = "Fuel"]

Is there a better way to calculate the percentage than this:

d[, .(Car.Total = sum(Count)
    , Car.Share = sum(Count)/sum(d[,Count])), by = "Fuel"]

This seems pretty inefficient, but works. Is there any more efficient way using only data.table methods.

asked Mar 20 '17 at 15:30

hannes101

2,410
1
17
40

2

`d[, .(ct = sum(Count)), by=Fuel][, s := ct/sum(ct)][]` ? – Frank Mar 20 '17 at 15:36
This works, but there's no function, which allows to calculate the total sum in the first `[]`, or? – hannes101 Mar 20 '17 at 15:57
1

No, not really. Everything in `j` is computed by group. Some more convoluted approaches here: http://stackoverflow.com/a/30944435/ – Frank Mar 20 '17 at 16:03
Another option: `d[, sum(Count) / sum(d$Count), Fuel]` – BrodieG Mar 20 '17 at 17:21
@BrodieG, well that's basically the same as the one I am using in the question ;-) – hannes101 Mar 20 '17 at 19:44
this is true... – BrodieG Mar 20 '17 at 20:11

Calculate the percentage by column total and not by the grouped variable

0 Answers0