R Name change after unstacking

Question

Imagine I have the following stacked data matrix:

mY <- data.frame(matrix(c(c(1:10),c("A 1","A 1","A 1","A 1","A 1","B 1","B 1","B 1","B 1","B 1")),10))

Resulting in:

This is just an example of a data frame which I want to unstack, where entries in X2 contain a space character. It could also have been 'hot dog', or 'boiled egg'.

When I use

mB <- unstack(mY, X1~X2)

I get

Notice that the name of the columns have changed to A.1 and B.1, which were previously defined as 'A 1' and 'B 1'. When I use mB["A 1"] it returns null, whereas mB["A.1"] returns column A.1. How can I overcome this?

Thanks in advance.

See also [this question](http://stackoverflow.com/questions/3411201/specifying-column-names-in-a-data-frame-changes-spaces-to) — ROLO, Sep 15 '13 at 11:56
`unstack` doesn't let you pass `check.names = FALSE` to it (see `utils:::unstack.data.frame` for the code), so you're somewhat stuck with R making what it considers syntactically valid names and having to manually rename them later (if you want to break the rules). — A5C1D2H2I1M1N2O1R2T1, Sep 15 '13 at 12:08

Josh O'Brien · Accepted Answer · 2013-09-15T12:26:45.720

1

Using column names with spaces is a mostly bad idea, but if you want to go ahead and use them anyway, here's a simple workaround. It uses setNames() to rename the columns to the names stored in unique(mY$X2).

setNames(unstack(mY, X1~X2), unique(mY$X2))
#   A 1 B 1
# 1   1   6
# 2   2   7
# 3   3   8
# 4   4   9
# 5   5  10

edited Sep 15 '13 at 12:26

answered Sep 15 '13 at 12:11

Josh O'Brien

159,210
26
366
455

Note that this may set names in the wrong order if some are "fixed" and others are "valid" – Michael Schubert Sep 23 '22 at 09:29

Henrik · Answer 2 · 2013-09-15T17:04:12.910

Out of curiosity I checked this: Unstacking with dcast keeps the "syntactically invalid names".

library(reshape2)

# need to create an id variable that is used as 'row variable', LHS in the casting formula
mY$id <- ave(mY$X2, mY$X2, FUN = seq_along)

dcast(data = mY, id ~ X2, value.var = "X1")

#   id A 1 B 1
# 1  1   1   6
# 2  2   2   7
# 3  3   3   8
# 4  4   4   9
# 5  5   5  10

@Josh O'Brien's solution is much cleaner though.

R Name change after unstacking

2 Answers2