12

I have a data frame with the columns city, state, and country. I want to create a string that concatenates: "City, State, Country". However, one of my cities doesn't have a State (has a NA instead). I want the string for that city to be "City, Country". Here is the code that creates the wrong string:

# define City, State, Country
  city <- c("Austin", "Knoxville", "Salk Lake City", "Prague")
  state <- c("Texas", "Tennessee", "Utah", NA)
  country <- c("United States", "United States", "United States", "Czech Rep")
# create data frame
  dff <- data.frame(city, state, country)
# create full string
  dff["string"] <- paste(city, state, country, sep=", ")

When I display dff$string, I get the following. Note that the last string has a NA,, which is not needed:

> dff["string"]
                               string
1        Austin, Texas, United States
2 Knoxville, Tennessee, United States
3 Salk Lake City, Utah, United States
4               Prague, NA, Czech Rep

What do I do to skip that NA,, including the sep = ", ".

Jose R
  • 930
  • 1
  • 11
  • 22
  • 2
    There is a general discussion of suppressing NAs in paste [here](http://stackoverflow.com/questions/13673894/suppress-nas-in-paste), should you have more than one column containing NAs. – JWilliman Sep 08 '15 at 03:09

2 Answers2

10

The alternative is to just fix it up afterwards:

gsub("NA, ","",dff$string)

#[1] "Austin, Texas, United States"       
#[2] "Knoxville, Tennessee, United States"
#[3] "Salk Lake City, Utah, United States"
#[4] "Prague, Czech Rep"   

Alternative #2, is to use apply once you have your data.frame called dff:

apply(dff, 1, function(x) paste(na.omit(x),collapse=", ") )
thelatemail
  • 91,185
  • 12
  • 128
  • 188
7

Late to the party, but unite provides a one-step approach:

dff %>% unite("string", c(city, state, country), sep=", ", remove = FALSE, na.rm = TRUE)
                              string           city     state       country
1        Austin, Texas, United States         Austin     Texas United States
2 Knoxville, Tennessee, United States      Knoxville Tennessee United States
3 Salk Lake City, Utah, United States Salk Lake City      Utah United States
4                   Prague, Czech Rep         Prague      <NA>     Czech Rep
broti
  • 1,338
  • 8
  • 29