I'm practicing text mining for a PhD project in Human Science on Tweets (Twitter).
I have some difficulties to strip the special characters (“”)
which are coded by Twitter API to enclose the new "retweet with comment" function.
I have tested to use (\“)
or (\'')
without success.
I encountered the following error each time:
(...error tex...)'\�'(...error text...)
which means that R
do not recognize this both special characters (“)
and (”)
.
For instance, I have replace each first character before a (@)
by a (")
which allow me apply basic statistics on the variable "retweet with comment", but I can't go further with text mining functions to work on the characters inside (“@...”)
.
Have anyone ever encounter this type of trouble ?
R.Version()
$platform
[1] "x86_64-apple-darwin10.8.0"
$arch
[1] "x86_64"
$os
[1] "darwin10.8.0"
$system
[1] "x86_64, darwin10.8.0"
$status
[1] ""
$major
[1] "3"
$minor
[1] "1.0"
$year
[1] "2014"
$month
[1] "04"
$day
[1] "10"
$`svn rev`
[1] "65387"
$language
[1] "R"
$version.string
[1] "R version 3.1.0 (2014-04-10)"
$nickname
[1] "Spring Dance"