I have the dataset below and need to perform a complete analysis on it, including principal component analysis (PCA), correspondence analysis (CA), and multiple correspondence analysis (MCA).
https://archive.ics.uci.edu/ml/datasets/Tarvel+Review+Ratings
I've run into some problems performing CA:
* First, in understanding the dataset. CA is meant for categorical variables, and as far as I can tell my dataset has only one categorical variable, the attraction category ("attractions from 24 categories across Europe"). Could the second categorical variable be the users?
* Second, when I run the R code below to perform CA with the CA() function from the FactoMineR package, I get the error shown after the code.
R code:
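library(FactoMineR)
# Read the CSV, using the first column (the user ID) as row names
data <- read.csv(file.choose(), row.names = 1)
View(data)
# Keep the 24 rating columns, dropping the extra column "X"
data.active <- data[, 1:24]
# Run correspondence analysis without drawing the plots
res.data <- CA(data.active, graph = FALSE)
summary(res.data, nb.dec = 2)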
Error:
res.data <-CA(data.active,graph = FALSE)
Error in eigen(crossprod(t(X), t(X)), symmetric = TRUE) : infinite or missing values in 'x'
I don't understand what is causing the error. When I read the dataset into the "data" variable, there is an extra column called "X" with NA in every cell. I don't know where it came from, so I used "data.active" in the R code to drop column "X". I then checked the data.active table for NA values, but it looks fine.
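For anyone who wants to repeat that check, here is a minimal sketch of one way to look for missing or non-numeric values using base R only. The data frame "toy" and its contents are made up for illustration and are not taken from the real file; the assumption is simply that a column can contain an NA or a malformed entry that makes read.csv() store it as character.

```r
# Hypothetical toy data standing in for the ratings file (not the real dataset).
toy <- data.frame(
  Category.1 = c(0.93, 1.02, NA),       # one missing rating
  Category.2 = c("1.8", "2.2", "bad"),  # a malformed entry turns the column into character
  X          = c(NA, NA, NA),           # an all-NA column like the unexpected "X"
  stringsAsFactors = FALSE
)

# Columns in which every cell is NA
all_na <- names(toy)[colSums(is.na(toy)) == nrow(toy)]

# Drop those columns by name rather than by position
active <- toy[, setdiff(names(toy), all_na), drop = FALSE]

# Number of NA cells left in each remaining column
na_counts <- colSums(is.na(active))

# Remaining columns that were not read as numeric
non_numeric <- names(active)[!vapply(active, is.numeric, logical(1))]

print(all_na)
print(na_counts)
print(non_numeric)
```

Running the same three checks on data.active would show whether any NA cells or non-numeric columns are still present before calling CA().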
If anyone can help me solve this problem, it would be much appreciated. Thank you.