I am using kaggle data set. Due to large size of dataset it is hard to insert dput
output. But I am trying to do the sum of yearly food production by region. And I am using aggregation for that purpose. For some reason it is displaying below error:
Aggregation R code:
years<-colnames(p[,11:63])
agg<-aggregate(years~area, data=p, sum)
Error:
Error in model.frame.default(formula = years ~ area, data = p) :
variable lengths differ (found for 'area')
I tried below link, but it seems not to be very useful to me:
Note: Dataset contains N/A. It was deleted using na.omit
function
Update after bk18 comment
> p[, lapply(.SD, class)]
area_abb area_code area item_code item element_code element Unit latitude longitude
1: factor integer factor integer factor integer factor factor numeric numeric
Y1961 Y1962 Y1963 Y1964 Y1965 Y1966 Y1967 Y1968 Y1969 Y1970 Y1971
1: integer integer integer integer integer integer integer integer integer integer integer
Y1972 Y1973 Y1974 Y1975 Y1976 Y1977 Y1978 Y1979 Y1980 Y1981 Y1982
1: integer integer integer integer integer integer integer integer integer integer integer
Y1983 Y1984 Y1985 Y1986 Y1987 Y1988 Y1989 Y1990 Y1991 Y1992 Y1993
1: integer integer integer integer integer integer integer integer integer integer integer
Y1994 Y1995 Y1996 Y1997 Y1998 Y1999 Y2000 Y2001 Y2002 Y2003 Y2004
1: integer integer integer integer integer integer integer integer integer integer integer
Y2005 Y2006 Y2007 Y2008 Y2009 Y2010 Y2011 Y2012 Y2013
1: integer integer integer integer integer integer integer integer integer
Thanks in advance,