-1

Can I cluster data with one variable instead of many (What I had already test) using mahout K-means Algorithm ? if yes (I hope so :) )could you give me an Example of clustering and thinks

user3260388
  • 313
  • 1
  • 4
  • 10

1 Answers1

1

How big is your data? If it is not exabytes, you would be better off without Mahout.

If it is exabytes, use sampling, and then process it on a single machine.

See also:

and many more.

Mahout is not your general go-to place for data anlysis. It only shines when you have Google scale data. Otherwise, the overhead is too large.

Community
  • 1
  • 1
Has QUIT--Anony-Mousse
  • 76,138
  • 12
  • 138
  • 194