I am trying to use machine learning to segment customers based on the products they use together. My dataset is large, with 2.4 million records, and is in the following format:
customer_id prod_1 prod_2 prod_3 prod_4 ..... prod_10
000 1 0 0 1 ..... 1
001 0 0 1 1 ..... 1
011 0 1 0 1 ..... 0
021 1 0 1 1 ..... 0
...
Each row contains a customer ID followed by a 1 or 0 for each product, indicating whether or not the customer has that product. I ran k-means, but the results did not look impressive.
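For concreteness, here is a minimal sketch of the kind of k-means baseline described above, written in plain Python on just the four sample rows. It is illustrative only: it uses only the columns visible in the sample (prod_5 to prod_9 are elided), and the choice of k=2, the seed, and the helper names `squared_distance` and `kmeans` are assumptions made for this example, not necessarily the exact setup that was run on the full data.

```python
import random

# Only the columns visible in the sample; prod_5..prod_9 are elided in the post.
columns = ["prod_1", "prod_2", "prod_3", "prod_4", "prod_10"]
customers = {
    "000": [1, 0, 0, 1, 1],
    "001": [0, 0, 1, 1, 1],
    "011": [0, 1, 0, 1, 0],
    "021": [1, 0, 1, 1, 0],
}


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(rows, k, n_iter=20, seed=0):
    """Plain Lloyd's algorithm: assign rows to the nearest centroid, then re-average."""
    rng = random.Random(seed)
    # Start from k distinct rows picked at random (hypothetical init choice).
    centroids = [list(r) for r in rng.sample(rows, k)]
    labels = [0] * len(rows)
    for _ in range(n_iter):
        # Assignment step: each row goes to its closest centroid.
        labels = [
            min(range(k), key=lambda c: squared_distance(row, centroids[c]))
            for row in rows
        ]
        # Update step: each centroid becomes the mean of its members.
        for c in range(k):
            members = [row for row, lab in zip(rows, labels) if lab == c]
            if members:  # keep the old centroid if a cluster ends up empty
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


ids = list(customers)
labels, centroids = kmeans([customers[i] for i in ids], k=2)
segments = dict(zip(ids, labels))  # customer_id -> cluster label
print(segments)
```

Each centroid coordinate here is just the share of customers in that cluster who hold the product, so on 0/1 data the clusters are separated only by Euclidean distance between these ownership vectors.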
Are there any other types of models I could run on data like this to segment customers based on the products they use together?