I am working on a web app where I collect data about mobile phones from several websites. The problem is that websites use slightly different naming of mobile phones. For eg the websites use the following variation in names for these two mobiles:
HTC One X+ (Black); HTC One X+ Black; HTC One X Plus; HTC One X Plus, black
Samsung Galaxy S3 (Pebble Blue, with 16GB); Samsung Galaxy S III (Blue); Samsung Galaxy S3 I9300 16GB Pebble Blue; Samsung I9300 Galaxy S III (16 GB); Samsung Galaxy S3 (I9300), pebble blue
Since I read this data off these websites using a crawler I need my program to resolve all these different strings into same product.
Any ideas? If it matters, I am using python.