Really don't know where to start seeking for the right algorithm.
I'm building a web application that collects schema.org data from different webshops as Amazon, Shopify, etc. It collects data every 6h and shows the current and lowest price. It is used for monitoring products and buying at the lowest price.
My goal is to recognize products from different shops as the same product. Every shop has its own title for the same product.
Example:
Google Pixel 2 64GB Clearly White (Unlocked) Smartphone
Google Pixel 2 GSM/CDMA Google Unlocked (Clearly White, 64GB, US warranty)
Problems:
- don't have a lot of data (only products chosen by the user)
- needs to support every new product that app doesn't have data history