Problem I have at hand is to build a product suggestion model which suggest products based on the context of the search query of a user. My plan is to get a pre-trained model from the sentence-transformers pre-trained models and embed product descriptions using that and then get a list of top k most semantically similar products to the encoded query which user typed using the same model. I want to know how should I prepare my dataset to finetune the model to the product domain (e-commerce). Data set I have is called the amazon-esci dataset which has products relevant to search queries by users, labelled.
I tried to directly use the pre-trained model. But it suggest irrelevant products. I would like to know how to fine-tune the model.