I've been assigned a task to extract features/attributes from product description.
Levi Strauss slim fit jeans
Big shopping bag in pink and gold
I need to be able to extract out attributes such as "Jeans" and "slim fit" or "shopping bag" and "pink" and "gold". The product description listings are not just for clothes, they can basically be anything.
I am not sure how to approach this problem. I tried implementing a Named Entity Recognizer solution and also a POS implementations, The NER implementation fails to recognize any token and most of the tokens show up as NNP(Proper Nouns) in he POS solution, which doesn't help me out a lot. I need a way to be able to distinguish between the brand name and the features of the Product(like if it is a t-shirt, the color or design(round neck, v-neck) etc).
I did implement a KMean solution which did cluster like products together, but then again it is not the result I am looking for.
Just looking for someone to direct me in the correct direction.