3

I am using word2vec pretrained embedding in PyTorch (following code here). However, it does not seem to handle unseen words. Is there any good way to solve it?

Mr.cysl
  • 1,494
  • 6
  • 23
  • 37

1 Answers1

1

FastText builds character ngram vectors as part of model training. When it finds an OOV word, it sums the character ngram vectors in the word to produce a vector for the word. You can find more detail here.

polm23
  • 14,456
  • 7
  • 35
  • 59