0

Examples: mapping a plural noun to its singular form: children --> child, boxes ---> box mapping comparative and superlative of adjectives and adverbs to their basic form mapping verb ....

I once found an open source resource to achieve it. As I remember, it is a datasheet?/spreadsheet?/database? of the mapping. Does anyone know the open source resource, or any alternative/better solution? I'm coding in java.

Johannes Pille
  • 4,073
  • 4
  • 26
  • 27
lex
  • 25
  • 4

2 Answers2

2

That is called stemming and Lucene (specifically snowball contrib) can do that.

There is a number of different stemming algorithms.

Community
  • 1
  • 1
om-nom-nom
  • 62,329
  • 13
  • 183
  • 228
  • Actually the process is called lemmatization. See answer [here](http://stackoverflow.com/a/4025259/547040). – Khairul Apr 07 '12 at 10:32
  • @Khairul Interesting. [Wikipedia says](http://en.wikipedia.org/wiki/Stemming#Lemmatisation_algorithms) that lemmatisation is the subset of stemming. – om-nom-nom Apr 07 '12 at 13:26
0

There was a paper in the '80s about the UNIX spell checker which explained how stemming worked. This Wikipedia UNIX Spell article gives a link to the paper, and also identifies some alternatives.

gbulmer
  • 4,210
  • 18
  • 20