7

Can someone offer a suggestion on where to find a dictionary word list with frequency information?

Ideally, the source would be English words of the North American variety.

jimmym715
  • 1,512
  • 1
  • 16
  • 25
AlgoMan
  • 2,785
  • 6
  • 34
  • 40

4 Answers4

3

How about this?

Patrick Beardmore
  • 1,026
  • 1
  • 11
  • 35
  • You could use Google Refine (http://code.google.com/p/google-refine/) to extract the data like they do in this tutorial video http://www.youtube.com/watch?v=45EnWK-fE9k. – Pat Mar 26 '11 at 15:52
3

Try Kevin's Word List.

http://wordlist.sourceforge.net/

It's opensource, plain text and has many dictionaries.

Yada
  • 30,349
  • 24
  • 103
  • 144
3

Check the following link, contains unigrams/bigrams/trigrams corpus

http://blog.afterthedeadline.com/2010/07/20/after-the-deadline-bigram-corpus-our-gift-to-you/

tszming
  • 2,084
  • 12
  • 15
1

I don't know about frequency information but the Open Office dictionaries would be a good place to look for LGPL word lists.

Dean Taylor
  • 40,514
  • 3
  • 31
  • 50