This is what I do:
for word in doc:
if len(word) < 3:
doc.remove(word)
But, if I do this:
for word in doc:
if len(word) < 3:
print word
The results returned are: 'O,' 'Of' '30' '4.' 'I.' 'IF' and more.
Most two character items are removed, but a few still remain, am I doing something wrong?