from pyconll import load_from_file
data = load_from_file("filename.conll")
data
I'm following the documentation of pyconll to read in a .conll
file, yet the following error occurs and I don't understand what it means. The dataset should be readable since it's kinda a benchmark dataset. And I don't see any other parameters of pyconll.load_from_file()
that can be specified in the documentation. Can anybody help me out here?
Plus, is there a way to read .conll
file by the nltk
package?
ParseError Traceback (most recent call last)
<ipython-input-14-06859f7ce8b2> in <module>()
----> 1 data = load_from_file("filename.conll")
2 data
5 frames
/usr/local/lib/python3.6/dist-packages/pyconll/unit/token.py in __init__(self, source, empty)
661 error_msg = 'The number of columns per token line must be 10. Invalid token: {}'.format(
662 source)
--> 663 raise ParseError(error_msg)
664
665 # Assign all the field values from the line to internal equivalents.
ParseError: The number of columns per token line must be 10. Invalid token: @paulwalk O