Before someone points out duplicate, this is not the same question as this.
In that question, his error was
ValueError: Some errors were detected !
Line #88 (got 1435 columns instead of 1434)
having 1 more column than expected (likely an extra delimiter somewhere).
I am processing a file with two columns separated by a tab ('\t') and am using the following
movies = np.genfromtxt('imdb/movie_keywords', delimiter = '\t', dtype = None)
I receive the following error
ValueError: Some errors were detected !
Line #44209 (got 1 columns instead of 2)
Line #44210 (got 1 columns instead of 2)
Line #44211 (got 1 columns instead of 2)
Line #93460 (got 1 columns instead of 2)
...
Here are four lines (raw text) from the file,
The first two are line #1 and line #, which do not throw an errors
'$ (1971)\tbank-heist'
'Angela (1954)\tamerican-car-salesman'
These are from lines #44209 # 93463, which throw an error
'Animated (1989)\taustralian'
'Animated Motion #1 (1976)\tindependent-film'
Might some sleuth point out the difference here which causes numpy not to pick up the tab in the error throwing lines?
To add, I receive no error if using pandas and this code:
keywords = pd.read_csv('imdb/movie_keywords', delimiter = '\t', dtype = None, names = ['movie', 'keyword'])
Pandas however is not sufficient for the operations I wish to conduct.