I'm trying to work with a (very) large 45gb .txt file that cannot be opened using normal text editors.
Data within each row is separated by a spacing, although there are also spaces within each parameter. For example, 1 row looks somewhat like this:
University of Cambridge CB2 1TQ 0001234567 2011-01-25 12345 11.12345 12.12345 13.12345 14.1234 16.2716)
What I would want to is essentially to clean it up, filter some data, sum up some cells/parameters, append some additional data to each row, and perform clustering analysis (probably with python or C# since I'm starting to pick it up).