I have a product data table given to me in text file and the format of the data is like below:
**START
IB 1107518415
BI Paperback
BC JHBC
BC JFDD
CO United Kingdom
ED Alvarez, R. Michael
IU 46 b/w illus. 2 maps 18 tables
MP No
PD 20160307
I3 9781107518414
**
Basically, a TAG system is followed where each TAG has a unique 2-digit identifier followed by the relevant data itself. A CR/LF will be at the end of each field.
** Markers is used to indicate the beginning of a new record.
The file is 3 GB and it has 6 million such record, each record having upto 43 fields and some fields having multiple values like the "BC" field in the above sample, I'd need both the values.
What would be the best way to get this file into a collection?
Thanks Mohamed