I have a large text file (around 450 MB). I read it in chunks, and the output is string data.
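import pandas as pd
import numpy as np
import re

def readInChunks(fileObj, chunkSize=2048):
    # Yield the file's contents in pieces of at most chunkSize characters.
    while True:
        data = fileObj.read(chunkSize)
        if not data:
            break
        yield data

# Collect every chunk of the file into a list of strings.
result = []
f = open("textfile.txt")
for chunk in readInChunks(f):
    result.append(chunk)
f.close()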
The result is a list of string chunks; call it result. Here is what result[0] looks like:
Alin Deutsch, Mary F. Fernandez, 1998
Alin Deutsch, Daniela Florescu, 1998
Alin Deutsch, Alon Y. Levy, 1998
Now I want to convert this string into a DataFrame like the following:
    c1            c2                 c3
r1  Alin Deutsch  Mary F. Fernandez  1998
r2  Alin Deutsch  Daniela Florescu   1998
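For reference, here is a minimal sketch of one way the target layout could be produced. It assumes every line has exactly three comma-separated fields and that no name contains a comma. The `sample` string stands in for the file contents, and the `r1`, `r2`, ... index labels are built by hand to match the layout above; both are illustrative assumptions rather than part of the original code.

```python
import io

import pandas as pd

# Stand-in for the file contents, copied from the sample lines above.
sample = (
    "Alin Deutsch, Mary F. Fernandez, 1998\n"
    "Alin Deutsch, Daniela Florescu, 1998\n"
    "Alin Deutsch, Alon Y. Levy, 1998\n"
)

# Parse the text as headerless CSV. skipinitialspace drops the blank
# that follows each comma, so the names come out without leading spaces.
df = pd.read_csv(
    io.StringIO(sample),
    header=None,
    names=["c1", "c2", "c3"],
    skipinitialspace=True,
)

# Label the rows r1, r2, ... to match the desired layout.
df.index = [f"r{i}" for i in range(1, len(df) + 1)]

print(df)
```

For the real 450 MB file, the path "textfile.txt" could probably be passed to pd.read_csv directly in place of io.StringIO(sample), which avoids the manual chunk loop. If memory is tight, the chunksize parameter of pd.read_csv returns an iterator of smaller DataFrames instead of one large one.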