File format: GZIP. Stored in: Google Cloud Storage (GCS).

When I read the file with this piece of code:

import gcsfs
import pandas as pd

fs = gcsfs.GCSFileSystem(project='myproject', token='mytoken')
with fs.open(file, 'rt') as f:
    file_tmp = pd.read_csv(f, sep=",", header=0)
    print(file_tmp)

Some string fields come back truncated. For example:

User A - IDFV (expected to be read): C305F2DB-56FC-404F-B6C1-BC52E0B680D8

User A - IDFV (actually read): C3

I have also tried csv.reader(), with the same result. The only way these fields come through complete is when I import the file from GCS into BigQuery.
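
A minimal sketch for narrowing this down, assuming the object really is gzip-compressed, comma-separated text with a header row: decompress it explicitly from a binary stream and keep every field as a plain string, then compare the value with the expected IDFV. The helper name `read_gzip_csv_rows`, the column names, and the in-memory sample are hypothetical and stand in for the real file.

```python
import csv
import gzip
import io


def read_gzip_csv_rows(binary_file):
    """Decompress a gzip stream explicitly and return its CSV rows as dicts of strings."""
    # gzip.open accepts an already-open binary file object; "rt" decodes it to text.
    with gzip.open(binary_file, mode="rt", encoding="utf-8", newline="") as text:
        return list(csv.DictReader(text, delimiter=","))


# Hypothetical sample standing in for the GCS object (not the real data).
sample = gzip.compress(
    b"user,idfv\nA,C305F2DB-56FC-404F-B6C1-BC52E0B680D8\n"
)
rows = read_gzip_csv_rows(io.BytesIO(sample))
print(rows[0]["idfv"])
```

With gcsfs, the same helper could presumably be given the object opened in binary mode, e.g. `read_gzip_csv_rows(fs.open(file, 'rb'))`. If the full value shows up this way, the truncation probably comes from how the stream is decoded or how types are inferred, not from the file itself.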

I would appreciate any insights on how to solve this problem. Thanks!