I have my code here:
df = pd.read_parquet(r"C:\path\to\parquet.parquet", engine='fastparquet')
print(df)
And my source data looks like this:
date
----
2022-02-10
2022-05-03
2164-09-09
My target data in the parquet file looks like this:
date
----
2022-02-10 00:00:00.00000000
2022-05-03 00:00:00.00000000
2164-09-09 00:52:00.03019401
I am migrating data from SQL Server to AWS S3 via DMS and it is being stored as parquet files. Why is Pandas automatically converting the date
to datetime64[ns]
? I know there is no date
datatype in Python, it automatically converts to datetime64[ns]
in Python. But why isn't the YYYY-MM-DD format being retained? For some columns it is, and for others it shows the time with the date? I'm confused. Is there any documentation on this?