Change the file format used by to_sql method

Asked Jul 07 '21 at 13:33

Active Aug 04 '21 at 07:56

Viewed 156 times

This works as expected and creates a new table. But the data is stored in a format that only spark can read. How do I store the data in csv format?

from pyathena.pandas.util import to_sql

to_sql(
    mrdf,
    "mrdf_table3",
    conn,
    "s3://" + bucket + "/tutorial/s3dir3/",
    schema="hunspell",
    index=False,
    if_exists="replace",
)

I tried flavor="csv" or flavor="textfile" but the file that is generated is still not readable.

Update: Connection string

from pyathena import connect
bucket = "hunspell"

conn = connect(
    aws_access_key_id="XXX",
    aws_secret_access_key="XXX",
    s3_staging_dir="s3://" + bucket + "/tutorial/staging/",
    region_name="us-east-1",
)

edited Aug 04 '21 at 07:56

asked Jul 07 '21 at 13:33

shantanuo

31,689
78
245
403

What is the connection string of the engine you use? – Chananel P Jul 13 '21 at 12:59
@ChananelP added the connection string. – shantanuo Jul 16 '21 at 11:45
Perhaps this conn may work `conn_str = "awsathena+rest://:@athena...` see https://pypi.org/project/pyathena/ – Chananel P Aug 08 '21 at 09:55

Change the file format used by to_sql method

0 Answers0