With aws athena services, I try to import csv file including new line data
Importing data uses hive serde format.
If data is like this, (each data is enclosed in double quotes. "")
"DataA"|"DataB"|"DataC"
"Data1"|"Data2
with new line"|"Data3"
"Data가"|"Data2나"|"Data나"
then how to write regular expressions to below table DDL?
CREATE EXTERNAL TABLE ssdm_schema.ABCTable_regex (
Data_A VARCHAR(100)
, Data_B VARCHAR(100)
, Data_C VARCHAR(100)
) ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.RegexSerDe'
WITH SERDEPROPERTIES (
"input.regex" = '?????????'
)
I'm asking to this question referring to the following answer.
How to handle embed line breaks in AWS Athena
Thank you