I created a table in Hive and the loaded the data from external csv file. When I try to print the data from python, I get output like "['\x00"\x00m\x00e\x00s\x00s\x00a\x00g\x00e\x00"\x00']". When I query Hive GUI, the results are proper. Please tell me how can I get the same results via python program.
My python code :
import pyhs2
with pyhs2.connect(host='192.168.56.101',
port=10000,
authMechanism='PLAIN',
user='hiveuser',
password='password',
database='anuvrat') as conn:
with conn.cursor() as cur:
cur.execute('SELECT message FROM ABC_NEWS LIMIT 5')
print cur.fetchone()
Output is :
/usr/bin/python2.7 /home/anuvrattiku/SPRING_2017/CMPE239/Facebook_Fake_news_detection/code_fake_news/code.py
['\x00"\x00m\x00e\x00s\x00s\x00a\x00g\x00e\x00"\x00']
Process finished with exit code 0
When I query the same table in Hive, I get the following output :
This is how I created the table :
CREATE TABLE ABC_NEWS(
ID STRING,
PAGE_ID INT,
NAME STRING,
MESSAGE STRING,
DESCRIPTION STRING,
CAPTION STRING,
POST_TYPE STRING,
STATUS_TYPE STRING,
LIKES_COUNT SMALLINT,
COMMENTS SMALLINT,
SHARES_COUNT SMALLINT,
LOVE_COUNT SMALLINT,
WOW_COUNT SMALLINT,
HAHA_COUNT SMALLINT,
SAD_COUNT SMALLINT,
THANKFUL_COUNT SMALLINT,
ANGRY_COUNT SMALLINT,
LINK STRING,
IMAGE_LINK STRING,
POSTED_AT STRING
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY "," ESCAPED BY '\\';
The csv file for loading the table is in the path below : https://www.dropbox.com/s/fiwygyqt8u9eo5s/abc-news-86680728811.csv?dl=0