Impyla is a Python client for HiveServer2 implementations (e.g., Impala, Hive) for distributed query engines.
Impyla is a Python client for HiveServer2 implementations (e.g., Impala, Hive) for distributed query engines.
Features:
HiveServer2 compliant; works with Impala and Hive, including nested data
Fully DB API 2.0 (PEP 249)-compliant Python client (similar to sqlite or MySQL clients) supporting Python 2.6+ and Python 3.3+.
Works with Kerberos, LDAP, SSL
SQLAlchemy connector
Converter to pandas DataFrame, allowing easy integration into the Python data stack (including scikit-learn and matplotlib); but see the Ibis project for a richer experience
References:
Related tags: