I'm trying to write some data from my InfluxDB database to a csv. While the influxdb-cli does allow me to export data to csv, I am not allowed to run the cli on the server myself. I'm using the influx-db python package to query the database.
Now, here's the code that I'm using:
import csv
from influxdb import InfluxDBClient
USER='root'
PASSWORD='root'
DBNAME='sensordata'
HOST='localhost'
PORT=8086
client = InfluxDBClient(HOST,PORT,USER,PASSWORD,DBNAME)
query="select * from home where time > '2017-11-15 14:55:00' AND time <= '2017-11-15 14:55:05' tz('Asia/Kolkata')"
result = client.query(query,epoch='ns')
exported_data = list(result.get_points())
with open("output.csv", "a", newline='') as fp:
writer = csv.writer(fp, dialect='excel')
for line in exported_data:
print(line)
writer.writerow(line)
The problem is that when I print the lines, I get both key and values like so:
{'time': 1510737900336919297, 'value1': 18.84, 'value10': 19.83, 'value2': 18.56, 'value3': 12.61, 'value4': 17.57, 'value5': 16.6, 'value6': 16.81, 'value7': 12.84, 'value8': 11.54, 'value9': 14.26}
{'time': 1510737901370333995, 'value1': 11.32, 'value10': 12.98, 'value2': 12.34, 'value3': 12.22, 'value4': 11.08, 'value5': 12.07, 'value6': 17.62, 'value7': 14.68, 'value8': 16.87, 'value9': 11.4}
{'time': 1510737902403461281, 'value1': 12.37, 'value10': 16.18, 'value2': 18.83, 'value3': 14.59, 'value4': 11.79, 'value5': 18.52, 'value6': 11.25, 'value7': 17.28, 'value8': 10.54, 'value9': 19.1}
{'time': 1510737903436997966, 'value1': 13, 'value10': 12.04, 'value2': 10.02, 'value3': 14.28, 'value4': 14.51, 'value5': 17.3, 'value6': 16.14, 'value7': 15.04, 'value8': 13.16, 'value9': 10.47}
{'time': 1510737904470366806, 'value1': 16.2, 'value10': 10.83, 'value2': 12.64, 'value3': 13.51, 'value4': 13.74, 'value5': 11.52, 'value6': 13.42, 'value7': 13.14, 'value8': 16.6, 'value9': 11.24}
But, the csv file contains just the keys like so:
time,value1,value10,value2,value3,value4,value5,value6,value7,value8,value9
time,value1,value10,value2,value3,value4,value5,value6,value7,value8,value9
time,value1,value10,value2,value3,value4,value5,value6,value7,value8,value9
time,value1,value10,value2,value3,value4,value5,value6,value7,value8,value9
time,value1,value10,value2,value3,value4,value5,value6,value7,value8,value9
What am I doing wrong here?
Also, is it possible for me to have the csv formatted like:
time_heading, value1_heading, value2_heading ....
time_value, value1_value, value2_value ....
time_value, value1_value, value2_value ....
time_value, value1_value, value2_value ....
.
.
.
I did try out a few solutions that I googled but none of them helped. Does anyone know what's going wrong?
Edit
I also wanted to "natural_sort" the headers in since it makes sense to have it in that format. To do that I referred to this answer on SO. But, to do that, I needed to ignore the first item which is "time" and sort the "value" headers only which I did using this
One last thing that I needed is to calculate the offset of the text since some of the measurements have "val" as the field name and some have "value". The offset is 3 and 5 respectively which I will use in the natural sorting snippet. To do that, I referred to this. I finally ended up having the data look like
time,value1,value2,value3,....value10
and not time,value1,value10,value2,value3,...value9
which is what I was looking for.
Here's what the final code looks like:
import csv
from influxdb import InfluxDBClient
USER = 'root'
PASSWORD = 'root'
DB_NAME = 'sensordata'
HOST = 'localhost'
PORT = 8086
client = InfluxDBClient(HOST, PORT, USER, PASSWORD, DB_NAME)
query = "select * from home where time > '2017-11-15 14:55:00' AND time <= '2017-11-15 14:55:05' tz('Asia/Kolkata')"
result = client.query(query, epoch='ns')
exported_data = list(result.get_points())
header_list = list(exported_data[0].keys())
with open("output.csv", "w", newline='') as fp:
writer = csv.writer(fp, dialect='excel')
# print(header_list[1:])
value_header = header_list[1]
offset = sum(c.isalpha() for c in value_header)
# print(offset)
header_list[1:] = sorted(header_list[1:], key=lambda x: int(x[offset:]))
# print(header_list)
writer.writerow(header_list)
for line in exported_data:
# print(line)
writer.writerow([line[kn] for kn in header_list])
Thanks to @be_good_do_good for the answer