df = pd.read_csv('./data/flights_2015.csv', low_memory=False)
print('Dataframe dimensions:', df.shape)
Dataframe dimensions: (5819079, 31)
I tried to count how many flights there for each airport
in the entire dataset using
count_flights = df['ORIGIN_AIRPORT'].value_counts()
Its output looks like this:
ATL 346836
ORD 285884
DFW 239551
DEN 196055
LAX 194673
...
13541 11
10165 9
14222 9
13502 6
11503 4
Most of counts look correct, but why did I get 13541, 10165, 14222, 13502, 11503
these numbers in the index column?
flights_2015.csv
does not even have that kind of number on 'ORIGIN_AIRPORT'
column.
What happened here?