xtick formatting with matplotlib - weekly data shown annually

Question

I am hoping there is someway to handle this as there are numerous questions out there about xtick formatting in pyplot but I have not found anything approaching this question.

I have the following code:

fig, ax=plt.subplots(1,1,figsize=[10, 5]) # Set dimensions for figure
plt.plot(organic_search.groupby('week-year').Amount.sum(), color='g')
plt.title('Organic Search Revenue Time Series')
fmt = '${x:,.0f}'
tick = mtick.StrMethodFormatter(fmt)
ax.yaxis.set_major_formatter(tick)
plt.ylabel('Revenue')
plt.xlabel('Week')
plt.grid(True)
plt.show()

It works all well and good but the output is a bit messy because this is weekly data.

Some sample data:

Week | Week_Start_Date |  Amount |  year |     week-year |
Week 1      2018-01-01   42920     2018     Week 1 2018
Week 2      2018-01-08   37772     2018     Week 2 2018
Week 3      2018-01-15   41076     2018     Week 3 2018
Week 4      2018-01-22   38431     2018     Week 4 2018
Week 5      2018-01-29  101676     2018     Week 5 2018

output:

The xtick labels are unreadable as is and I was wondering if someone know how to have the same weekly graph but the xticks represent just the year. I have tried a few different ways but either (1) it shrinks the graph to the far left of the screen or (2) I get an error saying in effect "tick labels (4) do not match data points(186)).

I just want a neater display - not critical to analysis but help is appreciated

Trenton McKinney · Accepted Answer · 2021-08-21T19:37:05.823

The issue is the x-axis, 'week-year' is string type.
- See What is plotted when string data is passed to the matplotlib API? for a thorough explanation of what happens when passing strings to the API.
The resolution is to pass axis values that are a numeric type (int, float, datetime) which the API handles better.
- matplotlib: Fixing common date annoyances
In this case, once the x-axis is properly formatted as a datetime, other matplotlib formatting methods can be applied.
See pandas.DataFrame.plot and pandas.DataFrame.resample
Tested in python 3.8.11, pandas 1.3.1, and matplotlib 3.4.2.

import pandas as pd
import matplotlib.pyplot as plt
import pandas_datareader as web  # for test data; not part of pandas

# load 1 year of sample data
df = web.DataReader('amzn', data_source='yahoo', start='2020-01-01', end='2021-01-01').reset_index()

# display(df.head())
        Date     High      Low    Open    Close   Volume  Adj Close
0 2020-01-02  1898.01  1864.15  1875.0  1898.01  4029000    1898.01
1 2020-01-03  1886.20  1864.50  1864.5  1874.97  3764400    1874.97

# if the 'Date' column or a date index is not a datetime dtype then convert it
# df.index = pd.to_datetime(df.index)
df.Date = pd.to_datetime(df.Date)

# set the Date column as the index, if it isn't already there
df = df.set_index('Date')

# resample to a Weekly beginning on Monday; .sum() can be used, .mean() is correct for this sample data
dfr = df.resample('W-MON').mean()

# display(dfr.head())
               High      Low     Open    Close      Volume  Adj Close
Date                                                                 
2020-01-06  1895.97  1862.88  1866.50  1891.95  3951733.33    1891.95
2020-01-13  1909.53  1887.02  1901.82  1894.87  3270940.00    1894.87

# plot; plot a single column with y='High'
ax = dfr.plot(y=['High', 'Low'], figsize=(10, 5), grid=True, title='Weekly Resampled Mean High / Low Price', ylabel='Price')
ax.yaxis.set_major_formatter('${x:,.0f}')
plt.show()

That did the trick. I actually had a date in there already formatted as datetime. I passed that in lieu of the string and that did the trick. Thank you!! — Antonio, Aug 21 '21 at 23:03

xtick formatting with matplotlib - weekly data shown annually

1 Answers1