1

How can I calculate the weighted coefficient of variation (CV) over a NumPy array in Python? It's okay to use any popular third-party Python package for this purpose.

I can calculate the CV using scipy.stats.variation, but it's not weighted.

import numpy as np
from scipy.stats import variation

arr = np.arange(-5, 5)
weights = np.arange(9, -1, -1)  # Same size as arr
cv = abs(variation(arr))  # Isn't weighted
bad_coder
  • 11,289
  • 20
  • 44
  • 72
Asclepius
  • 57,944
  • 17
  • 167
  • 143

1 Answers1

2

This can be done using the statsmodels.stats.weightstats.DescrStatsW class in the statsmodels package for calculating weighted statistics.

from statsmodels.stats.weightstats import DescrStatsW

arr = np.arange(-5, 5)
weights = np.arange(9, -1, -1)  # Same size as arr

dsw = DescrStatsW(arr, weights)
cv = dsw.std / abs(dsw.mean)  # weighted std / abs of weighted mean

print(cv)
1.6583123951777001

For a related statistic, i.e. the weighted gini, see this answer.


Credit: This answer is motivated by one on calculating the weighted standard deviation.

Asclepius
  • 57,944
  • 17
  • 167
  • 143