I am building a Python package. This package consists of some scripts and several datafiles (~500 Mb) stored as small csv files. I use setuptools and I track the development of the package on Gitlab.
From time to time, I need update the csv files. Crucially, I always replace all of them at the same time. The problem is that every time that I do so, the size of the Gitlab repo and the python package increases insanely, because git keeps version control of the files.
I was wondering if you have some suggestion on the best practices in such a case and how to keep the package reasonably small in particular. Is git lfs the best option?