I am using the python NLTK for a Django app. Locally I have the NLTK corpora downloaded and things work fine. For heroku, I tried putting the corpora onto the filesystem and pushing to heroku (as described here: LookupError: Resource 'corpora/stopwords' not found) but this exceeded the 1GB limit that heroku has.
Now I've added the corpora to an AWS S3 bucket, but can't figure out how to import the nltk data into the django app. How would I do this? Thanks!