I'm an avid reader and someone interested in coding. As all you readers know that searching for the next book to read is a rather ritual and process of its own. I want to do a small little thing towards the same.
What I want to do is crawl over all the pages of Goodreads and extract those books that satisfy the following criteria.
- Have more than 20,000 reviews
- Has more than 4 star rating
- Its 1 star and 2 star ratings should be less than 2% each
- Its 3 star rating should be less than 20%
I'm decent with python and know a little bit of Beautiful Soup. Equipped with these tools can someone please guide me how to proceed with my quest?
Thank you!