Questions tagged [project-gutenberg]

12 questions
4
votes
2 answers

How to apply stopwords accurately in French using R

I'm trying to pull a book using the Gutenberg library and then remove French stopwords. I've been able to do this accurately in English by doing this: twistEN <- gutenberg_download(730) twistEN <- twistEN[118:nrow(twistEN),] twistEN <- twistEN %>% …
Litmon
  • 247
  • 3
  • 18
3
votes
2 answers

Regex doesn't capture numbers written out as words

I'm looking at Oliver Twist in both English and French. I found this site (https://cran.r-project.org/web/packages/tidytext/vignettes/tidytext.html) that provides code to apply the chapter number per row of text. When I apply it to the English text,…
Litmon
  • 247
  • 3
  • 18
2
votes
0 answers

Attempting to use Project Gutenberg in Python 3

I was using the Project Gutenberg API (I'm new, forgive me if this isn't actually the right term) with Python 2 just fine. But I needed to start using Python 3 for other reasons. I'm having a lot of difficulties getting Gutenberg to work with Python…
Will
  • 351
  • 4
  • 15
1
vote
3 answers

Simple error: WordPress Gutenberg block not outputting PHP variable in JavaScript console

I'm creating a Gutenberg dynamic block. The problem is that I can't access the attribute values outside of render.php, even in my main plugin file, random-jokes.php. My console just outputs this, not the concatenated value that I want it to give…
1
vote
0 answers

Connection refused when trying to download all books from Project Gutenberg (wget)

I am trying to follow Project Gutenberg's guide to downloading all their English books in txt format, but I am running into issues. When I run the following command wget -w 2 -m -H…
Luca Guarro
  • 1,085
  • 1
  • 11
  • 25
1
vote
1 answer

Accessing Project Gutenberg text by URL

I'm trying to access a text file from a Project Gutenberg URL. I copied the code from the NLTK book, but the result was different. from urllib import request url = "http://www.gutenberg.org/files/2554/2554-0.txt" response =…
Lee
  • 11
  • 2
0
votes
1 answer

Is there a way to bulk-download all texts from a certain Project Gutenberg bookshelf?

I need to download all .txt files from this bookshelf. Is it possible via wget? I can't seem to find any parameters to pass for the bookshelf category.
0
votes
0 answers

Gutenberg module not showing on Drupal 9

I'm using Drupal 9 and I want to use Gutenberg as a page builder, to make things easy for my co-workers who don't know anything about code. I completed the installation, but the editor isn't showing, and I don't understand why. I tried all the…
sarah
  • 1
  • 1
0
votes
0 answers

Is there a way to infer the author and/or title of a work based on a Project Gutenberg file name?

Sometimes Project Gutenberg includes the author or book name in a machine-readable way in the raw text files, but many times it doesn't. I have a collection of Project Gutenberg raw text files that I would like to use and quote from using software…
Ohiovr
  • 977
  • 1
  • 12
  • 22
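
For files that do carry the machine-readable header this excerpt mentions, a minimal sketch of pulling out the title and author might look like the following. It assumes the header uses lines of the form `Title: ...` and `Author: ...` near the top of the file; `read_header_fields` and the sample text are hypothetical, and files without such lines simply yield an empty result.

```python
import re

# Assumption: header lines look like "Title: ..." and "Author: ...".
HEADER_FIELD = re.compile(r"^(Title|Author):\s*(.+?)\s*$")


def read_header_fields(text, max_lines=100):
    """Return the 'Title' and 'Author' header values found in the first lines."""
    fields = {}
    for line in text.splitlines()[:max_lines]:
        match = HEADER_FIELD.match(line)
        # Keep only the first occurrence of each field.
        if match and match.group(1) not in fields:
            fields[match.group(1)] = match.group(2)
    return fields


# Hypothetical sample standing in for the top of a raw text file.
sample = (
    "The Project Gutenberg eBook of Oliver Twist\n"
    "\n"
    "Title: Oliver Twist\n"
    "\n"
    "Author: Charles Dickens\n"
)
print(read_header_fields(sample))
```

When the result is empty, the header offers nothing to go on and the metadata has to come from another source.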
0
votes
0 answers

Trying to get book summaries from Wikipedia of Project Gutenberg books

I have the complete Project Gutenberg English library as alphabetized CSV files with the columns id, title, text. Here id is in the format /ebooks/15809. I am using the Wikipedia Python package. I can get the full text of pages and a lot of…
daddyodevil
  • 184
  • 2
  • 13
0
votes
1 answer

Basics of connecting Python to the web and validating user input

I'm relatively new, and I'm just at a loss as to where to start. I don't expect detailed step-by-step responses (though, of course, those are more than welcome), but any nudges in the right direction would be greatly appreciated. I want to use the…
Will
  • 351
  • 4
  • 15
-3
votes
1 answer

Scrape book body text from Project Gutenberg-DE

I am new to Python and I am looking for a way to extract, with Beautiful Soup, existing open-source books that are available on gutenberg-de, such as this one. I need to use them for further analysis and text mining. I tried this code, found in a…
Grig
  • 109
  • 1
  • 10