Questions tagged [project-gutenberg]
12 questions
4
votes
2 answers
How to apply stopwords accurately in French using R
I'm trying to pull a book using the Gutenberg library and then remove French stopwords. I've been able to do this accurately in English by doing this:
twistEN <- gutenberg_download(730)
twistEN <- twistEN[118:nrow(twistEN),]
twistEN <- twistEN %>%
…

Litmon
- 247
- 3
- 18
3
votes
2 answers
Regex doesn't capture numbers written out as words
I'm looking at Oliver Twist in both English and French. I found this site (https://cran.r-project.org/web/packages/tidytext/vignettes/tidytext.html) that provides code to apply the chapter number per row of text. When I apply it to the English text,…

Litmon
- 247
- 3
- 18
2
votes
0 answers
Attempting to use Project Gutenberg in Python 3
I was using the Project Gutenberg API (I'm new, forgive me if this isn't actually the right term) with Python 2 just fine. But I needed to start using Python 3 for other reasons.
I'm having a lot of difficulties getting Gutenberg to work with Python…

Will
- 351
- 4
- 15
1
vote
3 answers
Simple error : Wordpress Gutenberg Block not outputting php variable in javascript console. Any help appreciated
I'm creating a gutenberg dynamic block. The problem is my I can't access the attributes value outside of render.php , even in my main plugin random-jokes.php file.
My console is just outputting this
not the concatenated value that I want it to give…

rawaha johns
- 33
- 4
1
vote
0 answers
Connection refused when trying to download all books from project gutenberg (wget)
I am trying to follow Project Gutenberg's guide to downloading all their english books in txt format but I am running into issues.
When I run the following command
wget -w 2 -m -H…

Luca Guarro
- 1,085
- 1
- 11
- 25
1
vote
1 answer
Project Gutenberg accessing text with url
I'm trying to access a text file from project gutenberg's url.
Hence I've copyed the same code from nltk book's, the result was different.
from urllib import request
url = "http://www.gutenberg.org/files/2554/2554-0.txt"
response =…

Lee
- 11
- 2
0
votes
1 answer
Is there a way to bulk-download all texts from a certain Project Gutenberg bookshelf?
I need to download all .txt files from this bookshelf.
Is it possible via wget? I can't seem to find any parametres to pass regarding the bookshelf categorizing.

really_anxious
- 25
- 5
0
votes
0 answers
Guternberg module not showhing on drupal 9
I'm using drupal 9 and I want to use gutenberg a page builder, to make it easy for my co-worker who don't know anything about code.
The thing is that I completed the installation but the edito isn't showing. I don't understand why. I try all the…

sarah
- 1
- 1
0
votes
0 answers
Is there a way to infer the author and or title of a work based on a project Gutenburg file name?
Sometimes project Gutenberg includes the author or book name in a machine readable way in the raw text files but many times it doesn't. I have a collection of project Gutenberg raw text files that I would like to use and quote from using software…

Ohiovr
- 977
- 1
- 12
- 22
0
votes
0 answers
Trying to get book summaries from Wikipedia of Project Gutenberg books
I have the complete Project Gutenberg English library as alphabetized csv files with the columns - id, title, text. Here id is in the format /ebooks/15809. Then I am using the Wikipedia python package. I can get the full text of pages and a lot of…

daddyodevil
- 184
- 2
- 13
0
votes
1 answer
Basics of connecting python to the web and validating user input
I'm relatively new, and I'm just at a loss as to where to start. I don't expect detailed step-by-step responses (though, of course, those are more than welcome), but any nudges in the right direction would be greatly appreciated.
I want to use the…

Will
- 351
- 4
- 15
-3
votes
1 answer
scrape book body text from project gutenberg de
I am new to python and I am looking for a way to extract with beautiful soup existing open source books that are available on gutenberg-de, such as this one
I need to use them for further analysis and text mining.
I tried this code, found in a…

Grig
- 109
- 1
- 10