Saving results from Python BeautifulSoup to a file

Question

I am trying to save the results of a BeautifulSoup loop that extracts text from a Wikipedia URL to a text file. I have not managed to create the text file and add text to it while my loop iterates over the parsed text. Printing to the screen works fine, but I would like to send the output of my code to a text file instead. I hope you can guide me here.

import requests
import string
from bs4 import BeautifulSoup

url_to_text = "https://en.wikipedia.org/wiki/Santiago"

url_open = requests.get(url_to_text)
soup = BeautifulSoup(url_open.content,'html.parser')

for i in range(1,50):
    doc_text = print((soup('p')[i].text))

Does this answer your question? [Python Save to file](https://stackoverflow.com/questions/9536714/python-save-to-file) — drum, Aug 09 '21 at 22:29

Eliaz Bobadilla · Answer 1 · 2021-08-09T22:51:37.200

How to write a file:

with open('text.txt', 'w') as file:
    file.write('text')

You can read this question to have more information on how to save a file in Python.
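As a small illustration of the two most common modes, here is a sketch that assumes a throwaway file in a temporary directory (the name `demo.txt` is made up for this example). Opening with `"w"` replaces the contents each time, while `"a"` appends to what is already there.

```python
import os
import tempfile

# Hypothetical file name inside a temporary directory, so nothing real is overwritten.
demo_path = os.path.join(tempfile.mkdtemp(), "demo.txt")

with open(demo_path, "w", encoding="utf-8") as f:
    f.write("first\n")

# Opening with "w" again truncates the file, so "first" is discarded.
with open(demo_path, "w", encoding="utf-8") as f:
    f.write("second\n")

# Opening with "a" keeps the existing contents and adds to the end.
with open(demo_path, "a", encoding="utf-8") as f:
    f.write("third\n")

with open(demo_path, encoding="utf-8") as f:
    contents = f.read()

print(contents)
```

If the loop is run several times and earlier output should be kept, `"a"` is probably the mode to use; for a fresh file on every run, `"w"` is enough.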

Implementation:

from requests import get
from bs4 import BeautifulSoup

soup = BeautifulSoup(
    get("https://en.wikipedia.org/wiki/Santiago").content, "html.parser"
)

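# mode "w" = write mode (creates the file, or empties it if it already exists)
with open(file="text.txt", mode="w", encoding="utf-8") as file:
    paragraphs = soup("p")  # find all <p> tags once instead of on every iteration
    # slicing takes paragraphs 1..49 and avoids an IndexError on shorter pages
    for paragraph in paragraphs[1:50]:
        file.write(paragraph.text)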

Note that the file does not need to exist before the script runs: in "w" mode Python creates it if it is missing and overwrites it if it already exists.
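The following sketch shows the same loop without any network access. It assumes a plain list of strings standing in for the `soup("p")[i].text` values, and a made-up output path in a temporary directory. It also shows why the original `doc_text = print(...)` stored nothing useful: `print()` always returns `None`.

```python
import os
import tempfile

# Stand-in for the paragraph texts; in the answer these come from soup("p")[i].text.
paragraphs = [
    "First paragraph.\n",
    "Second paragraph with ñ and é.\n",
    "Third paragraph.\n",
]

# Hypothetical output path inside a temporary directory.
out_path = os.path.join(tempfile.mkdtemp(), "text.txt")
existed_before = os.path.exists(out_path)  # the file has not been created yet

# print() writes to the screen and returns None, so assigning its result saves no text.
result_of_print = print(paragraphs[0], end="")

with open(out_path, mode="w", encoding="utf-8") as out_file:
    for text in paragraphs:
        out_file.write(text)  # write() adds no newline of its own

# Read the file back to check what was saved.
with open(out_path, encoding="utf-8") as in_file:
    saved = in_file.read()
```

Because `write()` does not append a newline, paragraphs whose text lacks a trailing `"\n"` would run together; adding `out_file.write("\n")` after each one is a simple way to separate them.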

edited Aug 09 '21 at 22:51

answered Aug 09 '21 at 22:39

Thank you, Eliaz. It works and does what I was expecting. Now I have my text in the text file with the correct encoding. – raulfloresp Aug 09 '21 at 23:31
Then you should accept the answer, to make it easier for other people to find it. – Eliaz Bobadilla Aug 10 '21 at 14:09

Prakhar Srivastava · Answer 2 · 2021-08-09T22:45:31.370

Please try this:

# `soup` is the BeautifulSoup object created in the question
with open(file="my_text.txt", mode="w", encoding="UTF-8") as dest_file:
    for i in range(1, 50):
        dest_file.write(soup('p')[i].text)

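The problem is most likely the encoding. Python strings are Unicode, but when open() is called without an encoding argument it falls back to a platform-dependent default (for example cp1252 on many Windows systems), which may not be able to represent every character on the page. Passing encoding="UTF-8" explicitly avoids this. Please feel free to reach out if the issue persists.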
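To illustrate why the encoding argument matters, here is a small sketch. It assumes a made-up sample string and uses `ascii` as a stand-in for a restrictive default encoding; the actual default depends on the system, so this is not a reproduction of the original error.

```python
import os
import tempfile

# Made-up sample text containing characters outside the ASCII range.
text = "Santiago de Chile: población, café, 東京\n"

# Hypothetical output path inside a temporary directory.
out_path = os.path.join(tempfile.mkdtemp(), "my_text.txt")

# A restrictive encoding cannot represent every character and raises an error.
try:
    with open(out_path, mode="w", encoding="ascii") as dest_file:
        dest_file.write(text)
    ascii_ok = True
except UnicodeEncodeError:
    ascii_ok = False

# UTF-8 can encode any Unicode string, so the same write succeeds.
with open(out_path, mode="w", encoding="utf-8") as dest_file:
    dest_file.write(text)

# Read back with the same encoding to confirm nothing was lost.
with open(out_path, encoding="utf-8") as src_file:
    round_trip = src_file.read()
```

Reading the file later with the same `encoding="utf-8"` keeps the round trip lossless.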

Thanks.

edited Aug 09 '21 at 22:45

answered Aug 09 '21 at 22:40

Prakhar Srivastava, it works. It does what I was expecting. I appreciate your support. Thank You. Raúl – raulfloresp Aug 09 '21 at 23:23
Awesome! Glad to be of help :) – Prakhar Srivastava Aug 09 '21 at 23:25
