Print only links that end of .jpg extension?

Question

I have multiple links that ends with .png and .jpg extension.

I want to print and save in txt file only links that ends with .jpg extension.

I tried this code but only save the first result:

for item in soup.find_all('img'):
    hotel_image = (item['src'])
    print(hotel_image)
    file1 = open("myfile.txt", "w")
    file1.writelines(hotel_image)
    file1.close()  # to change file access modes

Ex of links:

https://cf.bstatic.com/images/hotel/max300/288.jpg

https://cf.bstatic.com/static/img/flags/12/eg.png

https://cf.bstatic.com/images/hotel/max300.jpg

https://cf.bstatic.com/static/img/review.png

what i want:

https://cf.bstatic.com/images/hotel/max300.jpg

https://cf.bstatic.com/images/hotel/max300/288.jpg

Any kind of help please?

`endswith('jpg')` [endswith](https://docs.python.org/3/library/stdtypes.html#str.endswith) — Brown Bear, May 18 '21 at 10:57
Also open the file once outside loop and close when loop is done. — kuro, May 18 '21 at 10:58

score 1 · Accepted Answer · answered May 18 '21 at 11:01

Make sure hotel_image is string otherwise convert it to string and use endswith function.

try this:

with open("myfile.txt", "w") as fp:
    for item in soup.find_all('img'):
        hotel_image = (item['src'])
        if hotel_image.endswith('.jpg'):
            fp.writelines(hotel_image)

score 0 · Answer 2 · answered May 18 '21 at 11:06

0

Try using a RegEx inside the "find_all" method, as in:

soup.find_all('img', alt="" ,src=re.compile(".jpg"))

answered May 18 '21 at 11:06

Nati Ghatan

1

Print only links that end of .jpg extension?

2 Answers2