How to parse from string?

Question

I have string with tags "Key", I need get text inside tags.

string = "<Key>big_img/1/V071-e.jpg</Key>"

Need "big_img/1/V071-e.jpg"?

possible duplicate of [RegEx match open tags except XHTML self-contained tags](http://stackoverflow.com/questions/1732348/regex-match-open-tags-except-xhtml-self-contained-tags) — Uyghur Lives Matter, May 07 '15 at 17:22

score 2 · Accepted Answer · answered May 07 '15 at 16:47

2

Using regular expressions:

import re

s = "<Key>big_img/1/V071-e.jpg</Key>"

re.findall(r"<Key>(.*)</Key>",s)
['big_img/1/V071-e.jpg']

answered May 07 '15 at 16:47

ODiogoSilva

2,394
1
19
20

score 0 · Answer 2 · edited May 07 '15 at 18:13

0

The most simple solution:

string.trim()[5:-6]

This will work for any length string provided it starts with <Key> and ends with </Key>.

It works because:

trim() removes any extraneous whitespace characters
<Key> will always be in the first 5 chars of the string, so start 1 char after (remember sequence/string indexes are 0-based, so starting at 5 is really starting at the 6th char)
the beginning of </Key> will always be 6 chars from the end of the string, so stop before that point

edited May 07 '15 at 18:13

Zach Young

10,137
4
32
53

answered May 07 '15 at 16:44

Klaus D.

13,874
5
41
48

I have big string with many ``. I need do it automatically – cmashinho May 07 '15 at 16:46

score 0 · Answer 3 · answered May 07 '15 at 17:36

Use Python's xml.etree.ElementTree module to parse your XML string. If your file looks something like:

<root>
    <Key>big_img/1/V071-e.jpg</Key>
    <Key>big_img/1/V072-e.jpg</Key>
    <Key>big_img/1/V073-e.jpg</Key>
    <Key>...</Key>
</root>

First, parse your data:

from xml.etree import ElementTree

# To parse the data from a string.
doc = ElementTree.fromstring(data_string)

# Or, to parse the data from a file.
doc = ElementTree.parse('data.xml')

Then, read and print out the text from each <Key>:

for key_element in doc.findall('Key'):
    print(key_element.text)

Should output:

big_img/1/V071-e.jpg
big_img/1/V072-e.jpg
big_img/1/V073-e.jpg

How to parse from string?

3 Answers3