Questions tagged [html-xml-utils]
11 questions
4
votes
1 answer
Parse local HTML python (lxml)
I'm trying to parse a local HTML with lxml, but I'm getting an error, but I don't know why (sorry in advance for the bad code, I'm new to this).
from lxml import etree, html
from StringIO import StringIO
parser = etree.HTMLParser()
doc =…

Lara M.
- 855
- 2
- 10
- 23
2
votes
1 answer
xml2csv package error '_IterParseIterator' object has no attribute 'next'
I am using xmlutils package to convert xml file to csv. My code is following :
from xmlutils.xml2csv import xml2csv as x
input_path='/media/ishan/Local Disk/doc.xml'
output_path='media/ishan/Local…

Ishan
- 996
- 3
- 13
- 34
1
vote
2 answers
How can I use hxselect to generate array-ish result?
I'm using hxselect to process a HTML file in bash.
In this file there are multiple divs defined with the '.row' class.
In bash I want to extract these 'rows' into an array. (The divs are multilined so simply reading it line-by-line is not…

Daniel
- 2,318
- 2
- 22
- 53
1
vote
1 answer
Middle Selectors Ignored in hxselect
I'm attempting to extract some text from a webpage using hxselect from html-xml-utils 7.4. According to the man, hxselect will accept a comma delimited list of CSS selectors. I have three selectors:
/usr/local/bin/hxnormalize -x -i 0 -l 5000…

A Bowler Cat
- 305
- 1
- 2
- 9
1
vote
1 answer
Groovy: keeping of CR/LF of attributes while serializing XMLSlurper
I'm new with groovy (a few weeks of experience). Currently I'm trying to process some visual studio .vcproj files using groovy: replacing some paths, that will be found by a regexp patterns. This works fine for me.
To write the changes to the file,…

Alfred Meier
- 47
- 4
0
votes
1 answer
Vbscript/UFT how to get the Tag elements name to validate Xpath of tag elements
Attached XML sample, In the attached XML I want to verify the tag Elements presence
for ex: PayloadList/IFXResp/IFX/GeneralStatus/StatusCode
it would be great if anyone can help me to get the above xpath code. Looking for code in vbscript/UFT to…

Anil
- 3
- 2
0
votes
1 answer
Parsing XML in Python
I have a large XML file and I need to format it to get some needed data from particular elements in it and print out only data needed into another file. In the XML file I have a number of text tags belonging to different conversations with id's and…

T. A
- 1
- 3
0
votes
1 answer
html-xml-utils Get a specific value
I am using Xubuntu 16.04.2 with html-xml-utils
I tried this command
hxnormalize -x site/sciz.html | hxselect 'table.vitals-table:first-child'
but it doesn't work, it outputs nothing. There are about 3-5 of these tables in the html.
Doing this…
knuxyl
- 129
- 3
- 12
0
votes
2 answers
make[1]: exec(f77) failed (No such file or directory) on DragonFly BSD
I get this error when trying to compile html-xml-utils-6.9 on DragonFly BSD. Funny thing is, when doing grep -r f77 in the source directory, the result is empty. So no such option is in the constructed Makefile after ./configure.
This is my…
Olle Härstedt
- 3,799
- 1
- 24
- 57
0
votes
1 answer
Extract HTML table content based on "thead"
Here is a basic HTML table :
bar
rows
…
Suppose there are several such tables in the source file. Is there an option of hxextract, or a CSS3…
Skippy le Grand Gourou
- 6,976
- 4
- 60
- 76
-1
votes
1 answer
Replace anything inside a HTML tag ID
Have searched all over Stackoverflow and Google, and the closest answer:
sed -i -e 's|\([0-9]\{0,\}\) |'"$(date -d @${contents})"' |g' /var/www/html/index.html
works only when the…
dani 'SO learn value newbies'
- 151
- 1
- 15
I am using Xubuntu 16.04.2 with html-xml-utils
I tried this command
hxnormalize -x site/sciz.html | hxselect 'table.vitals-table:first-child'
but it doesn't work, it outputs nothing. There are about 3-5 of these tables in the html.
Doing this…

knuxyl
- 129
- 3
- 12
0
votes
2 answers
make[1]: exec(f77) failed (No such file or directory) on DragonFly BSD
I get this error when trying to compile html-xml-utils-6.9 on DragonFly BSD. Funny thing is, when doing grep -r f77 in the source directory, the result is empty. So no such option is in the constructed Makefile after ./configure.
This is my…

Olle Härstedt
- 3,799
- 1
- 24
- 57
0
votes
1 answer
Extract HTML table content based on "thead"
Here is a basic HTML table :
Suppose there are several such tables in the source file. Is there an option of hxextract, or a CSS3…
bar | rows | …

Skippy le Grand Gourou
- 6,976
- 4
- 60
- 76
-1
votes
1 answer
Replace anything inside a HTML tag ID
Have searched all over Stackoverflow and Google, and the closest answer:
sed -i -e 's|\([0-9]\{0,\}\) |'"$(date -d @${contents})"' |g' /var/www/html/index.html
works only when the…

dani 'SO learn value newbies'
- 151
- 1
- 15