How it is possible to extract textual content of an XML document preferably using XSLT.
For such fragment,
<record>
<tag1>textual content</tag1>
<tag2>textual content</tag2>
<tag2>textual content</tag2>
</record>
the desired result is :
textual content, textual content, textual content
What's the best format for output (table, CSV, etc,) in which the content be processable for further operation, such as text mining?
Thanks
Update
To extend the question, how it’s possible to extract content of each record separately. For example, for the below XML:
<Records>
<record id="1">
<tag1>textual co</tag1>
<tag2>textual con</tag2>
<tag2>textual cont</tag2>
</record>
<record id="2">
<tag1>some text</tag1>
<tag2>some tex</tag2>
<tag2>some te</tag2>
</record>
</Records>
The desired result should be such as:
(textual co, textual con, textual cont) , (some text, some tex, some te)
or in better format for further processing operations.