I have a data which can be downloaded from here http://mips.helmholtz-muenchen.de/proj/ppi/ At the end of the page , it is written "You can get the full dataset"
Then I tried to use xml
package
library(XML)
doc <- xmlTreeParse("path to/allppis.xml", useInternal = TRUE)
root <- xmlRoot(doc)
but it seems empty
what do I want ?
if I open allppi.xml downloaded from that website,
I want to parse specific lines into a txt file, it starts with <fullName>
and end with </fullName>
for example if I open that file , I can see this
<fullName>S100A8;CAGA;MRP8; calgranulin A (migration inhibitory factor-related protein 8)</fullName>
Then I want to have this
Proteins description
S100A8;CAGA;MRP8 calgranulin A (migration inhibitory factor-related protein 8)