I'm trying to generate an XSD schema from a 30 GB XML file, but I can't even parse the XML file. Normally I'd use an online converter, but the file is far too large for any of them. I've tried parsing it in both Python and R, and both fail with what looks like the same problem at the same line. Here is the R code:
library(XML)
file <- 'xmlfile.xml'
data <- xmlParse(file)
But I get the following errors:
Error: 1: input conversion failed due to input error, bytes 0x81 0xC5 0x70 0x6E
2: input conversion failed due to input error, bytes 0x81 0xC5 0x70 0x6E
3: encoder error
4: Premature end of data in tag source line 12088028
5: Premature end of data in tag html line 12088027
6: Premature end of data in tag content line 12088025
7: Premature end of data in tag delivery line 12087957
8: Premature end of data in tag collection-delivery line 2
Is there a way to skip or ignore invalid bytes like these? Or is there another way to generate an XSD from an XML file?
Answers in either Python or R are fine.
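
For reference, here is a rough Python sketch of the kind of tolerant streaming pass I have in mind. It makes several assumptions: that the file is meant to be UTF-8, that the reported bytes (0x81 0xC5 ...) are stray invalid bytes inside text content rather than inside markup, and that the later "Premature end of data" messages are only a consequence of the encoder error. The function name summarize_structure and the path/attribute summary it returns are made up for illustration; the result is a starting point for writing an XSD by hand, not an XSD itself.

```python
import codecs
import xml.etree.ElementTree as ET
from collections import defaultdict


def summarize_structure(stream, encoding="utf-8", chunk_size=1 << 20):
    """Stream-parse XML from a binary stream, replacing undecodable bytes,
    and collect each element path with the attribute names seen on it."""
    # errors="replace" turns invalid byte sequences into U+FFFD instead of failing.
    decoder = codecs.getincrementaldecoder(encoding)(errors="replace")
    parser = ET.XMLPullParser(events=("start", "end"))
    paths = defaultdict(set)  # "/root/child" -> set of attribute names
    stack = []                # currently open elements, outermost first

    def drain():
        for event, elem in parser.read_events():
            if event == "start":
                stack.append(elem)
                path = "/" + "/".join(e.tag for e in stack)
                paths[path].update(elem.attrib)
            else:
                stack.pop()
                if stack:
                    # Detach the finished subtree so memory use stays bounded.
                    stack[-1].remove(elem)

    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        parser.feed(decoder.decode(chunk))
        drain()

    # Flush any bytes still buffered in the decoder, then finish parsing.
    parser.feed(decoder.decode(b"", final=True))
    parser.close()
    drain()
    return dict(paths)
```

It would be called as summarize_structure(open('xmlfile.xml', 'rb')). Because the text is decoded before it reaches the parser, the parser never sees the invalid bytes, and finished elements are removed from their parent so the whole 30 GB tree is never held in memory. If the file is genuinely truncated or has unclosed tags, this will still raise a parse error.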