5

Java XML Schema validation - ResourceResolver trouble

I am trying to build a component into a web application that will validate different XML documents against a set of schemas.

I have the validator class in java package com.example.xml and then I have a "package" for the schemas com.example.xml.Schemas where they are organized according to the namespace along with any included schemas, like so:

com.example.xml/SchemaValidator.java

com.example.xml.Schemas/2008/07/05/Message.xsd
com.example.xml.Schemas/2008/07/05/Person.xsd
com.example.xml.Schemas/2008/07/05/Address.xsd

com.example.xml.Schemas/2010/09/21/Message.xsd
com.example.xml.Schemas/2010/09/21/Organization.xsd
com.example.xml.Schemas/2010/09/21/Person.xsd
com.example.xml.Schemas/2010/09/21/Address.xsd

So most of the document-elements are called Message but in different namespaces and different internals. Via a mapping I can determine where the correct schema is located, and then I load the XSD matching the root element name - but to make the parser able to read the included schemas, I made an attempt at an org.w3c.dom.ls.LSResourceResolver implementation to fetch the correct files based on the systemId.

public class ResourceResolver implements LSResourceResolver {
    private String basePath;

    public ResourceResolver( String baseDirectory ){
        basePath = baseDirectory;
        if( !basePath.endsWith("/")){
            basePath += "/";
        }
    }

    @Override
    public LSInput resolveResource(String type, String namespaceURI, String publicId, String systemId, String baseURI) {
        System.out.println("Resolving: " + type + ", " + namespaceURI + ", " +  publicId  + ", " + systemId + ", " + baseURI + " (basepath:" + basePath + ")");
        String mypath = basePath + systemId;
        InputStream resourceAsStream = getClass().getResourceAsStream(mypath);
        if( resourceAsStream == null){ System.out.println("Ups! Stream is null"); }

        String contents = null;
        byte[] bytes = null;
        try {
            bytes = new byte[resourceAsStream.available()];
            resourceAsStream.read(bytes);
            contents = new String(bytes, Charset.forName("UTF-8"));
        } catch (IOException e) { } 
        finally{
            try { resourceAsStream.close(); } 
            catch (IOException e) { }
        }
        Input xmlInput = new Input(resourceAsStream, type, namespaceURI, publicId, systemId, basePath);    
        xmlInput.setStringData(contents);  // avoid problems with inputstream position
        return xmlInput;
    }
}

And my ResourceResolver is used this way:

public boolean validate(Document doc, String xsdPath) throws SAXException, IOException, ParserConfigurationException {
    SchemaFactory schemaFactory = SchemaFactory.newInstance(XMLConstants.W3C_XML_SCHEMA_NS_URI);
    String rootElmtName = doc.getDocumentElement().getLocalName();
    String resourcePath = "/com/example/xml/" + xsdPath ;
    InputStream topSchema = getClass().getResourceAsStream(xsdPath + "/" + rootElmtName + ".xsd");
    Source schemaSource = new StreamSource(topSchema);
    ResourceResolver resolver = new ResourceResolver(resourcePath);
    schemaFactory.setResourceResolver(resolver);
    Schema schema = schemaFactory.newSchema(schemaSource);
    Validator validator = schema.newValidator();
    validator.setResourceResolver(resolver);
    Source source = new DOMSource(doc);

    ValidationErrorHandler handler = new ValidationErrorHandler();
    validator.setErrorHandler(handler);
    validator.validate(source);
    if(handler.hasErrors()){
        for( ValidationError e : handler.getErrors()){
            log(e);
        }
        return false;
    }
    return true;
}

However, it doesn't work! It resolves a few schema resources and then stops I tried setting a breakpoint in Eclipse in my resource resolver, and the inputstream is not null - it points to the right file containing a XSD. So something in the Java XML parser classes must expect some other behavior from my resolver.

Resolving: http://www.w3.org/2001/XMLSchema, http://com/example/xml/schemas/2005/08/07/, null, Letters.xsd, file:///com/example/xml/Schemas/2005/08/07/Message.xsd (basepath:/com/example/xml/Schemas/2005/08/07/)
Resolving: http://www.w3.org/2001/XMLSchema, http://com/example/xml/schemas/2005/08/07/, null, GeneralElements.xsd, file:///com/example/xml/Schemas/2005/08/07/Message.xsd (basepath:/com/example/xml/Schemas/2005/08/07/)
Resolving: http://www.w3.org/2001/XMLSchema, http://com/example/xml/schemas/2005/08/07/, null, GeneralTypes.xsd, file:///com/example/xml/Schemas/2005/08/07/Message.xsd (basepath:/com/example/xml/Schemas/2005/08/07/)
Exception in thread "main" org.xml.sax.SAXParseException: src-resolve: Cannot resolve the name 'Letters' to a(n) 'group' component.
    at com.sun.org.apache.xerces.internal.util.ErrorHandlerWrapper.createSAXParseException(ErrorHandlerWrapper.java:195)
    at com.sun.org.apache.xerces.internal.util.ErrorHandlerWrapper.error(ErrorHandlerWrapper.java:131)
    at com.sun.org.apache.xerces.internal.impl.XMLErrorReporter.reportError(XMLErrorReporter.java:384)
    at com.sun.org.apache.xerces.internal.impl.xs.traversers.XSDHandler.reportSchemaErr(XSDHandler.java:2537)
    at com.sun.org.apache.xerces.internal.impl.xs.traversers.XSDHandler.reportSchemaError(XSDHandler.java:2528)
    at com.sun.org.apache.xerces.internal.impl.xs.traversers.XSDHandler.getGlobalDecl(XSDHandler.java:1472)
    at com.sun.org.apache.xerces.internal.impl.xs.traversers.XSDGroupTraverser.traverseLocal(XSDGroupTraverser.java:72)

The contents of the Message.xsd contains a bit like this:

<xs:include schemaLocation="Letters.xsd"/>
<xs:include schemaLocation="GeneralElements.xsd"/>
<xs:include schemaLocation="GeneralTypes.xsd"/>
<xs:element name="Message"> 

-and the Letters.xsd also contains

<xs:include schemaLocation="GeneralElements.xsd"/>
<xs:include schemaLocation="GeneralTypes.xsd"/>
<xs:group name="Letters">

The outline is of the ResourceResolver and Input classes are from another SO post - so I might have misunderstood something.

Is it the proper way I have created the resource resolver, or have I missed something completely?

Kenned
  • 568
  • 4
  • 11

0 Answers0