Hi all. 

I have some experience with XML in general but I am new to Xerces, so please 
provide some hints. 

I wish to schema-validate an in-memory xml-string, coming in from a CORBA 
client, against an in-memory schema. But the parser (DOMBuilder) does not seem 
to associate the xml-data with the provided schema. I provide both from within 
MemBufInputSource and I am unaware of the real functionalty of the so-called 
'fake system id'(argument 3: bufId) and if it has something to do with the 
failure. 

I even considered to write the schema temporarily to a xsd-file, build a DOM 
from the incoming xml-string, adding a schemaLocation attribute to the still 
unvalidated DOM, serializing this DOM into a new string and the reparsing and 
validating against the xsd-file but that seems error-prone, cumbersome and does 
not work yet, too.

I provided some test results I encountered when dealing with files and the 
interesting part of the code below. 

I would appreciate your help. 
 Thanks in advance. 
   Frank Gierschner 


// small excerpt on dealing with files now 
In order to get track of the problem I modified the SCMPrint example around 
line 335 to be: 
<code> 
   parser->loadGrammar(xsdFile, Grammar::SchemaGrammarType, true); 
   if (handler.getSawErrors()) 
   { 
      handler.resetErrors(); 
   } 
   else 
   { 
      parsedOneSchemaOkay = true; 
   } 

   parser->parse(xmlFile); // NEW 
</code> 
where xmlFile denotes a file which is a XML-Document having a schemaLocation 
attribute directing to xsdFile. If in this case xmlFile contains an invalid 
document the previously applied ErrorHandler will see fatal errors - that is 
correct. But when I remove the schemaLocation attribute from xmlFile the same 
procedure results in errors like 'unknown element ...' which reveal that 
xml-data and schema seem to be unrelated. I guess the latter is the same 
problem occuring with the above stated in-memory procedure. 
// end of excerpt 


The interesting part of the code I use for the in-memory variant looks somewhat 
like the following: 

<code> 
// the in memory schema 
static const char gMySchema[] = 
"<?xml version=\"1.0\" encoding=\"utf-8\" ?> \ 
<xs:schema xmlns:xs=\"http://www.w3.org/2001/XMLSchema\"; 
targetNamespace=\"urn:rtc:tgam:MNCfgTmpl\" \ 
elementFormDefault=\"qualified\" attributeFormDefault=\"unqualified\"> \ 
<xs:element name=\"root\"> \ 
<xs:complexType> \ 
<xs:sequence> \ 
<xs:element name=\"FirstElement\" type=\"xs:string\" /> \ 
<xs:element name=\"SecondElement\" minOccurs=\"0\" type=\"xs:string\" /> \ 
<xs:element name=\"ShirdElement\" type=\"xs:string\" /> \ 
</xs:sequence> \ 
</xs:complexType> \ 
</xs:element> \ 
</xs:schema> \ 
"; 

... -- omitted other stuff -- 

static DOMBuilder * getDOMBuilder(MyDOMErrorHandler *& myErrorHandler, const 
char pSchema2use[], Grammar *& grammar) 
{ 
   DOMImplementation *impl = 
DOMImplementationRegistry::getDOMImplementation(L"LS"); 
   DOMBuilder* theBuilder = 
((DOMImplementationLS*)impl)->createDOMBuilder(DOMImplementationLS::MODE_SYNCHRONOUS,
 L"http://www.w3.org/2001/XMLSchema";); 

   if (theBuilder->canSetFeature(XMLUni::fgDOMNamespaces, true)) 
      theBuilder->setFeature(XMLUni::fgDOMNamespaces, true); 

   if (theBuilder->canSetFeature(XMLUni::fgDOMValidation, true)) 
      theBuilder->setFeature(XMLUni::fgDOMValidation, true); 

   if (theBuilder->canSetFeature(XMLUni::fgDOMDatatypeNormalization, true)) 
      theBuilder->setFeature(XMLUni::fgDOMDatatypeNormalization, true); 

   if (theBuilder->canSetFeature(XMLUni::fgXercesSchemaFullChecking, true)) 
      theBuilder->setFeature(XMLUni::fgXercesSchemaFullChecking, true); 

   myErrorHandler = new MyDOMErrorHandler(); 
   theBuilder->setErrorHandler(myErrorHandler); 

   if (pSchema2use) 
   { 
      grammar = NULL; 
      MemBufInputSource *myXSDInputSource = new MemBufInputSource((const 
XMLByte *const)gMySchema, (unsigned int) strlen(gMySchema), 
L"http://www.w3.org/2001/XMLSchema";, false); // unaware about functionality of 
argument 3 
      Wrapper4InputSource wrapper(myXSDInputSource); // will adopt 
myXSDInputSource 
      grammar = theBuilder->loadGrammar(wrapper, Grammar::SchemaGrammarType); 
   } 

   return theBuilder; 
} 

... -- omitted other stuff -- 

static int buildDOMDocument(const XMLByte* const buf, const unsigned int 
uiSize, DOMDocument* &doc) 
{ 
   DOMBuilder* theBuilder = NULL; 
   MyDOMErrorHandler* myErrorHandler = NULL; 
   Grammar *grammar = NULL; 
   theBuilder = getDOMBuilder(myErrorHandler, gMySchema, grammar); // see above 


   MemBufInputSource *myInputSource = new MemBufInputSource(buf, uiSize, 
L"TEST", false); // unaware about functionality of argument 3

   Wrapper4InputSource wrapper(myInputSource); // will adopt myInputSource 

   int retVal = -1; 
   try 
   { 
      DOMDocument* newDoc = theBuilder->parse(wrapper); 
      retVal = (myErrorHandler->sawFatals() ? -9 : (myErrorHandler->sawErrors() 
? -5 : 0)); 

      if (newDoc) 
         doc = (DOMDocument*) newDoc->cloneNode(true); 
   } 
   catch (const XMLException& toCatch) 
   { 
      cout << "Exception message is: \n" << X2c(toCatch.getMessage()) << "\n"; 
   } 
   catch (const DOMException& toCatch) 
   { 
      cout << "Exception message is: \n" << X2c(toCatch.msg) << "\n"; 
   } 
   catch (...) 
   { 
      cout << "Unexpected Exception \n" ; 
   } 

   theBuilder->release(); 

   delete myErrorHandler; 

   return retVal; 
} 

// the in-memory xml created previously via ... 
        ... 
      DOMImplementation* impl = 
DOMImplementationRegistry::getDOMImplementation(L"Range"); 
      DOMDocumentType* doctyp= 0; // impl->createDocumentType(L"root", NULL, 
L"TEST"); 
      DOMDocument* doc = impl->createDocument(L"urn:rtc:tgam:MNCfgTmpl", 
L"root", doctyp); 
      doc->setEncoding(L"UTF-8"); 
      // doc->setDocumentURI(L"http://www.w3.org/2001/XMLSchema-instance";);// 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"; ??? 
      DOMElement* root = doc->getDocumentElement(); 
        ... 
// ... and generated via serialization by DOMWriter::writeNode() from this 
DOMDocument (from an example) looks like this 

<?xml version="1.0" encoding="UTF-8" standalone="no" ?> 
<root xmlns="urn:rtc:tgam:MNCfgTmpl"> 
  <FirstElement>aTextNode</FirstElement> 
  <ThirdElement>aTextNode2</ThirdElement> 
</root> 

</code>

Reply via email to