I just looked at this again, and I think the message is referring to
the garbage string of characters "&#2a;". It looks like a decimal
numeric character reference, but the letter 'a' is a hex digit, so the
parser rejects it. I'll have to go back to my OCR cleanup routine ...
Thanks for reading.
Ph
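
For anyone who hits the same thing, here is a rough sketch of the kind of
check I have in mind for the cleanup step. It assumes the bad input looks
like "&#2a;" (a "&#" followed by something that is neither all decimal
digits nor "x" plus hex digits). The function name find_bad_char_refs is
made up for illustration and is not part of any library. It only checks
the syntax of the reference, not whether the code point is a legal XML
character.

```python
import re

# Anything that starts like a numeric character reference:
# "&#", then a run of letters/digits, then an optional ";".
CANDIDATE = re.compile(r"&#([0-9A-Za-z]*)(;?)")

# Well-formed bodies per the XML CharRef rule: decimal digits,
# or a lowercase "x" followed by hex digits.
VALID_BODY = re.compile(r"[0-9]+|x[0-9A-Fa-f]+")


def find_bad_char_refs(text):
    """Return (offset, matched_text) for each malformed numeric reference."""
    bad = []
    for m in CANDIDATE.finditer(text):
        body, semicolon = m.group(1), m.group(2)
        # Malformed if the ";" is missing or the body is not decimal/hex.
        if not semicolon or not VALID_BODY.fullmatch(body):
            bad.append((m.start(), m.group(0)))
    return bad


if __name__ == "__main__":
    sample = "ok &#65; and &#x2a; but &#2a; bad"
    for offset, ref in find_bad_char_refs(sample):
        print(offset, ref)
```

Running something like this over each document before indexing should at
least point to the offending offsets, so the OCR cleanup can escape the
"&" or drop the fragment.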
Hello all,
I'm indexing a body of OCR text and encountered this exception. Apparently
it's some kind of XML parser error. Out of thousands of documents,
which I create with significant processing to make sure they are XML
compliant, only this one appears to have a problem. But can anyone tell
me