Hi all,

I've got some fields that will contain embedded XML. Two questions relating to that:

1. It appears that I'll need to XML-escape the field data; otherwise Solr complains about finding a start tag (one of the embedded tags) before it finds the end tag for the field.

Is this an expected constraint?

And is XML-escaping the data the best way to handle it? This is kind of related to question #2...
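
To make the escaping in question 1 concrete, here is a minimal sketch in Python of building an add document with the embedded XML escaped. The field names "id" and "content" and the helper name make_add_doc are placeholders for illustration only, not an actual schema.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape


def make_add_doc(doc_id, xml_body):
    """Build an <add> message whose field values are XML-escaped.

    The field names "id" and "content" are hypothetical placeholders.
    """
    # escape() replaces &, < and > so the embedded markup is sent as
    # plain character data instead of nested elements.
    return (
        "<add><doc>"
        f'<field name="id">{escape(doc_id)}</field>'
        f'<field name="content">{escape(xml_body)}</field>'
        "</doc></add>"
    )


def field_values(add_doc):
    """Parse the message back and return a dict of field name -> text."""
    root = ET.fromstring(add_doc)
    return {f.get("name"): f.text for f in root.iter("field")}


if __name__ == "__main__":
    message = make_add_doc("1", "<note>Tom & Jerry</note>")
    print(message)
    print(field_values(message))
```

Any XML parser reading the message sees each field as a single text node, and un-escaping on parse gives back the original markup unchanged.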

2. What would be the easiest way to ignore XML tag data while indexing these types of XML-containing fields? It seems like I could define a new field type (e.g. text_xml) and set the associated tokenizer class to something new that I create. The catch is that I'd have to un-escape the data (ick) before parsing it to skip the tags.
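
As a rough illustration of the un-escape-then-skip-tags step in question 2, here is a sketch in Python rather than an actual Solr tokenizer class. The function name text_without_tags and the wrapper element are made up for the example, and it assumes the embedded XML is a well-formed fragment with no XML declaration.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import unescape


def text_without_tags(escaped_value):
    """Return only the character data from an XML-escaped field value."""
    # Undo the escaping (&lt; &gt; &amp;) to recover the embedded markup.
    raw = unescape(escaped_value)
    # Wrap in a dummy root so a fragment with several top-level
    # elements still parses; assumes the fragment is well-formed.
    root = ET.fromstring(f"<wrapper>{raw}</wrapper>")
    # itertext() yields the text nodes and skips the tags themselves;
    # split/join collapses runs of whitespace into single spaces.
    return " ".join(" ".join(root.itertext()).split())


if __name__ == "__main__":
    value = "&lt;a&gt;hello &lt;b&gt;world&lt;/b&gt;&lt;/a&gt;"
    print(text_without_tags(value))
```

A custom tokenizer would presumably do the equivalent on the field's character stream and then hand the remaining text to the normal tokenization logic.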

Thanks,

-- Ken
--
Ken Krugler
Krugle, Inc.
+1 530-210-6378
"Find Code, Find Answers"