Hi all,
I've got some fields that will contain embedded XML. Two questions
relating to that:
1. It appears as though I'll need to XML-escape the field data, as
otherwise Solr complains about finding a start tag (one of the embedded
tags) before it finds the end tag for a field.
Is this an expected constraint?
And is XML-escaping the data the best way to handle it? This is kind
of related to question #2...
2. What would be the easiest way to ignore XML tag data while
indexing these types of XML-containing fields? It seems like I could
define a new field type (e.g. text_xml) and set the associated
tokenizer class to something new that I create. Though I'd have to
un-escape the data (ick) before parsing it to skip tags.
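
To make both questions concrete, here is a rough sketch of what I have in
mind, written in Python only to show the logic. It is not Solr code: the
helper names (make_field, strip_tags), the field name "content", and the
sample data are all made up for illustration. It also assumes the
tokenizer would still see the escaped text, which I have not verified.

```python
import re
from xml.sax.saxutils import escape, quoteattr, unescape

def make_field(name, value):
    """Build one <field> element for an add message, escaping the value."""
    # escape() rewrites &, < and > so the embedded tags cannot be
    # mistaken for markup belonging to the update message itself.
    return "<field name=%s>%s</field>" % (quoteattr(name), escape(value))

# Simplistic, hypothetical pattern: anything between < and > is a tag.
# It does not handle comments, CDATA sections, or '>' inside attributes.
TAG_RE = re.compile(r"<[^>]+>")

def strip_tags(escaped_value):
    """Un-escape a field value, then drop the tags and keep the text."""
    raw = unescape(escaped_value)
    # Replace each tag with a space so adjacent words do not run together,
    # then collapse the resulting runs of whitespace.
    return " ".join(TAG_RE.sub(" ", raw).split())

# Made-up sample data.
embedded = "<note><to>Ken</to><body>Fish & chips</body></note>"
field = make_field("content", embedded)
text_to_index = strip_tags(escape(embedded))
```

With this sample, the tags are dropped and only "Ken Fish & chips" would
be left for the tokenizer. A custom tokenizer for a text_xml field type
would presumably need to do the equivalent of strip_tags().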
Thanks,
-- Ken
--
Ken Krugler
Krugle, Inc.
+1 530-210-6378
"Find Code, Find Answers"