Yes - you can index HTML text only while keeping the tags in place in
the stored field using HTMLCharFilter (or possibly XMLCharFilter). But
you will find that embedding HTML inside XML can be problematic since
HTML tags don't have to follow the well-formed constraints that XML
requires. For example, old-style paragraph tags in HTML were often not
closed, just <p> with no </p>. If you have stuff like that, you won't
be able to embed in XML without quoting the < character. You never said
why you are embedding HTML in XML though.
-Mike
On 9/25/2011 5:06 PM, okayndc wrote:
Here is a representation of the XML file...
<root>
<commenter>
<comment><p>Text here</p><img src="image.gif" /><p>More text
here....</p></comment>
</commenter>
</root>
I want to keep the HTML tags because it keeps the formatting (paragraph
tags, etc) intact for the output. Seems like you're saying that the HTML
can be kept intact with the use of a HTML field type without having to
escape the HTML tags?