thorsten, see the following for discussion. your case is indeed an annoyance--the thread below discusses motivations for it and ways of working around it. (i too confess that i wish it were not so.)
http://www.mail-archive.com/solr-user@lucene.apache.org/msg01483.html -edward On 1/2/07, Mike Klaas <[EMAIL PROTECTED]> wrote:
Hi Thorsten, The highlighter does not escape anything itself: you are seeing the results of solr's automatic escaping of xml data within its xml response. This should be transparent (your xml decoder should un-escape the values on the way out). I'm not really familiar with xslt so I'm unsure why that isn't so (perhaps it is automatically html-escaping the values after un-xml-escaping them?) Be careful of documents containing html fragments natively. cheers, -MIke On 1/2/07, Thorsten Scherler <[EMAIL PROTECTED]> wrote: > Hi all, > > I am playing around with the highlighter and found that all highlight > terms get escaped. > > I mean solr will return > <em>TERM</em> and not > <em> TERM </em> > > I am not sure where this escaping is happening but I would need the > highlighting to NOT escape the hl.simple.pre and hl.simple.post tag > since it is horror to work with cdata sections in xsl. > > I had a look in the lucene highlighter and it seem that it does not > escape the tags. > > Can somebody point me to code which is responsible for escaping and > maybe give me a tip how I can patch to make it configurable. > > TIA > > salu2 > >
-- Edward Garrett Visiting Fellow (2006-07) Endangered Languages Academic Programme School of Oriental and African Studies London, UK 0207 898 4536 Assistant Professor, Linguistics Program Eastern Michigan University 612 Pray-Harrold Building Ypsilanti, MI, USA