There is no Solr feature that would break up your HTML file - you will have
to do that yourself, either before you send the file to Solr or by
developing a custom update processor that extracts the sections and directs
each to a specific field for the language. The former is probably easier
since any generic processor that extracts text from an HTML file will strip
out all HTML comments.
-- Jack Krupansky
-----Original Message-----
From: Michael Clivot
Sent: Tuesday, March 25, 2014 6:29 AM
To: solr-user@lucene.apache.org
Subject: Indexing parts of an HTML file differently
Hello,
I have the following issue and need help:
One HTML file has different parts for different countries.
For example:
<!-- Country: FR, BE --->
....
Address for France and Benelux
....
<!-- Country End -->
<!-- Country: CH -->
....
Address for Switzerland
....
<!-- Country End -->
Depending on a parameter, I show or hide the parts on the website
Logically, all parts are in the index and therefore all items are found by
SolR.
My question is: how can I have only the items for the current country in my
result list?
Thanks a lot
Regards
Michael
_______________________________
cli...@netmedia.de
netmedia - the Social Workplace Experts
netmedianer GmbH, Neugrabenweg 5-7, 66123 Saarbr?cken, Germany
fon: +49 681 37988-12, fax: +49 681 37988-99, mobil: +49 151 54775197
Gesch?ftsf?hrer: Boris Brenner, Tim Mik?a | HRB Saarbr?cken 13975
https://twitter.com/netmedianer, https://www.facebook.com/netmedianer