Solr has no built-in feature that splits an HTML file into sections. You will have to do that yourself, either before you send the file to Solr or by developing a custom update processor that extracts the sections and directs each one to a field specific to its country. The former is probably easier, since any generic processor that extracts text from an HTML file will strip out all HTML comments, including your country markers.
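
As a rough illustration of the first option (splitting before the file reaches Solr), here is a minimal Python sketch. It assumes the markers always look like the ones in the question, that blocks are never nested, and that the schema has a dynamic field pattern such as content_* for the per-country text. The names split_by_country, content_fr, content_ch and the id "page1" are made up for the example and are not anything Solr provides.

```python
import re

# One block: <!-- Country: FR, BE --> ... <!-- Country End -->
# "-+>" tolerates extra dashes in the closing delimiter, e.g. "--->".
BLOCK_RE = re.compile(
    r"<!--\s*Country:\s*(?P<codes>[^>]*?)\s*-+>"
    r"(?P<body>.*?)"
    r"<!--\s*Country End\s*-+>",
    re.DOTALL | re.IGNORECASE,
)


def split_by_country(html, doc_id):
    """Build one document dict with a hypothetical content_<cc> field per country."""
    doc = {"id": doc_id}
    for match in BLOCK_RE.finditer(html):
        # Crude tag removal and whitespace cleanup; a real HTML parser is safer.
        text = re.sub(r"<[^>]+>", " ", match.group("body"))
        text = " ".join(text.split())
        for code in match.group("codes").split(","):
            code = code.strip().lower()
            if not code:
                continue
            field = "content_" + code
            # Append if the same country appears in more than one block.
            doc[field] = (doc.get(field, "") + " " + text).strip()
    return doc


sample = """<!-- Country: FR, BE -->
<p>Address for France and Benelux</p>
<!-- Country End -->
<!-- Country: CH -->
<p>Address for Switzerland</p>
<!-- Country End -->"""

doc = split_by_country(sample, "page1")
```

The resulting dict could then be sent to Solr as a JSON document, and a search for a Swiss visitor would query only the content_ch field. Text outside any country block is ignored by this sketch, so shared content would need a field of its own.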

-- Jack Krupansky

-----Original Message----- From: Michael Clivot
Sent: Tuesday, March 25, 2014 6:29 AM
To: solr-user@lucene.apache.org
Subject: Indexing parts of an HTML file differently

Hello,

I have the following issue and need help:

One HTML file has different parts for different countries.
For example:

<!-- Country: FR, BE -->
....
Address for France and Benelux
....
<!-- Country End -->
<!-- Country: CH -->
....
Address for Switzerland
....
<!-- Country End -->

Depending on a parameter, I show or hide the parts on the website.
Logically, all parts are in the index and therefore all items are found by Solr. My question is: how can I have only the items for the current country in my result list?

Thanks a lot
Regards
Michael

_______________________________
cli...@netmedia.de
netmedia - the Social Workplace Experts

netmedianer GmbH, Neugrabenweg 5-7, 66123 Saarbrücken, Germany
fon: +49 681 37988-12, fax: +49 681 37988-99, mobil: +49 151 54775197
Geschäftsführer: Boris Brenner, Tim Mik?a | HRB Saarbrücken 13975

https://twitter.com/netmedianer, https://www.facebook.com/netmedianer
