Thank you! That's even more I wanted to know. ;)
Georg
On Tue, Mar 2, 2010 at 10:05 PM, Walter Underwood wrote:
> You are in luck, because Avi Rappoport has just written a tutorial about
> how to do this. It is available from Lucid Imagination:
>
>
> http://www.lucidimagination.com/solutions/wh
You are in luck, because Avi Rappoport has just written a tutorial about how to
do this. It is available from Lucid Imagination:
http://www.lucidimagination.com/solutions/whitepapers/Indexing-Text-and-HTML-Files-with-Solr
I've just started reviewing it, but knowing Avi, I expect it to be very he
There is an HTML filter documented here, which might be of some help -
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.HTMLStripCharFilterFactory
Control characters can be eliminated using code like this -
http://bitbucket.org/cogtree/python-solr/src/tip/pythonsolr/pysolr.py#cl-44
Hi, How to index properly HTML documents? All the documents are HTML, some
containing charaters encodid like ží ... Is there a character
filter for filtering these codes? Is there a way to strip the HTML tags out?
Does solr weight the terms in the document based on where they appear?..
words in hea
m: Vinci <[EMAIL PROTECTED]>
> To: solr-user@lucene.apache.org
> Sent: Tuesday, March 25, 2008 4:25:10 PM
> Subject: Fields, Facets and Indexing html document
>
>
> Hi all,
>
> I want to Solr to index my html document collection. After I read number
> of
> tutorial and goog
From: Vinci <[EMAIL PROTECTED]>
To: solr-user@lucene.apache.org
Sent: Tuesday, March 25, 2008 4:25:10 PM
Subject: Fields, Facets and Indexing html document
Hi all,
I want to Solr to index my html document collection. After I read number of
tutorial and google search, I have some questions.
there
different?
7. Can facets/feature defined in another document?
Thank you,
Vinci
--
View this message in context:
http://www.nabble.com/Fields%2C-Facets-and-Indexing-html-document-tp16287762p16287762.html
Sent from the Solr - User mailing list archive at Nabble.com.