Re: Indexing HTML document

2010-03-03 Thread György Frivolt
Thank you! That's even more I wanted to know. ;) Georg On Tue, Mar 2, 2010 at 10:05 PM, Walter Underwood wrote: > You are in luck, because Avi Rappoport has just written a tutorial about > how to do this. It is available from Lucid Imagination: > > > http://www.lucidimagination.com/solutions/wh

Re: Indexing HTML document

2010-03-02 Thread Walter Underwood
You are in luck, because Avi Rappoport has just written a tutorial about how to do this. It is available from Lucid Imagination: http://www.lucidimagination.com/solutions/whitepapers/Indexing-Text-and-HTML-Files-with-Solr I've just started reviewing it, but knowing Avi, I expect it to be very he

Re: Indexing HTML document

2010-03-02 Thread Siddhant Goel
There is an HTML filter documented here, which might be of some help - http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.HTMLStripCharFilterFactory Control characters can be eliminated using code like this - http://bitbucket.org/cogtree/python-solr/src/tip/pythonsolr/pysolr.py#cl-44

Indexing HTML document

2010-03-02 Thread György Frivolt
Hi, How to index properly HTML documents? All the documents are HTML, some containing charaters encodid like ží ... Is there a character filter for filtering these codes? Is there a way to strip the HTML tags out? Does solr weight the terms in the document based on where they appear?.. words in hea

Re: Fields, Facets and Indexing html document

2008-03-25 Thread Vinci
m: Vinci <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Sent: Tuesday, March 25, 2008 4:25:10 PM > Subject: Fields, Facets and Indexing html document > > > Hi all, > > I want to Solr to index my html document collection. After I read number > of > tutorial and goog

Re: Fields, Facets and Indexing html document

2008-03-25 Thread Otis Gospodnetic
From: Vinci <[EMAIL PROTECTED]> To: solr-user@lucene.apache.org Sent: Tuesday, March 25, 2008 4:25:10 PM Subject: Fields, Facets and Indexing html document Hi all, I want to Solr to index my html document collection. After I read number of tutorial and google search, I have some questions.

Fields, Facets and Indexing html document

2008-03-25 Thread Vinci
there different? 7. Can facets/feature defined in another document? Thank you, Vinci -- View this message in context: http://www.nabble.com/Fields%2C-Facets-and-Indexing-html-document-tp16287762p16287762.html Sent from the Solr - User mailing list archive at Nabble.com.