Nice... where? I'm trying to figure out two things: 1) How to create an analyzer that corresponds to this one in schema.xml:
<analyzer>
  <tokenizer class="solr.StandardTokenizerFactory"/>
  <filter class="solr.LowerCaseFilterFactory"/>
  <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1"/>
  <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
</analyzer>

2) I'd like to see the code that creates it reading it from schema.xml.

On Tue, Jul 5, 2011 at 12:33 PM, Markus Jelsma <markus.jel...@openindex.io> wrote:

> No. SolrJ only builds input docs from NutchDocument objects. Solr will do
> analysis. The integration is analogous to XML post of Solr documents.
>
> On Tuesday 05 July 2011 12:28:21 Gabriele Kahlout wrote:
> > Hello,
> >
> > I'm trying to understand better Nutch and Solr integration. My
> > understanding is that Documents are added to Solr index from SolrWriter's
> > write(NutchDocument doc) method. But does it make any use of the
> > WhitespaceTokenizerFactory?
>
> --
> Markus Jelsma - CTO - Openindex
> http://www.linkedin.com/in/markus17
> 050-8536620 / 06-50258350

--
Regards,
K. Gabriele

--- unchanged since 20/9/10 ---
P.S. If the subject contains "[LON]" or the addressee acknowledges the
receipt within 48 hours then I don't resend the email.
subject(this) ∈ L(LON*) ∨ ∃x. (x ∈ MyInbox ∧ Acknowledges(x, this) ∧ time(x) < Now + 48h) ⇒ ¬resend(I, this).

If an email is sent by a sender that is not a trusted contact or the email
does not contain a valid code then the email is not received. A valid code
starts with a hyphen and ends with "X".
∀x. x ∈ MyInbox ⇒ from(x) ∈ MySafeSenderList ∨ (∃y. y ∈ subject(x) ∧ y ∈ L(-[a-z]+[0-9]X)).
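
To make the analyzer declaration above concrete, here is a rough sketch of what "reading it from schema.xml" involves at the XML level. It is not Solr's own schema-loading code and it does not build a Lucene Analyzer. It uses only the JDK's XML parser to turn an <analyzer> element into an ordered list of steps (one tokenizer, then the filters), each with its factory class name and arguments. The names AnalyzerChainSketch, Step, parse and SAMPLE are hypothetical and exist only for this illustration.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;
import org.w3c.dom.NamedNodeMap;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

/** Illustration only: lists the steps declared in an <analyzer> element. */
public class AnalyzerChainSketch {

    /** The analyzer declaration quoted in this thread. */
    public static final String SAMPLE =
        "<analyzer>"
        + "<tokenizer class=\"solr.StandardTokenizerFactory\"/>"
        + "<filter class=\"solr.LowerCaseFilterFactory\"/>"
        + "<filter class=\"solr.WordDelimiterFilterFactory\""
        + " generateWordParts=\"1\" generateNumberParts=\"1\"/>"
        + "<filter class=\"solr.RemoveDuplicatesTokenFilterFactory\"/>"
        + "</analyzer>";

    /** One <tokenizer> or <filter> entry, in declaration order. */
    public static final class Step {
        public final String kind;              // "tokenizer" or "filter"
        public final String factoryClass;      // value of the class attribute
        public final Map<String, String> args; // every attribute except class

        Step(String kind, String factoryClass, Map<String, String> args) {
            this.kind = kind;
            this.factoryClass = factoryClass;
            this.args = args;
        }
    }

    /** Parses an <analyzer> element and returns its child elements as steps. */
    public static List<Step> parse(String analyzerXml) {
        Element root;
        try {
            root = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(analyzerXml)))
                    .getDocumentElement();
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse analyzer XML", e);
        }

        List<Step> steps = new ArrayList<>();
        NodeList children = root.getChildNodes();
        for (int i = 0; i < children.getLength(); i++) {
            Node node = children.item(i);
            if (node.getNodeType() != Node.ELEMENT_NODE) {
                continue; // skip whitespace and comments between elements
            }
            Element element = (Element) node;
            Map<String, String> args = new LinkedHashMap<>();
            NamedNodeMap attrs = element.getAttributes();
            for (int j = 0; j < attrs.getLength(); j++) {
                Node attr = attrs.item(j);
                if (!"class".equals(attr.getNodeName())) {
                    args.put(attr.getNodeName(), attr.getNodeValue());
                }
            }
            steps.add(new Step(element.getTagName(), element.getAttribute("class"), args));
        }
        return steps;
    }

    public static void main(String[] argv) {
        for (Step step : parse(SAMPLE)) {
            System.out.println(step.kind + " " + step.factoryClass + " " + step.args);
        }
    }
}
```

The order of the returned list matters: tokens would flow from the tokenizer through each filter in the order declared. Instantiating the actual factory classes and wiring them into an analyzer is the part Solr does itself, and it is deliberately left out here.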