Hello Otis, I have been looking for something similar for Jackrabbit's lucene index, but I still have some uncertainty about wether I understand correctly what the patches in SOLR-139 supply:
Do they just retrieve formerly stored fields of a lucene Document, change some field, and then analyze and tokenize the fetched fields again? I am merely interested in avoiding the analyzing and tokenisation of the entire Document when for example a single Field changes (think about 100 Mb pdf's in Jackrabbit which I do not want to extract the content from again when just a single small prop changes). I got some pointers before from Karl Wettin (see [1])when using term vectors that I can re-assemble the tokenstream without having the expensive analyzing again. Anyway, is this what is understood with modifying an existing lucene document, or is it done with retrieving stored fields and analyze them again? Thanks for any clarifications. [1] http://www.nabble.com/Reusing-indexed-and-analyzed-documents-tt15000023. html#a15000023 [EMAIL PROTECTED] - [EMAIL PROTECTED] - www.onehippo.com ------------------------------------------------------------- Amsterdam - Hippo B.V. Oosteinde 11 1017 WT Amsterdam +31(0)20-5224466 San Francisco - Hippo USA Inc. 101 H Street, suite Q Petaluma CA 94952-3329 +1 (707) 773-4646 ------------------------------------------------------------- > > Hi Galen, > > See SOLR-139 (this is from memory) issue in JIRA. Doable, > but not in Solr nightlies yet, I believe (also from memory), > and requires all your fields to be stored. > > Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > ----- Original Message ---- > From: Galen Pahlke <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Sent: Tuesday, March 25, 2008 4:21:45 PM > Subject: Update a field without reindexing the entire document? > > > Hi, I'm wondering if theres a way to change a single field of > a document without re-indexing every field. I'd like to do > something like this: > > <add><doc><field name="id">1</field><field > name="field1">val1</field></doc></add> > > Then later: > > <add><doc><field name="id">1</field><field > name="field2">val2</field></doc></add> > > After the second statement, the document is overwritten, so > the value of > field1 is lost. Is there a way I can do something like this > so that documents are only updated, as opposed to > overwritten? I've looked through the docs but couldn't find anything. > > Thanks, > - Galen Pahlke > -- > View this message in context: > http://www.nabble.com/Update-a-field-without-reindexing-the-en > tire-document--tp16287718p16287718.html > Sent from the Solr - User mailing list archive at Nabble.com. > > > > >