Mail config

2010-01-26 Thread Bogdan Vatkov
Hi, I do not want to receive all the emails from this mail list, I only want to receive the answers to my questions, is this possible? If I am not mistaken when I unsubscribed I sent an email which did not reach the mail list at all (therefore there was of course no chance to get any replies). How

Re: Index gets deleted after commit?

2010-01-25 Thread Bogdan Vatkov
*:* :) Best regards, Bogdan On Sun, Jan 24, 2010 at 6:25 AM, Amit Nithian wrote: > Are you using the DIH? If so, did you try setting clean=false in the URL > line? That prevents wiping out the index on load. > > On Jan 23, 2010 4:06 PM, "Bogdan Vatkov" wrote: > >
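For readers following the suggestion quoted above, here is a minimal sketch of what "setting clean=false in the URL" could look like when triggering a DataImportHandler full import. The host, port, and handler path (`http://localhost:8983/solr/dataimport`) and the helper name `build_import_url` are assumptions for illustration, not details taken from the thread; `command=full-import` and `clean=false` are the DIH request parameters being referred to.

```python
from urllib.parse import urlencode

# Assumed base URL of the DIH request handler; adjust to the actual
# deployment (host, port, and handler path are hypothetical here).
DIH_BASE_URL = "http://localhost:8983/solr/dataimport"

def build_import_url(base_url=DIH_BASE_URL, clean=False):
    """Build a DIH full-import URL with an explicit clean flag.

    With clean=false the import is asked not to wipe the existing
    index before loading documents.
    """
    params = {
        "command": "full-import",
        "clean": "true" if clean else "false",
    }
    return base_url + "?" + urlencode(params)

print(build_import_url())
```

The sketch only builds the URL; actually sending the request (for example with `urllib.request.urlopen`) is left out so that nothing is triggered against a live index by accident.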

Index gets deleted after commit?

2010-01-23 Thread Bogdan Vatkov
After mass upload of docs in Solr I get some "REMOVING ALL DOCUMENTS FROM INDEX" without any explanation. I was running indexing w/ Solr for several weeks now and everything was ok - I indexed 22K+ docs using the SimplePostTool I was first launching *:* then some 22K+ ... with a finishing But

Re: Extracting URLs while indexing

2010-01-20 Thread Bogdan Vatkov
DelimiterFilterFactory, > you're breaking your tokens up on non-alpha characters and > case change and letter<->number transitions. Then > you're asking that things "of a kind" be put back into > words. > > You might try StandardTokenizerFactory inste

Re: Extracting URLs while indexing

2010-01-20 Thread Bogdan Vatkov
limiterFactory with the proper catenate > options set to one, that'd do it. > > Can you post the relevant parts of your schema? > > Erick > > On Wed, Jan 20, 2010 at 12:46 PM, Bogdan Vatkov wrote: > > > I am not absolutely sure about what I am saying but I t

Re: Extracting URLs while indexing

2010-01-20 Thread Bogdan Vatkov
actory tries to keep URLs > together as a token in the field, but it's just another > token... You should check though.... > > FWIW > Erick > > On Wed, Jan 20, 2010 at 9:52 AM, Bogdan Vatkov wrote: > > > Sorry, I meant completely server-side - even more I want

Re: Unstemming after solr.PorterStemFilterFactory

2010-01-20 Thread Bogdan Vatkov
ng to preserve them, so I think you're on > the right track > > FWIW > Erick > > On Wed, Jan 20, 2010 at 9:38 AM, Bogdan Vatkov wrote: > > > Hi Eric, > > > > I think I realize that and I am actually using this - I am using the > > stemm

Re: Extracting URLs while indexing

2010-01-20 Thread Bogdan Vatkov
ilters/analyzers reside on the server, not the client. > You'll have to do it with custom code > > Erick > > On Tue, Jan 19, 2010 at 5:48 PM, Bogdan Vatkov wrote: > > > Hi, > > > > I want to extract URLs (http://..., as well as file://... or eve

Re: Unstemming after solr.PorterStemFilterFactory

2010-01-20 Thread Bogdan Vatkov
"The Old Man > and the Sea". This stored part of the index is never searched, it > is solely there for retrieval/display. > > I'd really get a copy of the book, it'll save you lots of time and > effort. > > HTH > Erick > > On Tue, Jan 19, 2010 a

Extracting URLs while indexing

2010-01-19 Thread Bogdan Vatkov
Hi, I want to extract URLs (http://..., as well as file://... or even //.) while pushing documents into Solr. Is it possible with the Filters/Analyzers available nowadays? I looked into the doc but could not find anything related to it. Best regards, Bogdan
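One reply in this thread says the extraction has to be done with custom code. As a rough illustration only, the sketch below pulls URLs out of a document's text on the client side before the document is posted. The regular expression, the function name `extract_urls`, and the choice to strip trailing punctuation are all assumptions made for this example; the pattern is not a complete URL grammar and is not something the thread itself proposes.

```python
import re

# Hypothetical pattern: matches http://, https:// and file:// URLs, plus
# scheme-relative references that start with "//". Deliberately loose.
URL_PATTERN = re.compile(r"(?:(?:https?|file):)?//[^\s<>\"']+")

def extract_urls(text):
    """Return the URLs found in text, in order of appearance.

    Trailing sentence punctuation is stripped, since prose often puts
    a comma or period directly after a URL.
    """
    return [match.rstrip(".,;:!?)") for match in URL_PATTERN.findall(text)]

sample = "See http://example.com/a, and file:///tmp/x.txt or //host/share."
print(extract_urls(sample))
```

The extracted values could then be sent as an additional multi-valued field alongside the original text, though that field layout is likewise an assumption and would need a matching schema definition.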

Re: Unstemming after solr.PorterStemFilterFactory

2010-01-19 Thread Bogdan Vatkov
e - Nutch > > > > - Original Message > > From: Bogdan Vatkov > > To: solr-user@lucene.apache.org > > Sent: Tue, January 19, 2010 5:28:51 PM > > Subject: Unstemming after solr.PorterStemFilterFactory > > > > Hi, > > > > I am indexing with

Unstemming after solr.PorterStemFilterFactory

2010-01-19 Thread Bogdan Vatkov
Hi, I am indexing with the solr.PorterStemFilterFactory included, but then I need to access the unstemmed versions of the terms. What would be the easiest way to get the unstemmed version? Thanks in advance. Best regards, Bogdan

Re: Stopwords not working as expected

2010-01-02 Thread Bogdan Vatkov
lysis stack > and is not stored. This output has the stopwords removed. The output > is not stored in one place, but parts of it are scattered around the > Lucene index data structures. When you search for one of these > stopwords, you should not get any documents. > > On Sat, Jan 2

Stopwords not working as expected

2010-01-02 Thread Bogdan Vatkov
Hi, I am using a default (example) configuration of Solr and there the stopwording seems to be enabled for both indexing and querying of fields of type "text". I have a custom field which is of the "text" type. I have extended the stopwords.txt file with lots of words but when I index some documen