Fetching the first 10 results and the last result

2008-05-21 Thread Tim Mahy
Hi all, is there a way to let Solr not only return the total number of found articles, but also the data of the last document when for example only requesting the first 10 documents ? we could do this with a seperate query by either letting the second query fetch 1 row from position = previous

Highlighting - field criteria highlights in other fields

2008-05-20 Thread Tim Mahy
Hi all, we have situation in which we have documents that have an introduction (text) , a body (text) and some meta data fields (integers mostly). when we create a query like this : q=( +(body_nl:( brussel) ) AND ( (+publicationid:("3430" OR "3451")) )&fq= +publishdateAsString:[20070520 TO 20

bitwise comparer

2008-05-16 Thread Tim Mahy
Hi all, is there any already existing patch or feature which allows to search bitwise ? so like you would do in mysql : " myField & 5 " which returns in this case all the documents which have myField values 4 and 1 ? greetings, Tim Please see our disclaimer, http://www.infosupport.be/Pages/

RE: hi umar

2008-05-16 Thread Tim Mahy
Hi, you can send a delete query, the delete query is the same syntax as a normal search. so if your id field is called "ID" you can send as query : ID:"1450" instead of *:* (which deletes everything) which will delete in this example the document with id 1450 greetings, Tim __

RE: solr search

2008-05-16 Thread Tim Mahy
Hi, 1) did you perform a commit after the delete ? 2) in the default schema there are some comments on the different analyzers which should help you get started I think greetings, Tim Van: dharhsana [EMAIL PROTECTED] Verzonden: vrijdag 16 mei 2008 13:56 A

RE: Duplicates results when using a non optimized index

2008-05-15 Thread Tim Mahy
sider moving to 1.3-dev. It sounds like the index is already "polluted" with duplicate documents, so you'll want to rebuild the index whether you decide to stay with 1.2 or move to 1.3-dev. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message -

RE: Duplicates results when using a non optimized index

2008-05-14 Thread Tim Mahy
iously deleted documents? - which Solr version are you using and can you try a recent nightly? Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Tim Mahy <[EMAIL PROTECTED]> > To: "solr-user@lucene.apache.org" > Sent: Tuesd

Duplicates results when using a non optimized index

2008-05-13 Thread Tim Mahy
Hi all, is this expected behavior when having an index like this : numDocs : 9479963 maxDoc : 12622942 readerImpl : MultiReader which is in the process of optimizing that when we search through the index we get this : 15257559 15257559 17177888 11825631 11825631 The id field is decl

RE: how to clean an index ?

2008-05-13 Thread Tim Mahy
Hi, you can create a delete query matching al your documents like the query "*:*" greetings, Tim Van: Pierre-Yves LANDRON [EMAIL PROTECTED] Verzonden: dinsdag 13 mei 2008 11:53 Aan: solr-user@lucene.apache.org Onderwerp: how to clean an index ? Hello, I

RE: Delete's increase while adding new documents

2008-05-07 Thread Tim Mahy
Klaas [EMAIL PROTECTED] Verzonden: dinsdag 6 mei 2008 20:17 Aan: solr-user@lucene.apache.org Onderwerp: Re: Delete's increase while adding new documents On 6-May-08, at 4:56 AM, Tim Mahy wrote: > Hi all, > > it seems that we get errors during the auto-commit : > > > java.i

RE: multi-language searching with Solr

2008-05-06 Thread Tim Mahy
Hi, you could also use multiple Solr instances having specific settings and stopwords etc for the same field and upload your documents to the correct instance and than merge the indexes to one searchable index ... greetings, Tim Van: Eli K [EMAIL PR

RE: Delete's increase while adding new documents

2008-05-06 Thread Tim Mahy
its throughout xml messages we get the same kind of errors. Any idea what could be wrong in our configuration to cause these exceptions ? Greetings, Tim ____ Van: Tim Mahy [EMAIL PROTECTED] Verzonden: maandag 28 april 2008 12:11 Aan: solr-user@lucene.apache

RE: Delete's increase while adding new documents

2008-04-28 Thread Tim Mahy
ete's increase while adding new documents On 25-Apr-08, at 4:27 AM, Tim Mahy wrote: > > Hi all, > > we send xml add document messages to Solr and we notice something > very strange. > We autocommit at 10 documents, starting from a total clean index > (removed the data fold

Delete's increase while adding new documents

2008-04-25 Thread Tim Mahy
Hi all, we send xml add document messages to Solr and we notice something very strange. We autocommit at 10 documents, starting from a total clean index (removed the data folder), when we start uploading we notice that the docsPending is going up but also that the deletesPending is going up

Multi language, one "body" field, multi stopwords ?

2008-04-23 Thread Tim Mahy
Hi all, we are in the situation that we want to store documents from x number of languages but in the query we want to query the same field, but at indexing time we want different stopwords text file to be used for the language of the uploaded document. I thought perhaps creating a body field p

RE: Wildcard search + case insensitive

2008-04-02 Thread Tim Mahy
Hi all, I already found the answer to my question on the following blog : http://michaelkimsal.com/blog/2007/04/solr-case-sensitivty/ greetings, Tim -Oorspronkelijk bericht- Van: Tim Mahy [mailto:[EMAIL PROTECTED] Verzonden: wo 2-4-2008 13:19 Aan: solr-user@lucene.apache.org Onderwerp

Wildcard search + case insensitive

2008-04-02 Thread Tim Mahy
Hi all, I use this type definition in my schema.xml : When I have a document with the term "demo" in it and I search for dem* , I receive the document back from Solr, but when I search

Search exact terms

2008-04-02 Thread Tim Mahy
Hi all, is there a Solr wide setting that with which I can achieve the following : if I now search for q=onderwij, I also receive documents with results of "onderwijs" etc.. this is ofcourse the behavior that is described but if I search on "onderwij", I still get the "onderwijs" hits, I use fo

RE: Highlight - get terms used by lucene

2008-03-28 Thread Tim Mahy
Hi, Solr returns the max score and the score per document. This means that the best hit always is 100% which is not always what you want because the article itself could still be quite irrelevant... groeten, Tim -Oorspronkelijk bericht- Van: Chris Hostetter [mailto:[EMAIL PROTECTED] Ve

RE: Highlight - get terms used by lucene

2008-03-27 Thread Tim Mahy
Hi, thanks for the answer, with that information I can pull out the term frequency. Reason for all this, is that we want to use this scoring algorithm: http://download-uk.oracle.com/docs/cd/B19306_01/text.102/b14218/ascore.htm but is there a performance cost on the explain, that can be painfull

Term frequency

2008-03-26 Thread Tim Mahy
Hi All, is there a way to get the term frequency per found result back from Solr ? Greetings, Tim Info Support - http://www.infosupport.com Alle informatie in dit e-mailbericht is onder voorbehoud. Info Support is op geen enkele wijze aansprakelijk voor vergissingen of onjuistheden in dit

Highlight - get terms used by lucene

2008-03-25 Thread Tim Mahy
Hi All, we use highlighting and snippets for our searches. Besides those two, I would want to have a list of terms that lucene used for the highlighting, so that I can pull out of a "Tim OR Antwerpen AND Ekeren" the following terms : Antwerpen, Ekeren if let's say these are the only terms that