Re: Geographical distance searching

2007-09-27 Thread Guillaume Smet
Hi Patrick, On 9/27/07, patrick o'leary <[EMAIL PROTECTED]> wrote: > p.s after a little tidy up I'll be adding this to both lucene and solr's > repositories if folks feel that it's a useful addition. It's definitely very interesting. Did you compare performances of Lucene with a database allowi

Re: searching for non-empty fields

2007-09-27 Thread Pieter Berkel
While in theory -URL:"" should be valid syntax, the Lucene query parser doesn't accept it and throws a ParseException. I've considered raising this issue on lucene-dev but it didn't seem to affect many users so I decided not to pursue the matter. On 27/09/2007, Chris Hostetter <[EMAIL PROTECTED

Re: Geographical distance searching

2007-09-27 Thread patrick o'leary
As far as I'm concerned nothings going to beat PG's GIS calculations, but it's tsearch was a lot slower than myisam. My goal was a single solution to reduce our complexity, but am interested to know if combining both an rdbms & lucene works for you. Definitely let me know how it goes ! P Gu

Re: searching for non-empty fields

2007-09-27 Thread Brian Whitman
thanks Peter, Hoss and Ryan.. q=(URL:[* TO *] -URL:"") This gives me 400 Query parsing error: Cannot parse '(URL:[* TO *] - URL:"")': Lexical error at line 1, column 29. Encountered: "\"" (34), after : "\"" adding something like: I'll do this but the problem here is I have to wait

LockObtainFailedException

2007-09-27 Thread Jae Joo
will anyone help me why and how? org.apache.lucene.store.LockObtainFailedException: Lock obtain timed out: SimpleFSLock@/usr/local/se archengine/apache-solr-1.2.0/fr_companies/solr/data/index/write.lock at org.apache.lucene.store.Lock.obtain(Lock.java:70) at org.apache.lucene.inde

Re: LockObtainFailedException

2007-09-27 Thread matt davies
quick fix look for a lucene lock file in your tmp directory and delete it, then restart solr, should start I am an idiot though, so be careful, in fact, I'm worse than an idiot, I know a little :-) you got a lock file somewhere though, deleting that will help you out, for me it was in

Re: What is facet?

2007-09-27 Thread Erik Hatcher
On Sep 26, 2007, at 7:28 PM, Chris Hostetter wrote: cool => (popularity:[100 TO *] (+numFeatures:[10 TO *] +price:[0 TO 10])) lame => (+popularity:[* TO 99] +numFeatures:[* TO 9] +price:[11 TO *]) That example is definitely in the cool category. I couldn't resist creating a SolrTerm

Re: searching for non-empty fields

2007-09-27 Thread Yonik Seeley
On 9/27/07, Pieter Berkel <[EMAIL PROTECTED]> wrote: > While in theory -URL:"" should be valid syntax, the Lucene query parser > doesn't accept it and throws a ParseException. I don't have time to work on that now, but I did just open a bug: https://issues.apache.org/jira/browse/LUCENE-1006 -Yoni

Request for graphics

2007-09-27 Thread Benjamin Liles
I am trying to make a presentation on SOLR and have been unable to find the SOLR graphic in high quality. Could someone point me in the right direction or provide the graphics? Thanks, Benjamin Liles Lead Software Application Developer Digital Initiatives - Web Services University Libraries

moving index

2007-09-27 Thread Jae Joo
Hi, I do need to move the index files, but have a concerns any potential problem including performance? Do I have to keep the original document for querying? Thanks, Jae Joo

Re: moving index

2007-09-27 Thread Yonik Seeley
On 9/27/07, Jae Joo <[EMAIL PROTECTED]> wrote: > I do need to move the index files, but have a concerns any potential problem > including performance? > Do I have to keep the original document for querying? I assume you posted XML documents in Solr XML format (like ...)? If so, that is just an exa

Re: Converting German special characters / umlaute

2007-09-27 Thread Steven Rowe
Chris Hostetter wrote: > : is there an analyzer which automatically converts all german special > : characters to their specific dissected from, such as ü to ue and ä to > : ae, etc.?! > > See also the ISOLatin1TokenFilter which does this regardless of langauge. Actually, ISOLatin1TokenFilter doe

Problem with handle hold deleted files

2007-09-27 Thread Danilo Fantinato
Hi, I'm using EmbeddedSolrServer and when I start the snapinstaller process i'm calling the commit method of the EmbeddedSolr througth a servlet but the JVM holds deleted files on Operating System and usage disk space excessive. Follow line sample from the command "lsof |grep deleted" java 1

Re: LockObtainFailedException

2007-09-27 Thread Jae Joo
In solrconfig.xml, false 10 25000 1400 500 1000 1 Does writeLockTimeout too small? Thanks, Jae On 9/27/07, matt davies <[EMAIL PROTECTED]> wrote: > > quick fix > > look for a lucene lock file in your tmp directory and delete it, then > restart solr, should start >

Re: searching for non-empty fields

2007-09-27 Thread Yonik Seeley
On 9/27/07, Yonik Seeley <[EMAIL PROTECTED]> wrote: > On 9/27/07, Pieter Berkel <[EMAIL PROTECTED]> wrote: > > While in theory -URL:"" should be valid syntax, the Lucene query parser > > doesn't accept it and throws a ParseException. > > I don't have time to work on that now, OK, I lied :-) It wa

Re: Converting German special characters / umlaute

2007-09-27 Thread J.J. Larrea
At 12:13 PM -0400 9/27/07, Steven Rowe wrote: >Chris Hostetter wrote: >> : is there an analyzer which automatically converts all german special >> : characters to their specific dissected from, such as ü to ue and ä to >> : ae, etc.?! >> >> See also the ISOLatin1TokenFilter which does this regardle

Re: Converting German special characters / umlaute

2007-09-27 Thread Walter Underwood
Accent transforms are language-specific, so an accent filter should take an ISO langauge code as an argument. Some examples: * In French and English, a diereses is a hint to pronounce neighboring vowels separateley, as in coöp, naïve, or Noël. * In German, ü transformes to ue. * In Swedish, ö i

Date facetting and ranges overlapping

2007-09-27 Thread Guillaume Smet
Hi all, I'm now using date facetting to browse events. It works really fine and is really useful. The only problem so far is that if I have an event which is exactly on the boundary of two ranges, it is referenced 2 times. If we admit that we have a gap of 6 hours starting from 2007-09-27 12:00,

Re: custom sorting

2007-09-27 Thread Chris Hostetter
: > Previously we were using lucene to do this. by using the : > SortComparatorSource we could sort the documents returned by distance : > nicely. we are now switching over to lucene because of the features it : > provides, however i am not able to see a way to do this in Solr. Someone started a

Re: LockObtainFailedException

2007-09-27 Thread Chris Hostetter
In "normal" solr usage, where Solr is the only thing writing to your index, you should never get a lock timeout ... typical reasosn for this to happen are if your servlet container crashed or was shutdown uncleanly and Solr wasn't able to clean up it's lock file (check your logs) There is an

Re: Date facetting and ranges overlapping

2007-09-27 Thread Chris Hostetter
: I'm now using date facetting to browse events. It works really fine : and is really useful. The only problem so far is that if I have an : event which is exactly on the boundary of two ranges, it is referenced : 2 times. yeah, this is one of the big caveats with date faceting right now ... i st

Re: custom sorting

2007-09-27 Thread Erik Hatcher
On Sep 27, 2007, at 2:50 PM, Chris Hostetter wrote: to answer the broader question of using customized LUcene SortComparatorSource objects in solr -- it is in fact possible. In Solr, all decisisons about how to sort are driven by FieldTypes. You can subclass any of the FieldTypes that come w

Selecting Distinct values?

2007-09-27 Thread David Whalen
Hi there. Is there a query I can use to select distinct values in an index? I thought I could use a facet, but the facets don't seem to return all the distinct values in the index, only the highest-count ones. Is there another query I can try? Or, can I adjust the facets somehow to make this wor

Re: custom sorting

2007-09-27 Thread Yonik Seeley
On 9/27/07, Erik Hatcher <[EMAIL PROTECTED]> wrote: > Using something like this, how would the custom SortComparatorSource > get a parameter from the request to use in sorting calculations? perhaps hook in via function query: dist(10.4,20.2,geoloc) And either manipulate the score with that and

Re: Date facetting and ranges overlapping

2007-09-27 Thread Guillaume Smet
On 9/27/07, Chris Hostetter <[EMAIL PROTECTED]> wrote: > a better option (assuming a query parser change) would be a new option > thta says wether each computed range should be enclusive of the low poin,t > the high point, both end points, neither end points, or be "smart" (where > smart is the sam

Re: Selecting Distinct values?

2007-09-27 Thread Mike Klaas
On 27-Sep-07, at 12:01 PM, David Whalen wrote: Hi there. Is there a query I can use to select distinct values in an index? I thought I could use a facet, but the facets don't seem to return all the distinct values in the index, only the highest-count ones. Is there another query I can try? Or

Re: Date facetting and ranges overlapping

2007-09-27 Thread Guillaume Smet
On 9/27/07, Chris Hostetter <[EMAIL PROTECTED]> wrote: > The simple workarround: if you know all of your data is indexed with > perfect 0.000second precision, then put "-1MILLI" at the end of your start > and end date faceting params. It fixed my problem. Thanks. -- Guillaume

RE: What is facet?

2007-09-27 Thread Teruhiko Kurosaka
Thank you Ezra and Chris for explaining this, and I like your idea, Erik. This will make intro to Solr easier for new comers, and make Solr more popular. -Kuro > That example is definitely in the cool category. I couldn't resist > creating a SolrTerminology wiki page linking to your post a

RE: Selecting Distinct values?

2007-09-27 Thread David Whalen
Silly me. Thanks! > -Original Message- > From: Mike Klaas [mailto:[EMAIL PROTECTED] > Sent: Thursday, September 27, 2007 4:46 PM > To: solr-user@lucene.apache.org > Subject: Re: Selecting Distinct values? > > On 27-Sep-07, at 12:01 PM, David Whalen wrote: > > > Hi there. > > > >

maxBufferedDocs vs autoCommit->maxDocs

2007-09-27 Thread Bouis, Laurent
Hi, What is the difference between the 1000, and the 1000... parameters in solrconfig.xml? Do they influence the frequency of flush to disk and document distribution in segments in a different way? When I did some test with low identical values, I saw similar behavior in terms of frequ

Re: maxBufferedDocs vs autoCommit->maxDocs

2007-09-27 Thread Mike Klaas
On 27-Sep-07, at 3:35 PM, Bouis, Laurent wrote: What is the difference between the 1000, and the 1000... parameters in solrconfig.xml? Do they influence the frequency of flush to disk and document distribution in segments in a different way? maxBufferedDocs affects disk flushing behaviour, bu

one query or multiple queries

2007-09-27 Thread Xuesong Luo
Hi, I have a user index(each user has a unique index record) and need to get information for 10 users. Should I run 10 queries or 1 query with multiple user ids? Any performance difference? Thanks Xuesong

Re: anyone can send me jetty-plus

2007-09-27 Thread Matt Kangas
If you're using Jetty 6, there's no need for a separate "Jetty Plus" download. The "plus" jarfiles come in the standard distribution. --matt On Sep 27, 2007, at 12:10 AM, James liu wrote: i can't download it from http://jetty.mortbay.org/jetty5/plus/ index.html -- regards jl -- Matt Kang