Re: BooleanQuery exception

2007-11-23 Thread Chris Hostetter
: Nov 19, 2007 3:01:31 PM org.apache.solr.core.SolrException log : SEVERE: java.lang.NoSuchMethodError: : org.apache.lucene.search.BooleanQuery.clauses()Ljava/util/List; : at org.apache.solr.search.QueryUtils.isNegative(QueryUtils.java:38) that's a pretty gnarly exception. unless there w

Re: OR-ing together filter queries

2007-11-23 Thread Chris Hostetter
: I am writing my own handler, and I would like to pre-filter the results : based on a field. Iÿÿm calling searcher.getDocList() with a custom : constructed query and filters list, but the filters always seem to AND : together. My question is this: how can I construct the List of : filters to

Re: Solr on Windows / Linux

2007-11-23 Thread Chris Hostetter
: looking at CollectionsDistribution and its documentation ( : http://wiki.apache.org/solr/CollectionDistribution). It appeared that it : uses rsync which is specific to Linux systems. : : Does it mean that Collections in Solr are not available on Windows or that : it uses a different mechanism f

Re: [OT] reg Text Utilities

2007-11-23 Thread Venkatraman S
and Visualization Libraries? I know of Prefuse. Any others around? On Nov 24, 2007 7:37 AM, Grant Ingersoll <[EMAIL PROTECTED]> wrote: > I like Aperture (http://aperture.sourceforge.net) and Tika (Lucene > subproject) is coming along nicely. Otherwise, you can use the > individual libraries like

Re: [OT] reg Text Utilities

2007-11-23 Thread Grant Ingersoll
I like Aperture (http://aperture.sourceforge.net) and Tika (Lucene subproject) is coming along nicely. Otherwise, you can use the individual libraries like PDFBox, etc. that Aperture and Tika both use. As for NLP applications, there are many to list, OpenNLP, LingPipe (not free), Carrot, l

Re: Finding the right place to start ...

2007-11-23 Thread Mike Klaas
On 23-Nov-07, at 5:24 PM, Mike Klaas wrote: On 23-Nov-07, at 5:17 PM, Chris Hostetter wrote: : The best way to help is to try out the patch, make sure it applies, see if the : functionality is working, and review the code changes. Review is usually the : biggest bottleneck in open-sour

Re: Finding the right place to start ...

2007-11-23 Thread Mike Klaas
On 23-Nov-07, at 5:17 PM, Chris Hostetter wrote: : The best way to help is to try out the patch, make sure it applies, see if the : functionality is working, and review the code changes. Review is usually the : biggest bottleneck in open-source development. this thread reminded me of so

[OT] reg Text Utilities

2007-11-23 Thread Venkatraman S
Hi, I would be interested in knowing as to which open source utilities does the community use for text conversions , as in, pdf to text, xls to text, word to text , ps to text etc etc. Are their any other 'interesting' utilities/libraries(free and available for commercial use) that can be used fo

Re: Finding the right place to start ...

2007-11-23 Thread Chris Hostetter
: The best way to help is to try out the patch, make sure it applies, see if the : functionality is working, and review the code changes. Review is usually the : biggest bottleneck in open-source development. this thread reminded me of some stuff i've in the back of my mind about "contributing"

Re: Near Duplicate Documents

2007-11-23 Thread Ken Krugler
It's correct that uncommon words are most likely not showing up in the signature. However, I was trying to say that if two documents has 99% common tokens and differ in one token with frequency > quantised frequency, the two resulted hashes are completely different. I think that the change in th

Re: Help with Debian solr/jetty install?

2007-11-23 Thread Jan-Pascal van Best
Hi all, I'm the maintainer of the Debian solr-jetty package, so I thought I'd step in. Solr-jetty worked with an earlier version of the Debian Jetty package, but it doesn't any more with the current Jetty version. Moreover, the snoop.jsp demo page (http://localhost:8280/snoop.jsp) also doesn't wor

Re: Memory use with sorting problem

2007-11-23 Thread Chris Laux
Hi Otis, thanks for the hint. Turns out I have 17.8 million unique terms. I'm fairly sure by now that the problem lies with the sorting. In the Lucene java docs (http://lucene.zones.apache.org:8080/hudson/job/Lucene-Nightly/javadoc/org/apache/lucene/search/Sort.html) it is stated that > Sorting u

Re: Get last updated/committed document

2007-11-23 Thread climbingrose
Assuming that you have the timestamp field defined: q=*:*&sort=timestamp desc On Nov 23, 2007 10:43 PM, Thorsten Scherler <[EMAIL PROTECTED]> wrote: > Hi all, > > I need to ask solr to return me the id of the last committed document. > > Is there a way to archive this via a standard lucene query o

Get last updated/committed document

2007-11-23 Thread Thorsten Scherler
Hi all, I need to ask solr to return me the id of the last committed document. Is there a way to archive this via a standard lucene query or do I need a custom connector that gives me this information? TIA for any information salu2 -- Thorsten Scherler thorsten.