Re: Spellcheck issues in 3.1

2010-10-13 Thread Markus Jelsma
Nice, that's the trick to remember.
On Wednesday, October 13, 2010 12:16:07 pm Robert Muir wrote:
> > SEVERE: java.lang.NoSuchMethodError:
> > org.apache.lucene.analysis.standard.StandardFilter.<init>(Lorg/apache/lucene/util/Version;Lorg/apache/lucene/analysis/TokenStream;)V
-- Markus Jelsma - CT

Re: Spellcheck issues in 3.1

2010-10-13 Thread Robert Muir
You need to clean and recompile.
> SEVERE: java.lang.NoSuchMethodError:
> org.apache.lucene.analysis.standard.StandardFilter.<init>(Lorg/apache/lucene/util/Version;Lorg/apache/lucene/analysis/TokenStream;)V

Re: spellcheck: issues

2008-10-10 Thread Jason Rennie
Ah, now I see. Results are always sorted first by the edit distance, then by the popularity. What I think would work even better than allowing a custom compareTo function would be to incorporate the frequency directly into the distance function. This would allow for greater control over the trad
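To make the contrast in the message above concrete, here is a minimal sketch of the two orderings: a two-key sort (edit distance first, popularity as tie-breaker) versus a single score that folds frequency into the distance. The class, the blended formula, the weight, and the distances are all hypothetical and chosen for illustration; this is not Lucene's or Solr's implementation. Only the words and frequencies come from the test data posted in this thread.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

class SuggestionRanking {

    /** A candidate correction: the word, its edit distance to the query, and its index frequency. */
    static final class Candidate {
        final String word;
        final int distance;
        final int frequency;

        Candidate(String word, int distance, int frequency) {
            this.word = word;
            this.distance = distance;
            this.frequency = frequency;
        }
    }

    /** Two-key ordering: smaller edit distance first, ties broken by higher frequency. */
    static List<Candidate> byDistanceThenFrequency(List<Candidate> in) {
        List<Candidate> out = new ArrayList<>(in);
        out.sort(Comparator.comparingInt((Candidate c) -> c.distance)
                .thenComparing(Comparator.comparingInt((Candidate c) -> c.frequency).reversed()));
        return out;
    }

    /** Hypothetical blended score: distance discounted by log frequency. Lower is better. */
    static double blendedScore(Candidate c, double weight) {
        return c.distance - weight * Math.log1p(c.frequency);
    }

    /** Single-key ordering on the blended score, so frequency can outweigh a small distance gap. */
    static List<Candidate> byBlendedScore(List<Candidate> in, double weight) {
        List<Candidate> out = new ArrayList<>(in);
        out.sort(Comparator.comparingDouble((Candidate c) -> blendedScore(c, weight)));
        return out;
    }

    public static void main(String[] args) {
        List<Candidate> cands = new ArrayList<>();
        // Frequencies are from the thread's test data; distances are made up for illustration.
        cands.add(new Candidate("chanel", 2, 834));
        cands.add(new Candidate("chant", 1, 10));
        cands.add(new Candidate("chang", 1, 8));

        for (Candidate c : byDistanceThenFrequency(cands)) {
            System.out.println("two-key: " + c.word);
        }
        for (Candidate c : byBlendedScore(cands, 0.5)) {
            System.out.println("blended: " + c.word);
        }
    }
}
```

With the two-key sort a very popular word can never outrank a rare word that is one edit closer; with a blended score the weight controls how much frequency is allowed to compensate for distance.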

Re: spellcheck: issues

2008-10-09 Thread Grant Ingersoll
On Oct 8, 2008, at 6:20 PM, Jason Rennie wrote: On Wed, Oct 8, 2008 at 3:31 PM, Jason Rennie <[EMAIL PROTECTED]> wrote: I just tried J-W and *yes* it seems to do a much better job! I'd certainly vote for that becoming the default :) Ack! I did some more testing and J-W results starte

Re: spellcheck: issues

2008-10-08 Thread Jason Rennie
On Wed, Oct 8, 2008 at 3:31 PM, Jason Rennie <[EMAIL PROTECTED]> wrote: > I just tried J-W and *yes* it seems to do a much better job! I'd certainly > vote for that becoming the default :) > Ack! I did some more testing and J-W results started to get weird (including suggesting "courses" for "c

Re: spellcheck: issues

2008-10-08 Thread Jason Rennie
On Wed, Oct 8, 2008 at 3:05 PM, Grant Ingersoll <[EMAIL PROTECTED]> wrote: > chane is in the dictionary. For better or worse, Lucene skips words that > are in the dictionary when OMP is false. Ah, I see. I think we'll use OMP=true, which seems like a reasonable setting anyway. > Makes sense
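The onlyMorePopular (OMP) behaviour described above can be modelled roughly as follows. This is a simplified sketch of the rule as stated in the thread, not the actual Lucene code: the class and method names are hypothetical, and the frequency assigned to "chane" is an assumption, since the thread only says the word is in the dictionary.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class OnlyMorePopularSketch {

    /**
     * Filters candidate suggestions for a token.
     * OMP=false: a token that is already in the dictionary gets no suggestions.
     * OMP=true: only candidates more frequent than the token itself are kept.
     */
    static List<String> filter(String token, List<String> candidates,
                               Map<String, Integer> freq, boolean onlyMorePopular) {
        int tokenFreq = freq.getOrDefault(token, 0);
        if (!onlyMorePopular && tokenFreq > 0) {
            return new ArrayList<>(); // token is a known word: nothing to suggest
        }
        List<String> kept = new ArrayList<>();
        for (String candidate : candidates) {
            int f = freq.getOrDefault(candidate, 0);
            if (f < 1) {
                continue; // candidate is not in the dictionary at all
            }
            if (onlyMorePopular && f <= tokenFreq) {
                continue; // not more popular than the token itself
            }
            kept.add(candidate);
        }
        return kept;
    }

    public static void main(String[] args) {
        Map<String, Integer> freq = new HashMap<>();
        freq.put("chanel", 834);
        freq.put("chant", 10);
        freq.put("chang", 8);
        freq.put("chane", 1); // assumed frequency; the thread only says it is in the dictionary

        List<String> candidates = Arrays.asList("chanel", "chant", "chang");
        System.out.println("OMP=false: " + filter("chane", candidates, freq, false));
        System.out.println("OMP=true:  " + filter("chane", candidates, freq, true));
    }
}
```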

Re: spellcheck: issues

2008-10-08 Thread Grant Ingersoll
On Oct 8, 2008, at 2:03 PM, Jason Rennie wrote: On Wed, Oct 8, 2008 at 1:24 PM, Grant Ingersoll <[EMAIL PROTECTED]> wrote: Token: chane OMP: false Oct 8, 2008 1:19:56 PM org.apache.solr.core.SolrCore execute INFO: [spell] webapp=null path=/select params={q=description%3Achane&spellcheck

Re: spellcheck: issues

2008-10-08 Thread Jason Rennie
On Wed, Oct 8, 2008 at 1:24 PM, Grant Ingersoll <[EMAIL PROTECTED]> wrote: > Token: chane OMP: false > Oct 8, 2008 1:19:56 PM org.apache.solr.core.SolrCore execute > INFO: [spell] webapp=null path=/select > params={q=description%3Achane&spellcheck=true&spellcheck.onlyMorePopular=false&spellcheck.e
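For readers puzzling over the params string in the log line, the sketch below shows how such a query string is URL-encoded, which is why the colon in description:chane appears as %3A. It uses only the JDK; the class and method names are hypothetical, and only the parameters fully visible in the log excerpt are included (the truncated one is left out rather than guessed).

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;
import java.util.LinkedHashMap;
import java.util.Map;

class SpellcheckQueryString {

    /** URL-encodes one key or value as UTF-8. */
    static String enc(String s) {
        try {
            return URLEncoder.encode(s, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            // UTF-8 is required on every Java platform, so this should not happen.
            throw new IllegalStateException(e);
        }
    }

    /** Joins the parameters as key=value pairs separated by '&', in insertion order. */
    static String build(Map<String, String> params) {
        StringBuilder sb = new StringBuilder();
        for (Map.Entry<String, String> e : params.entrySet()) {
            if (sb.length() > 0) {
                sb.append('&');
            }
            sb.append(enc(e.getKey())).append('=').append(enc(e.getValue()));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        Map<String, String> params = new LinkedHashMap<>();
        params.put("q", "description:chane");
        params.put("spellcheck", "true");
        params.put("spellcheck.onlyMorePopular", "false");
        // prints q=description%3Achane&spellcheck=true&spellcheck.onlyMorePopular=false
        System.out.println(build(params));
    }
}
```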

Re: spellcheck: issues

2008-10-08 Thread Grant Ingersoll
Hi Jason, Here's what I did: 1. Took your code and modified it to be that of [1] below. 2. Set up your config, schema, etc. as per the EmbeddedSolrServer paths in the code (a Maven-like dir structure w/ src/main/resources/solr/spell containing your configuration). 3. Ran the code. My output

Re: spellcheck: issues

2008-10-08 Thread Jason Rennie
Hi Grant, Here are solr config files (attached) and java code (included below) to recreate the test case. Jason List<Pair<String, Integer>> terms = new ArrayList<Pair<String, Integer>>(); terms.add(new Pair("chanel", 834)); terms.add(new Pair("chant", 10)); terms.add(new Pair("chang", 8)); terms.add

Re: spellcheck: issues

2008-10-07 Thread Jason Rennie
Sure. I just sent the relevant files/code directly to you. Let me know if you don't get them or have any trouble with them. Jason On Tue, Oct 7, 2008 at 3:27 PM, Grant Ingersoll <[EMAIL PROTECTED]> wrote: > Can you share your spellchecker setup and the code for the test case? I > would like t

Re: spellcheck: issues

2008-10-07 Thread Grant Ingersoll
Can you share your spellchecker setup and the code for the test case? I would like to reproduce it and see what's going on. On Oct 7, 2008, at 2:18 PM, Jason Rennie wrote: On Tue, Oct 7, 2008 at 11:56 AM, Grant Ingersoll <[EMAIL PROTECTED]> wrote: Is there any way you can write up a small

Re: spellcheck: issues

2008-10-07 Thread Jason Rennie
On Tue, Oct 7, 2008 at 11:56 AM, Grant Ingersoll <[EMAIL PROTECTED]> wrote: > Is there any way you can write up a small test case? This definitely sounds > like a bug. I tried adding single-word documents according to the top ten suggestions and frequencies for "chanl". I.e. I created a fresh in

Re: spellcheck: issues

2008-10-07 Thread Grant Ingersoll
On Oct 6, 2008, at 6:10 PM, Jason Rennie wrote: I've been using spellcheck.count=10 since that seems to yield a much better top result than using the default count of 1. However, I'm still seeing weird cases. Here are a few queries with returned suggestions. Frequency counts are in par

Re: spellcheck: issues

2008-10-07 Thread Grant Ingersoll
On Oct 6, 2008, at 4:08 PM, Jason Rennie wrote: I've noticed a few issues with spellcheck as I've been testing it out for use on our site... 1. Rebuild breaks requests - I'm using rebuildOnCommit ATM. If a commit is going on and files are being rebuilt in the spellcheck data dir, s

Re: spellcheck: issues

2008-10-06 Thread Jason Rennie
I've been using spellcheck.count=10 since that seems to yield a much better top result than using the default count of 1. However, I'm still seeing weird cases. Here are a few queries with returned suggestions. Frequency counts are in parentheses. - query is "candyz". Suggestions are: 1. "c