Nice, that's the trick to remember.
On Wednesday, October 13, 2010 12:16:07 pm Robert Muir wrote:
> > SEVERE: java.lang.NoSuchMethodError:
> > org.apache.lucene.analysis.standard.StandardFilter.(Lorg/apache/luc
> > ene/util/Version;Lorg/apache/lucene/analysis/TokenStream;)V
--
Markus Jelsma - CT
you need to clean and recompile
> SEVERE: java.lang.NoSuchMethodError:
> org.apache.lucene.analysis.standard.StandardFilter.(Lorg/apache/lucene/util/Version;Lorg/apache/lucene/analysis/TokenStream;)V
Ah, now I see. Results are always sorted first by the edit distance, then
by the popularity. What I think would work even better than allowing a
custom compareTo function would be to incorporate the frequency directly
into the distance function. This would allow for greater control over the
trad
On Oct 8, 2008, at 6:20 PM, Jason Rennie wrote:
On Wed, Oct 8, 2008 at 3:31 PM, Jason Rennie <[EMAIL PROTECTED]>
wrote:
I just tried J-W and *yes* it seems to do a much better job! I'd
certainly
vote for that becoming the default :)
Ack! I did some more testing and J-W results starte
On Wed, Oct 8, 2008 at 3:31 PM, Jason Rennie <[EMAIL PROTECTED]> wrote:
> I just tried J-W and *yes* it seems to do a much better job! I'd certainly
> vote for that becoming the default :)
>
Ack! I did some more testing and J-W results started to get weird
(including suggesting "courses" for "c
On Wed, Oct 8, 2008 at 3:05 PM, Grant Ingersoll <[EMAIL PROTECTED]> wrote:
> chane is in the dictionary. For better or worse, Lucene skips words that
> are in the dictionary when OMP is false.
Ah, I see. I think we'll use OMP=true, which seems like a reasonable
setting anyway.
> Makes sense
On Oct 8, 2008, at 2:03 PM, Jason Rennie wrote:
On Wed, Oct 8, 2008 at 1:24 PM, Grant Ingersoll
<[EMAIL PROTECTED]> wrote:
Token: chane OMP: false
Oct 8, 2008 1:19:56 PM org.apache.solr.core.SolrCore execute
INFO: [spell] webapp=null path=/select
params={q=description
%3Achane
&spellcheck
On Wed, Oct 8, 2008 at 1:24 PM, Grant Ingersoll <[EMAIL PROTECTED]> wrote:
> Token: chane OMP: false
> Oct 8, 2008 1:19:56 PM org.apache.solr.core.SolrCore execute
> INFO: [spell] webapp=null path=/select
> params={q=description%3Achane&spellcheck=true&spellcheck.onlyMorePopular=false&spellcheck.e
Hi Jason,
Here's what I did:
1. Took your code and modified it to be that of [1] below
2. Set up your config, schema, etc. as per the EmbeddedSolrServer
paths in the code (a Maven like dir structure w/ src/main/resources/
solr/spell containing your configuration.
3. Ran the code. My output
Hi Grant,
Here are solr config files (attached) and java code (included below) to
recreate the test case.
Jason
List> terms = new ArrayList>();
terms.add(new Pair("chanel", 834));
terms.add(new Pair("chant", 10));
terms.add(new Pair("chang", 8));
terms.add
Sure. I just sent the relevant files/code directly to you. Let me know if
you don't get them or have any trouble with them.
Jason
On Tue, Oct 7, 2008 at 3:27 PM, Grant Ingersoll <[EMAIL PROTECTED]> wrote:
> Can you share your spellchecker setup and the code for the test case? I
> would like t
Can you share your spellchecker setup and the code for the test case?
I would like to reproduce it and see what's going on.
On Oct 7, 2008, at 2:18 PM, Jason Rennie wrote:
On Tue, Oct 7, 2008 at 11:56 AM, Grant Ingersoll
<[EMAIL PROTECTED]>wrote:
Is there anyway you can write up a small
On Tue, Oct 7, 2008 at 11:56 AM, Grant Ingersoll <[EMAIL PROTECTED]>wrote:
> Is there anyway you can write up a small test case? This definitely sounds
> like a bug.
I tried adding single word documents according to the top ten suggestions
and frequencies for "chanl". I.e. I created a fresh in
On Oct 6, 2008, at 6:10 PM, Jason Rennie wrote:
I've been using spellcheck.count=10 since that seems to yield a much
better
top result than using the default count of 1. However, I'm still
seeing
weird cases. Here are a few queries with returned suggestions.
Frequency
counts are in par
On Oct 6, 2008, at 4:08 PM, Jason Rennie wrote:
I've noticed a few issues with spellcheck as I've been testing it
out for
use on our site...
1. Rebuild breaks requests - I'm using rebuildOnCommit ATM. If a
commit
is going on and files are being rebuilt in the spellcheck data dir,
s
I've been using spellcheck.count=10 since that seems to yield a much better
top result than using the default count of 1. However, I'm still seeing
weird cases. Here are a few queries with returned suggestions. Frequency
counts are in parenthesis.
- query is "candyz". Suggestions are: 1. "c
16 matches
Mail list logo