2014 16:42
An: solr-user@lucene.apache.org
Betreff: RE: Spellchecking and suggesting part numbers
Alexander,
You could use a higher value for spellcheck.count, maybe 20 or so, then in your
application pick out the suggestions that make changes on the right side.
Another option is to use
I’ve done something similar to this using the the EdgeNGram not the
spellchecker component, I don’t know if this is along with your requirements:
The relevant portion of my fieldType config:
class="solr.SpellCheckComponent">
>
>
cene.apache.org
Subject: Spellchecking and suggesting part numbers
Hello Solr Users,
we are trying to get suggestions for part numbers using the spellchecker.
Problem scenario:
ABCD1234 // This is the search term
ABCE1234 // This is what we get from spellchecker
ABCD1244 // This is what we would l
Hello Solr Users,
we are trying to get suggestions for part numbers using the spellchecker.
Problem scenario:
ABCD1234 // This is the search term
ABCE1234 // This is what we get from spellchecker
ABCD1244 // This is what we would like to get from spellchecker
Characters towards the left of our
I'm working with business names which are even sometimes people names such as "
Wardell F E B Dr ". I suspect I need to change my logic to not try to rely on
spellchecking so much as you suggest.
Thanks.
Corey
-Original Message-
From: Dyer, James [mailto:james.d...@in
com]
Sent: Friday, August 15, 2014 3:21 PM
To: Solr User List
Subject: Spellchecking suggestions won't collate
It must be Friday. I can't figure out why there is no collation value:
{
"responseHeader":{
"status":0,
"QTime":31,
"
It must be Friday. I can't figure out why there is no collation value:
{
"responseHeader":{
"status":0,
"QTime":31,
"params":{
"spellcheck":"on",
"spellcheck.collateParam.qf":"BUS_BUSINESS_NAME",
"spellcheck.maxResultsForSuggest":"5",
"spellcheck.maxCollations
h benoit
> Sent:Wed 14-05-2014 14:01
> Subject:permissive mm value and efficient spellchecking
> To:solr-user@lucene.apache.org;
> Hello,
>
> I'm using solr 4.2.1.
>
> I use a very permissive value for mm, to be able to find results even if
> request contains non re
Hello,
I'm using solr 4.2.1.
I use a very permissive value for mm, to be able to find results even if
request contains non relevant words.
At the same time, I'd like to be able to do some efficient spellcheking
with solrdirectspellchecker.
So for instance, if user searches for "rue de Chraonne
Elisabeth, i think you are looking for SOLR-3211 that introduced
spellcheck.collateParam.* to override e.g. dismax settings.
Markus
-Original message-
From:elisabeth benoit
Sent:Wed 14-05-2014 14:01
Subject:permissive mm value and efficient spellchecking
To:solr-user@lucene.apache.org
lto:fied...@gmail.com]
Sent: Saturday, May 03, 2014 10:15 AM
To: solr-user@lucene.apache.org
Subject: Re: Spellchecking - looking for general advice
Hi
I've set it to 2, but python implementation of Levenshtein says its 3 for
restraunt -> restaurant.
On Sat, May 3, 2014 at 2:44 PM, Sushe
t; Thanks,
> Susheel
>
> -Original Message-
> From: Maciej Dziardziel [mailto:fied...@gmail.com]
> Sent: Friday, May 02, 2014 7:05 PM
> To: solr-user@lucene.apache.org
> Subject: Spellchecking - looking for general advice
>
> Hi
>
> I was looking at spellcheck (
How much is the maxEdits you have set. It should catch restaurant example with
edit distance set to 2.
Thanks,
Susheel
-Original Message-
From: Maciej Dziardziel [mailto:fied...@gmail.com]
Sent: Friday, May 02, 2014 7:05 PM
To: solr-user@lucene.apache.org
Subject: Spellchecking
Hi
I was looking at spellcheck (Direct and FileBased) and testing that they can do.
Direct works fine most of the time, but I'd like to find solution for
few corner cases:
1) having "recruted" and "recruiter" in index, "recruter" should
suggest the latter.
Obviously the distance to the former
(615) 213-4311
-Original Message-
From: Gastone Penzo [mailto:gastone.pe...@gmail.com]
Sent: Friday, December 20, 2013 8:38 AM
To: solr-user@lucene.apache.org
Subject: Re: Spellchecking problem
Thank you for your answer.
this is the querystring
http://seshat:9000/solr/browse/?q=otto+maialot
Penzo [mailto:gastone.pe...@gmail.com]
> Sent: Friday, December 20, 2013 7:43 AM
> To: solr-user@lucene.apache.org
> Subject: Spellchecking problem
>
> Hello,
>
> i have problem with spellchecking.
> i use solr to index an ecommerce products (dvd, cd, books ecc)
> the collation
config.xml.
James Dyer
Ingram Content Group
(615) 213-4311
-Original Message-
From: Gastone Penzo [mailto:gastone.pe...@gmail.com]
Sent: Friday, December 20, 2013 7:43 AM
To: solr-user@lucene.apache.org
Subject: Spellchecking problem
Hello,
i have problem with spellchecking.
i use solr
Hello,
i have problem with spellchecking.
i use solr to index an ecommerce products (dvd, cd, books ecc)
the collation is only one but in the index there'is the field: typology (of
product)
When i build spellchecking indexes, they are build together.
How can i have only suggestsions o
>
>
> -Original Message-
> From: Gastone Penzo [mailto:gastone.pe...@gmail.com]
> Sent: Friday, September 20, 2013 4:00 AM
> To: solr-user@lucene.apache.org
> Subject: Spellchecking
>
> Hi,
> i'd like to know if is it possibile to have suggests only of a
e
"spellcheck.collateParam.fq=Product:Book".
See http://wiki.apache.org/solr/SpellCheckComponent#spellcheck.collate and
following sections.
James Dyer
Ingram Content Group
(615) 213-4311
-Original Message-
From: Gastone Penzo [mailto:gastone.pe...@gmail.com]
Sent: Friday, September 20, 20
Hi,
i'd like to know if is it possibile to have suggests only of a part of
indexes.
for example:
an ecommerce:
there are a lot of typologies of products (book, dvd, cd..)
if i search inside books, i want only suggests of books products, not cds
but the spellchecking indexs are all together
Hi All,
I didn't have the lucene-solr source compiling cleaning in eclipse
initially so I created a very quick maven project to demonstrate this issue:
https://github.com/rainkinz/solr_spellcheck_index_out_of_bounds.git
Having said that I just got everything set up in eclipse, so I can create a
Further to this. If I change:
tpms,service tire monitor,tire monitor,tire pressure monitor,tire pressure
monitoring system,tpm,low tire warning,tire pressure monitor system
to
service tire monitor,tire monitor,tire pressure monitor,tire pressure
monitoring system,tpm,low tire warning,tire pressu
Hi All,
I've been debugging an issue where the query 'tpms' would make the
spellchecker throw the following exception:
21021 [qtp91486057-17] ERROR org.apache.solr.servlet.SolrDispatchFilter –
null:java.lang.StringIndexOutOfBoundsException: String index out of range:
-1
at java.lang.AbstractStri
mean' spellchecking, as discussed in
https://issues.apache.org/jira/browse/SOLR-2585 (Context-Sensitive
Spelling Suggestions & Collations)
I'm using following environment:
- Solr 4.0-alpha (downloaded 25. June)
-
ipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:599)
> at
> org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:534)
> at java.lang.Thread.run(Thread.java:679)
>
> 500
>
>
>
> Fabio
>
> 2012/6/22 Markus Jelsma
>
>> Hi,
t is empty.
>
> Cheers,
>
>
>
> -Original message-
> > From:fabio curti
> > Sent: Fri 22-Jun-2012 09:34
> > To: solr-user@lucene.apache.org
> > Subject: Re: Solr spellchecking fails on sharded query
> >
> > Hi,
> > i try solr shar
Hi,
The spellcheck component must be enabled in your default request handler
otherwise your suggestions list is empty.
Cheers,
-Original message-
> From:fabio curti
> Sent: Fri 22-Jun-2012 09:34
> To: solr-user@lucene.apache.org
> Subject: Re: Solr spellchecking fail
being used for the various shards.
>
> The problem comes when I submit a sharded query. In that case the result
> comes back with the following:
>
>
>
>
>
> In other words, the list of words for which there are suggestions is empty.
>
> Is there a trick to sharded spellchecking? I appreciate any suggestions.
>
> Eric
>
result
> comes back with the following:
>
>
>
>
>
> In other words, the list of words for which there are suggestions is empty.
>
> Is there a trick to sharded spellchecking? I appreciate any suggestions.
>
> Eric
>
. In that case the result
comes back with the following:
In other words, the list of words for which there are suggestions is empty.
Is there a trick to sharded spellchecking? I appreciate any suggestions.
Eric
Hello,
I have tried to implement spellchecker based on index in nutch-solr by adding
spell field to schema.xml and making it a copy from content field. However,
this increased data folder size twice and spell filed as a copy of content
field appears in xml feed which is not necessary. Is it po
Hello,
I posted on the Lucene Forums, and someone told me to e-mail it here.
Instead of writing again my question here, I take the liberty to link my post.
Its about SolR, autocompletion, Spellchecking and "case-sentivieness" (?).
http://lucene.472066.n3.nabble.com/SolR-Spellch
l_search1
true
spellchecker1
spell_where
spell_search2
true
spellchecker2
--
View this message in context:
http://lucene.472066.n3.nabble.com/Problem-with-spellchecking
> How can i enable it in my search request handler and search both in one
> request?
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Problem-with-spellchecking-dont-want-multiple-request-to-SOLR-tp2988167p2992076.html
> Sent from the Solr - User mailing list archive at Nabble.com.
How can i enable it in my search request handler and search both in one
request?
--
View this message in context:
http://lucene.472066.n3.nabble.com/Problem-with-spellchecking-dont-want-multiple-request-to-SOLR-tp2988167p2992076.html
Sent from the Solr - User mailing list archive at Nabble.com.
gt; it possible and how?
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Problem-with-spellchecking-dont-want-multiple-request-to-SOLR-tp2988167p2988167.html
> Sent from the Solr - User mailing list archive at Nabble.com.
?
--
View this message in context:
http://lucene.472066.n3.nabble.com/Problem-with-spellchecking-dont-want-multiple-request-to-SOLR-tp2988167p2988167.html
Sent from the Solr - User mailing list archive at Nabble.com.
to use Solr suggester component for
autocompletion and spellchecking at the same time.
1) Does Solr can use the same spellcheck dictionary (that is based upon the
main index) for autocompletion and spellchecking?
2) In solrconfig.xml, should I configure a "suggest" search Compo
the spellChecker index,
only English terms.
Regards,
Alex
--
View this message in context:
http://lucene.472066.n3.nabble.com/Spellchecking-in-the-Chinese-Lanugage-tp2812726p2813149.html
Sent from the Solr - User mailing list archive at Nabble.com.
angle the characters.
On Tue, Apr 12, 2011 at 12:47 PM, Otis Gospodnetic
wrote:
> Hi,
>
> Does spellchecking in Chinese actually make sense? I once asked a native
> Chinese speaker about that and the person told me it didn't really make sense.
> Anyhow, with n-grams, I don
Hi,
Does spellchecking in Chinese actually make sense? I once asked a native
Chinese speaker about that and the person told me it didn't really make sense.
Anyhow, with n-grams, I don't think this could technically work even if it made
sense for Chinese, could it?
Otis
Semate
building the spelling index, I don't see Chinese characters in the
"spellchecker" index, only terms in English.
5. Tried adding the NGramFilterFactory to the CJKAnalyzer with no luck
either.
Thanks!
--
View this message in context:
http://lucene.472066.n3.nabble.com/Spellchecking-i
gt;
>
> And is attached to as a component to my search handler.
>
> Thanks,
>
> Colin
>
>
> > : I'm having an issue performing a spellcheck on some information and
> > : search of the archive isn't helping.
> >
> > For this type of
is working correctly as performing a
search works correctly.
The spellcheck is initialized like this:
title
default
searchfield
./spellchecker
false
This is attached as a component to my search handler and spellchecking
is done inline with the queries.
Thank
forming a spellcheck on some information and
> : search of the archive isn't helping.
>
> For this type of quesiton, there's not much feedback anyone can offer w/o
> knowing exactly what analyzers you have configured for hte various
> fieldtypes (both the field you ind
ou index/search and the fieldtype used for
spellchecking)
it's also fairly critical to know how you have the spellcheck component
configured.
off the cuff: i'd guess that maybe WordDelimiterFilter is being used in a
wonky way given your usecase -- but like i said: would need to see the
I'm having an issue performing a spellcheck on some information and
search of the archive isn't helping.
I'm indexing the word "p!nk" (yes, that's a bang in there), and have a
replacement filter setup so that the ! becomes i. Looking at the
analyzer the right thing is happening with both the inde
context:
http://lucene.472066.n3.nabble.com/Spellchecking-with-some-misspelled-words-in-index-source-tp2505722p2507110.html
Sent from the Solr - User mailing list archive at Nabble.com.
I'm building my spellcheck index from my content and it seems to be working,
but my problem is that there are a few misspelled words in my content. For
example: the word Sheriff is improperly misspelled Sherrif in my content a
couple dozen times (but spelled correctly a couple thousand times). The
st of the time.
James Dyer
E-Commerce Systems
Ingram Content Group
(615) 213-4311
-Original Message-
From: satya swaroop [mailto:satya.yada...@gmail.com]
Sent: Monday, January 17, 2011 10:32 AM
To: solr-user@lucene.apache.org
Subject: spellchecking even the key is true
Hi All,
Hi All,
can we get the spellchecking results even when the keyword is true.
As for spellchecking will give only to the wrong keywords, cant we get
similar and near words of the keyword though the spellcheck.q is true..
as an example
http://localhost:8080/solr/spellcheck?q=java&spellc
I therefore wrote an implementation of SolrSpellChecker that wraps jazzy,
the java aspell library. I also extended the SpellCheckComponent to take
the
matrix of suggested words and query the corpus to find the first
combination
of suggestions which returned a match. This works well for my use ca
e to publish the code in case someone finds it useful (although it's
> a bit crude at the moment and will need a decent tidy up). Would it be
> appropriate to open up a Jira issue for this?
>
> Cheers,
> ~mark
>
> On 27 July 2010 09:33, dan sutton wrote:
>
> > Hi,
l (although it's
> a bit crude at the moment and will need a decent tidy up). Would it be
> appropriate to open up a Jira issue for this?
>
> Cheers,
> ~mark
>
> On 27 July 2010 09:33, dan sutton wrote:
>
> > Hi,
> >
> > I've recently been loo
ike
to see yours to compare.
James Dyer
E-Commerce Systems
Ingram Book Company
(615) 213-4311
-Original Message-
From: Mark Holland [mailto:mark.holl...@zoopla.co.uk]
Sent: Tuesday, July 27, 2010 1:04 PM
To: solr-user@lucene.apache.org
Subject: Re: Spellchecking and frequency
Hi,
I
de in case someone finds it useful (although it's
a bit crude at the moment and will need a decent tidy up). Would it be
appropriate to open up a Jira issue for this?
Cheers,
~mark
On 27 July 2010 09:33, dan sutton wrote:
> Hi,
>
> I've recently been looking into Spellchecking in
Hi,
I've recently been looking into Spellchecking in solr, and was struck by how
limited the usefulness of the tool was.
Like most corpora , ours contains lots of different spelling mistakes for
the same word, so the 'spellcheck.onlyMorePopular' is not really that useful
unless
set('spellcheck.build', 'true');
$response = $client->query($q);
That should do it.
I hope this helps.
On Wed, May 5, 2010 at 4:56 AM, Peter Gabriel wrote:
> Hi there,
>
> i´m working with the solr-pecl extension and asking me how I to permanently
> activ
Hi there,
i´m working with the solr-pecl extension and asking me how I to permanently
activate spellchecking.
I couldn´t find a command from the pecl library to activate it by the client -
like $solrQuery->enableFacet(true) for factes.
Or is it possible to keep spellchecking permanen
Hi,
thanks, exactly that i forgot. Now it works fine. :-)
Am 03.05.2010 16:50, schrieb Michael Kuhlmann:
Am 03.05.2010 16:43, schrieb Jan Kammer:
Hi,
It worked fine with a normal field. There must something wrong with
copyfield, or why does dataimporthandler add/update no more documents?
Am 03.05.2010 16:43, schrieb Jan Kammer:
> Hi,
>
> It worked fine with a normal field. There must something wrong with
> copyfield, or why does dataimporthandler add/update no more documents?
Did you define your destination field as multivalue?
-Michael
uest.
Regards,
Gert.
-Original Message-
From: Jan Kammer [mailto:jan.kam...@mni.fh-giessen.de]
Sent: Montag, 3. Mai 2010 16:26
To: solr-user@lucene.apache.org
Subject: Re: SpellChecking
Hi,
if I define one of my normal fields from schema.xml in solrconfig.xml
for spellchecking all works
.
-Original Message-
From: Jan Kammer [mailto:jan.kam...@mni.fh-giessen.de]
Sent: Montag, 3. Mai 2010 16:26
To: solr-user@lucene.apache.org
Subject: Re: SpellChecking
Hi,
if I define one of my normal fields from schema.xml in solrconfig.xml
for spellchecking all works fine:
...
That didnt
Hi,
if I define one of my normal fields from schema.xml in solrconfig.xml
for spellchecking all works fine:
...
That didnt work, because nothing was in "spell" after that.
Next try was to copy each field in a line to "spell":
...
This does work up to 3 documents, i
It would help a lot to see your actual config file, and if you provided a
bit more
detail about what failure looks like
Best
Erick
On Mon, May 3, 2010 at 9:43 AM, Jan Kammer wrote:
> Hi there,
>
> I want to enable spellchecking, but i got many fields.
>
> I tried around wi
Hi there,
I want to enable spellchecking, but i got many fields.
I tried around with copyfield to copy all with "*" in one field, but
that didnt work.
Next try was to copy some fields specified each by name in one field
named "spell", but that worked only for 2 or 3 field
Another thing you might check into is stemming. The Porter stemmer
included in Solr is "aggressive", meaning that it will tend to do
weird things with misspellings. There is a different stemmer called
KStem which is available from www.lucidimagination.com/Downloads is
less aggressive. Porter turns
Character-based NGrams are a good tool for this problem. MLT is a
document-wide numerical analysis.
If the common types of OCR mistakes are different than what NGrams
create, you might tune the ngram generator. For example, swapping
letters might not happen very often. SIngle- and multi-word error
: My first problem appears because I need suggestions inclusive when the
: expression has returned results. It's seems that only appear
: suggestions when there are no results. Is there a way to do so?
can you give us an example of what your queries look like? with the
example configs, i can ge
I just resolved the issue (fresh coffee == good) !
In my schema, I had added:
but missed the copyField definition. Adding these:
and a restart and everything is working properly.
Thanks for the reply and for LucidImagination -- the only reason I have been
able to get Solr integrated int
What's your schema and your config look like for the various relevant pieces?
On Dec 8, 2009, at 8:04 PM, Michael Boyle wrote:
> Following Eric Hatcher's post about using SolrCell and acts_as_solr {
> http://www.lucidimagination.com/blog/2009/02/17/acts_as_solr_cell/ }, I have
> been able to in
Following Eric Hatcher's post about using SolrCell and acts_as_solr {
http://www.lucidimagination.com/blog/2009/02/17/acts_as_solr_cell/ }, I
have been able to index a rich document stream and retrieve it's id. No
worries.
However, I have the SpellCheckComponent setup to build on commit
(buil
Hello everybody
1. Have tons of digitalized text with the logical errors in OCR process
2. Have indexed with Solr and is working OK.
3. Have added spellchecker index-based for words and phrases with the
hope to offer suggestions with "suspicious" possible new query
expressions, or related query ex
and the caveat that all fields would need to be declared in the
solrconfig.xml (or get used for both fields)
this could work... would also need to augment the response with the
name of the dictionary, or assert that something will be written all
the time (so you could know the 2nd would b
Another thought that might work:
Declare two separate components, one for each field and then implement
a QueryConverter that takes in the field and only extracts the tokens
for the field or choice.
This is a definite workaround, but I think it might work. Hmm, except
we only have one Qu
One way would be to create a copyField containing both the fields and use it
as the dictionary's source.
If you do want to keep separate dictionaries for both the fields then I
guess we can introduce per-dictionary overridable parameters like the
per-field overridden facet parameters. That would b
I have a use case where I want to spellcheck the input query across
multiple fields:
Did you mean: location = washington
vs
Did you mean: person = washington
The current parameter / response structure for the spellcheck
component does not support this kind of thing. Any thoughts on how/i
t;
To: solr-user@lucene.apache.org
Sent: Friday, February 15, 2008 12:45:08 PM
Subject: Re: Integrated Spellchecking
That unfortunately got pushed aside to work on some of our higher
priority solr work since we already had it working one way.
Hoping to revisit this after we push to productio
Allocating some time to this next week. Need to try and remember what issues I was having when I
stopped working on it.
doug
Matthew Runo wrote:
I'd have to agree with this. I'd probably be able to put a bit of work
into it as well, as it's something we'd use for sure if it were available.
ROTECTED]>
To: solr-user@lucene.apache.org
Sent: Friday, February 15, 2008 12:45:08 PM
Subject: Re: Integrated Spellchecking
That unfortunately got pushed aside to work on some of our higher priority solr
work since we
already had it working one way.
Hoping to revisit this after we push to
,
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
- Original Message
> From: Doug Steigerwald <[EMAIL PROTECTED]>
> To: solr-user@lucene.apache.org
> Sent: Friday, February 15, 2008 12:45:08 PM
> Subject: Re: Integrated Spellchecking
>
> That unfort
I'd have to agree with this. I'd probably be able to put a bit of work
into it as well, as it's something we'd use for sure if it were
available.
Thanks!
Matthew Runo
Software Developer
Zappos.com
702.943.7833
On Feb 18, 2008, at 6:09 AM, Grant Ingersoll wrote:
Hey Doug,
If you have perm
Hey Doug,
If you have permission to donate, perhaps you can just post the patch
anyway and state that it isn't quite ready to go. This is something I
could use too, and so may have some cycles to work on it. I hate to
replicate the work if you already have something that is more or less
That unfortunately got pushed aside to work on some of our higher priority solr work since we
already had it working one way.
Hoping to revisit this after we push to production and start working on new features and share what
I've done for this and multicore/spellcheck replication (which we hav
e exists for the spell checker?
>>>
>>> Thanks,
>>> Grant
>>>
>
>
So have you succeeded in implementing this patch? I'd definitely like to use
this functionality as a search suggestion.
--
View this message in context:
http://www.nabble.com/Integrated-Spellchecking-tp14930232p15504125.html
Sent from the Solr - User mailing list archive at Nabble.com.
On Jan 17, 2008, at 3:01 PM, Doug Steigerwald wrote:
I've got a couple search components for automatic spell correction
that I've been working on.
I've converted most of the SpellCheckerRequestHandler to a search
component (hopefully will throw a patch out soon for this). Then
another s
I've got a couple search components for automatic spell correction that I've
been working on.
I've converted most of the SpellCheckerRequestHandler to a search component (hopefully will throw a
patch out soon for this). Then another search component that will do auto correction for a query if
On Jan 17, 2008 2:33 PM, Ryan McKinley <[EMAIL PROTECTED]> wrote:
> Yes -- this is what search components are for!
>
> Depending on where you put it in the chain, it could only return spell
> checked results if there are too few results (or the top score is below
> some threshold)
Score thresholds
Yes -- this is what search components are for!
Depending on where you put it in the chain, it could only return spell
checked results if there are too few results (or the top score is below
some threshold)
ryan
Grant Ingersoll wrote:
Is it feasible to submit a query to any of the various ha
Is it feasible to submit a query to any of the various handlers and
have it bring back results and spelling suggestions all in one
response? Is this something the query components piece would handle,
assuming one exists for the spell checker?
Thanks,
Grant
91 matches
Mail list logo