rences to trying to copyField and process same
> content in different ways. If copyField is not stored, the overhead is not
> as large.
>
> Regards,
> Alex
>
>
>
> On Tue., Mar. 2, 2021, 7:08 a.m. Martin Graney,
> wrote:
>
> > Hi All
> >
> > I hav
different ways. If copyField is not stored, the overhead is not
as large.
Regards,
Alex
On Tue., Mar. 2, 2021, 7:08 a.m. Martin Graney,
wrote:
> Hi All
>
> I have been trying to implement multi word synonyms using `sow=false` into
> a pre-existing system that applied pre-proce
Hi All
I have been trying to implement multi word synonyms using `sow=false` into
a pre-existing system that applied pre-processing to the phrase to apply
wildcards around the terms, i.e. `bread stick` => `*bread* *stick*`.
I got the synonyms expansion working perfectly, after discovering
Hello,
Do you mean that you want searches for "gain" to match documents with
"revenue" on them, but do *not* want searches for "revenue" to match
documents with "gain" on them?
If that's what you mean, how have you defined your synonyms? If you
Hi All,
Using SOLR default Synonyms search I am able to search Synonyms but for
some cases it is giving ambiguous results.
For example one of Synonyms of "Revenue" is "Gain"
Input Keyword for search: Revenue and Company
Irrelevant Output: Our company doesn't wan
hunspell_dictionary/en_US.dic"
> >
> > affix="../hunspell_dictionary/en_US.aff" ignoreCase="true" />
>
> dictionary="../hunspell_dictionary/en_US.dic"
>
> affix="../hunspell_dictionary/en_US.aff" ignoreCase="true" />
>
> dictionary="../hunspell_dictionary/en_US.dic"
>
> affix="../hunspell_di
Hi,
We are using the following configuration:
--
*Schema: *
*Managed Synonyms:* "abc implement", "bike",
Hi! hope everyone is well.
I was looking at some old articles and pondered upon
https://opensourceconnections.com/blog/2018/02/20/edismax-and-multiterm-synonyms-oddities/
.
Do we have a standard manner / robust solution to handle fields with
different analyzers (multi-word synonym etc.) clubbed
Hello, Solr Community:
Actually, you can set up a tokenizer for the managed synonyms.
But the configuration is not in the reference guide, and I do not know how
to add a Tokenizer via an API call.
So, you might need to manually edit a JSON file below the config directory.
In the
I think the question makes sense, as SynonymGraphFilterFactory accepts
tokenizerFactory;
he asked whether the managed version of SynonymGraphFilter could accept it as well.
https://lucene.apache.org/solr/guide/8_5/filter-descriptions.html#synonym-graph-filter
The answer seems to be NO.
Koji
On 2020/0
This question doesn’t really make sense. You don’t specify tokenizers on
filters, they’re specified at the _field_ level.
You can certainly define as many field(type)s as you want, each with a different
analysis chain and those chains can be made up of whatever you want to use, and
there are lots
nts which mention
> "chili pepper" even if they *also* mention "black pepper" or the unmodified
> word "pepper". Maybe some way using synonyms?
>
> Thanks!
>
> -s
>
> On Mon, Jul 6, 2020 at 6:43 PM Thomas Corthals
> wrote:
>
or [pepper NOT ("chili pepper" OR "hot pepper"
OR "pepper sauce")] because that excludes all documents which mention
"chili pepper" even if they *also* mention "black pepper" or the unmodified
word "pepper". Maybe some way using synonyms?
T
Hi,
Is it possible to specify a Tokenizer Factory on a Managed Synonym Graph
Filter? I would like to use a Standard Tokenizer or Keyword Tokenizer on
some fields.
Best,
Thomas
> words="stopwords_en.txt" />
>
> preserveOriginal="true" />
>
>
>
>
> synonyms="synonyms_en.txt" ignoreCase="true" expand="true" />
> words="stop
using multi-word synonyms which
contain stopwords.
If the stopwords appear in the middle, it works fine. For example, if I
have the following in my synonyms file (where i is a stopword):
iphone, apple i phone
And if I query: /select?q=iphone&qf=title&defType=edismax
The pa
Hm, I'm not sure what you mean, but I am pretty new to Solr. Apologies!
On 1/20/20, 12:01 PM, "fiedzia" wrote:
>From my understanding, if you want regional sales manager to be indexed as
both director of sales and area manager, you
>would have to type:
>
>Regional sales ma
>From my understanding, if you want regional sales manager to be indexed as
both director of sales and area manager, you
>would have to type:
>
>Regional sales manager -> director of sales, area manager
that works for searching, but because everything is in the same position,
searching for "dir
From my understanding, if you want regional sales manager to be indexed as both
director of sales and area manager, you would have to type:
Regional sales manager -> director of sales, area manager
I do not believe you can chain synonyms.
Re: bigrams/trigrams, I was more interested in
> what is the reasoning behind adding the bigrams and trigrams manually like
that? Maybe if we knew the end goal, we could figure out a different
strategy. Happy that at least the matching is working now!
I have large amount of synonyms and keep adding new ones, some of them
partially over
Hmm what is the reasoning behind adding the bigrams and trigrams manually
like that? Maybe if we knew the end goal, we could figure out a different
strategy. Happy that at least the matching is working now!
On 1/17/20, 10:28 AM, "fiedzia" wrote:
> Doing it the other way (new york cit
> Doing it the other way (new york city -> new_york_city, new_york) makes
more
sense,
Just checked it, that way does the matching as expected, but highlighting is
wrong
("new york" query matches "new york city" as it should, but also highlights
all of it)
--
Sent from: https://lucene.472066.n3.
> If you instead write "new york => new_york, new_york_city" it should work
I can't do that, as that would turn "new york" into "new york_city", which
is not what I want.
Doing it the other way (new york city -> new_york_city, new_york) makes more
sense, though I expect this to get positions wrong
If you instead write "new york => new_york, new_york_city" it should work
(https://doc.lucidworks.com/fusion/3.1/Collections/Synonyms-Files.html)
On 1/17/20, 6:29 AM, "fiedzia" wrote:
Having synonyms defined for
new york -> new_york
new
Having synonyms defined for
new york -> new_york
new york city -> new_york_city
I'd like the phrase
new york city
to be indexed as both, but SynonymGraphFilter picks only one. Is there a way
around that?
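The "picks only one" behaviour described above is consistent with greedy longest-match rule selection. The following is a minimal Python sketch of that idea only, not Lucene's implementation; the function name `apply_synonyms` and the `RULES` table are made up for illustration, and real analysis also tracks positions and offsets, which this toy ignores.

```python
def apply_synonyms(tokens, rules):
    """Toy greedy longest-match rewrite: at each position the longest rule wins."""
    out = []
    i = 0
    max_len = max(len(key) for key in rules)
    while i < len(tokens):
        # Try the longest candidate span first, then shorter ones.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            key = tuple(tokens[i:i + n])
            if key in rules:
                out.extend(rules[key])
                i += n
                break
        else:
            # No rule starts here: keep the token unchanged.
            out.append(tokens[i])
            i += 1
    return out


# Hypothetical rule table mirroring the two rules from the message.
RULES = {
    ("new", "york"): ["new_york"],
    ("new", "york", "city"): ["new_york_city"],
}

print(apply_synonyms("new york city".split(), RULES))
print(apply_synonyms("new york hotels".split(), RULES))
```

Under this model the three-word rule consumes all of "new york city", so the shorter "new york" rule never gets a chance to fire on the same tokens.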
--
Maciej Dziardziel
fied...@gmail.com
--
Sent from: https://lucen
o
>>>>> change the field. But now, when we index new documents, Solr complains:
>>>>> 400 Bad Request
>>>>> Error: 'Exception writing document id someid to the index; possible
>>>>> analysis error: cannot change field "auth_country_code" from index
>>>>> options=DOCS to inconsistent index
>> options=DOCS_AND_FREQS_AND_POSITIONS',
>>>>>
>>>>> Since we are only making query time changes, I would really like to not
>>>>> have to reindex our entire collection. Is that possible somehow?
>>>>>
>>>>> Thanks,
>>>>> Bjarke
>>>>>
>>>>>
>>>>> >>>> sortMissingLast="true" positionIncrementGap="100">
>>>>> >>>> synonyms="country-synonyms.txt" ignoreCase="false" expand="true"/>
> 400 Bad Request
> >>> Error: 'Exception writing document id someid to the index; possible
> >>> analysis error: cannot change field "auth_country_code" from index
> >>> options=DOCS to inconsistent index
> options=DOCS_AND_FREQS_AND_POSITIONS',
> >>>
> >>> Since we are only making query time changes, I would really like to not
> >>> have to reindex our entire collection. Is that possible somehow?
> >>>
> >>> Thanks,
> >>> Bjarke
> >>>
> >>>
> >>> >>> sortMissingLast="true" positionIncrementGap="100">
> >>>>>> synonyms="country-synonyms.txt" ignoreCase="false" expand="true"/>
>>>
>>> Since we are only making query time changes, I would really like to not
>>> have to reindex our entire collection. Is that possible somehow?
>>>
>>> Thanks,
>>> Bjarke
>>>
>>>
>>> >> sortMissingLast="true" positionIncrementGap="100">
>>> >> synonyms="country-synonyms.txt" ignoreCase="false" expand="true"/>
from index
> > options=DOCS to inconsistent index options=DOCS_AND_FREQS_AND_POSITIONS',
> >
> > Since we are only making query time changes, I would really like to not
> > have to reindex our entire collection. Is that possible somehow?
> >
> > Thanks,
> > B
AND_POSITIONS',
>
> Since we are only making query time changes, I would really like to not
> have to reindex our entire collection. Is that possible somehow?
>
> Thanks,
> Bjarke
>
>
> sortMissingLast="true" positionIncrementGap="100">
> synonyms="country-synonyms.txt" ignoreCase="false" expand="true"/>
We have a Solr field of type "string".
It turns out that we need to do synonym expansion on query time in order to
account for some changes over time in the values stored in that field.
So we have tried introducing a custom fieldType that applies the synonym
filter at query time only (see bottom of
Hi Erick,
Is there any way I can get it to match documents containing at least one of the
words of the original query? i.e. 'frozen' or 'dinner' or both. (But not
partial matches of the synonyms)
Thanks, Sunil
-Original Message-
From: Erick Erickson
To: solr-use
This is not a phrase query, rather it’s requiring either pair of words
to appear in the title.
You’ve told it that “frozen dinner” and “microwave foods” are synonyms.
So it’s looking for both the words “microwave” and “foods” in the title field,
or “frozen” and “dinner” in the title field
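The boolean shape described above (all words of one alternative, in any order, anywhere in the field) can be sketched in Python. This is a toy model of the explanation, not Solr's query parser; `matches` and `GROUPS` are hypothetical names.

```python
def matches(title, synonym_groups):
    """True if every word of at least one synonym alternative occurs in the title."""
    words = set(title.lower().split())
    return any(all(word in words for word in group) for group in synonym_groups)


# The two alternatives named in the explanation above.
GROUPS = [["frozen", "dinner"], ["microwave", "foods"]]

print(matches("Frozen chicken dinner", GROUPS))   # both words present, not adjacent
print(matches("Frozen peas", GROUPS))             # only one word of the pair
```

The first title matches even though "frozen" and "dinner" are not adjacent, which is the difference from a phrase query; the second does not match because only one word of the pair is present.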
ion in order
> to support multi-word synonyms (I am using Solr 7.6). Here is my field
> configuration:
>
>synonyms="synonyms.txt"/>
>
> And this is my synonyms.txt file:
> f
I have enabled the SynonymGraphFilter in my field configuration in order to
support multi-word synonyms (I am using Solr 7.6). Here is my field
configuration:
And this is my synonyms.txt file:
frozen dinner,microwave food
Scenario 1: blue shirt
Ah, I didn’t read thoroughly enough. The problem is stopwords don’t really
count for fuzzy searching. By specifying “junk~” you’re not really searching
for “junk” or variants. You’re telling Solr “find any term that is a fuzzy
match” to “junk”. Under the covers, a search is being made for “jank
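To make the point about fuzzy matching concrete, here is a small Python sketch of plain Levenshtein edit distance. It is only an illustration: Lucene's fuzzy queries use their own automaton-based implementation (and by default also count transpositions), and the default maximum of 2 edits for a bare `~` is stated here from general Solr knowledge, not from this thread.

```python
def edit_distance(a, b):
    """Plain Levenshtein distance (insert, delete, substitute), row by row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(
                prev[j] + 1,               # delete ca
                cur[j - 1] + 1,            # insert cb
                prev[j - 1] + (ca != cb),  # substitute, or keep if equal
            ))
        prev = cur
    return prev[-1]


MAX_EDITS = 2  # assumed default for "term~" with no explicit distance

for candidate in ["junk", "jank", "jack", "jacket"]:
    d = edit_distance("junk", candidate)
    print(candidate, d, d <= MAX_EDITS)
```

"jack" is two substitutions away from "junk", so it falls inside the fuzzy window regardless of whether "junk" itself is on the stopword list.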
Thanks for your reply Erick.
I created a simple field type as below for testing and added 'junk' to the
stopwords but it doesn't seem to honor it when using fuzzy search
Btw, I am using qf along with edismax and pass the value in q (sample query
below).
/solr/collection1/select?qf=title_autoCompl
ield=productId&group.ngroups=true&facet=on&facet.field=categoryFilter&sort=defaultMarketingSequence%20asc&q=junk~
>
> ignoreCase="true"/>
>
> synony
Hi,
Is there a way to use stopwords and fuzzy match in a SOLR query?
The query below matches 'jack' too, and I added 'junk' to the stopwords (in
query) to avoid returning results, but it looks like it's not honoring the
stopwords when using the fuzzy search.
solr/collection1/select?app-qf=title_autoCo
For a search like "1969 shirt" I would like to return items with either
1969 or 1960s but boost 1969 items higher. For the query "1960s shirt",
1960s and 1960, 1961, ... 1969 should all match equally.
Is there a standard technique for this? I'm struggling to do this with
eDisMax without adding new
synonym's file from ZK, which
is what the system edits when a user changes some synonyms.
I understand that a Collection needs to be reloaded if the synonyms were to
be used at indexation time, but this is not my case.
The managed API is on the same situation, basically it does what I am doing
On 12/29/2018 5:55 AM, Simón de Frosterus Pokrzywnicki wrote:
The problem is that when the user changes the synonyms, it automatically
triggers a sequential reload of all the Collections.
What exactly is being done when you say "the user changes the
synonyms"? Just uploading a ne
Hello,
I have a solrcloud setup with multiple Collections based on the same
configset.
One of the features I have is that the user can define their own synonyms
in order to improve their search experience which has worked fine until
recently.
Lately the platform has grown and the user has
On Wed, Dec 26, 2018 at 09:09:02PM -0800, Erick Erickson wrote:
> bq. However multiword synonyms are only compatible with queryTime synonyms
> expansion.
>
> Why do you say this? What version of Solr? Query-time multi-word
> synonyms were _added_, but AFAIK the capability of mult
bq. However multiword synonyms are only compatible with queryTime synonyms
expansion.
Why do you say this? What version of Solr? Query-time multi-word
synonyms were _added_, but
AFAIK the capability of multi-word synonyms was not taken away. Or are
you saying that
MLT doesn't play nice at all
Hi
It turns out that MoreLikeThis handler does not use queryTime synonyms
expansion.
It is only compatible with indexTime synonyms.
However multiword synonyms are only compatible with queryTime synonyms
expansion.
For this reason this does not allow the use of multiword synonyms within
used, and then get replaced
by Solr code.
Many thanks,
Markus
-Original message-
> From:Markus Jelsma
> Sent: Thursday 22nd November 2018 15:39
> To: solr-user@lucene.apache.org; solr-user
> Subject: RE: KeywordRepeat, stemming, (single term) synonyms and minimum
>
On 11/28/2018 6:37 AM, Vincenzo D'Amore wrote:
Very likely I'm late to this party :) not sure with solr standalone, but
with solrcloud (7.3.1) you have to reload the core every time synonyms
referenced by a schema are changed.
I have a 7.5.0 download on my workstation, so I fir
Very likely I'm late to this party :) not sure with solr standalone, but
with solrcloud (7.3.1) you have to reload the core every time synonyms
referenced by a schema are changed.
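For reference, a reload can be requested over HTTP. The Python sketch below only builds the request URLs and does not send anything; the host, core name, and collection name are placeholders, and the two endpoints shown are the CoreAdmin and Collections API reload actions as I understand them, so check them against the reference guide for your version.

```python
from urllib.parse import urlencode

# Placeholder base URL; adjust to your installation.
BASE_URL = "http://localhost:8983/solr"


def core_reload_url(core):
    """CoreAdmin reload (standalone Solr): /admin/cores?action=RELOAD&core=..."""
    return f"{BASE_URL}/admin/cores?{urlencode({'action': 'RELOAD', 'core': core})}"


def collection_reload_url(collection):
    """Collections API reload (SolrCloud): /admin/collections?action=RELOAD&name=..."""
    return f"{BASE_URL}/admin/collections?{urlencode({'action': 'RELOAD', 'name': collection})}"


print(core_reload_url("mycore"))
print(collection_reload_url("mycollection"))
```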
On Mon, Nov 26, 2018 at 8:51 PM Walter Underwood
wrote:
> Should be easy to check with the analysis U
Should be easy to check with the analysis UI. Add a synonym and see if it is
used.
I seem to remember some work on reloading synonyms on the fly without a core
reload. These seem related...
https://issues.apache.org/jira/browse/SOLR-5200
https://issues.apache.org/jira/browse/SOLR-5234
wunder
I know that changes to the schema require a reload. But do changes to
files referenced by a schema also require a reload? So if for instance
I were to change the contents of a synonym file, would I need to reload
the core before Solr would use the new file? Synonyms in this case are
at
kus Jelsma
> Sent: Sunday 18th November 2018 23:21
> To: solr-user@lucene.apache.org; solr-user
> Subject: RE: KeywordRepeat, stemming, (single term) synonyms and minimum
> should match (edismax)
>
> Hello,
>
> Apologies for bothering you all again, but i really need
arkus
-Original message-
> From:Markus Jelsma
> Sent: Tuesday 13th November 2018 9:52
> To: solr-user
> Subject: KeywordRepeat, stemming, (single term) synonyms and minimum should
> match (edismax)
>
> Hello, apologies for this long winded e-mail.
>
> O
Hello, apologies for this long winded e-mail.
Our fields have KeywordRepeat and language specific filters such as a stemmer,
the final filter at query-time is SynonymGraph. We do not use
RemoveDuplicatesFilter for those of you wondering why when you see the parsed
queries below, this is due to
Synonyms in Solr are really a kind of "programmers" tool, useful for
mapping terms to other terms. This need not correspond to linguistic
notions of a synonym or hypernymy/hyponymy.
That being said, there's probably half a dozen approaches for doing these
kinds of taxonomical r
Hi
Does Solr provide a way to describe synonym relationships such as
"equivalent to", "narrower than", "broader than"?
It turns out both postgres and oracle do, but I can't find any related
information in the documentation.
This is useful to allow generalizi
You can have the synonyms text file in the same config folder as the rest
of your files like solrconfig.xml that you will push to Solr Cloud.
When you push the config files to Solr Cloud, the synonyms text file will be
pushed to Solr Cloud together with them.
In your solrconfig.xml, you will need to add the
Hi All,
How can we add a synonyms text file to solr cloud. I have a text file with
comma separated synonyms.
Thanks & Regards
Piyush Rathor
Consultant
Deloitte Digital (Salesforce.com / Force.com)
Deloitte Consulting Pvt. Ltd.
Office: +1 (615) 209 4980
Mobile : +1 (302) 397 1491
te:
> Hi Roy, I think you miss the autoGeneratePhraseQueries=true in the field
> type definition.
> I was on a slightly different use case when I met your same issue (I was
> using synonyms expansion at query time) and honestly I didn't understand
> why this is not the default and
Hi Roy, I think you miss the autoGeneratePhraseQueries=true in the field
type definition.
I was on a slightly different use case when I met your same issue (I was
using synonyms expansion at query time) and honestly I didn't understand
why this is not the default and implicit behavior. In
I am not using edismax (eventually I would like to get there) but I'm just
testing with standard query right now. Original posting:
I'm trying to figure out why the multi-word synonym expansion is not
working correctly (or, at least what I'm misunderstanding). Specifically,
when I test a standar
Also share your fieldType settings for myfield as well from your schema
On Wed, Aug 15, 2018 at 8:00 PM Doug Turnbull <
dturnb...@opensourceconnections.com> wrote:
> Aside from the screenshot issue, one thing to check: are you searching
> with defType=edismax ?
>
> As in
> q=lcd&qf=myfield&sow=fa
Aside from the screenshot issue, one thing to check: are you searching
with defType=edismax ?
As in
q=lcd&qf=myfield&sow=false&defType=edismax
?
Also sow=false should be the default on Solr 7 and above
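A quick way to reproduce that request from a script is sketched below in Python. Only the parameter names and values come from this thread (`debug=true` is the flag requested elsewhere on the list); the base URL and collection name are placeholders, and the code just builds the URL without sending it.

```python
from urllib.parse import urlencode


def build_select_url(base_url, collection, params):
    """Build a /select URL with properly encoded query parameters."""
    return f"{base_url}/{collection}/select?{urlencode(params)}"


params = {
    "q": "lcd",
    "qf": "myfield",
    "sow": "false",        # do not split on whitespace before analysis
    "defType": "edismax",
    "debug": "true",       # returns the parsed query for inspection
}

# Placeholder host and collection name.
url = build_select_url("http://localhost:8983/solr", "mycollection", params)
print(url)
```

Looking at the parsed query in the debug output is the most direct way to see whether the multi-word synonym was applied as a unit.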
Doug
On Wed, Aug 15, 2018 at 6:27 PM Roy Lim wrote:
> I'm trying to figure out why the m
Yes please. That way we’ll see the whole thing.
--
Steve
www.lucidworks.com
> On Aug 15, 2018, at 7:20 PM, Roy Lim wrote:
>
> I've subscribed, shall I re-post it then via email?
>
>
>
> --
> Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html
I've subscribed, shall I re-post it then via email?
--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html
Roy,
Not sure of the point of Nabble when it strips content before passing messages
on to the mailing list. I’ve emailed them about this problem in the past but
they have done nothing about it.
Updating a post on Nabble will never make it to the mailing list. If you want
us to be able to rea
Thanks, updated original post. It just removed what I surrounded with the
raw text markup, I've added it back without markup. Not sure of the point
of raw text if it's always removed
--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html
The mail server strips pretty much all screenshots and attachments, so
I think some of the data you're trying to provide is missing from the
e-mail.
Best,
Erick
On Wed, Aug 15, 2018 at 3:27 PM, Roy Lim wrote:
> I'm trying to figure out why the multi-word synonym expansion is not working
> correc
I'm trying to figure out why the multi-word synonym expansion is not working
correctly. Specifically, when I test a standard query with Solr Admin it is
still splitting on whitespace.
Here is my setup:
- Solr 7.2.1
- synonym LCD => liquid crystal display
- q=myfield:LCD
- added: sow=false
- myfie
I THINK this might be a bug? I've had troubles with how the Solr Managed
Synonym endpoint handles URL encoding of synonyms. It seems to be
impossible to delete a synonym which has a forward slash in it.
I have a synonym with a key of "Hot/Cold Pack" (that's the key that sho
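The encoding side of this can be checked in isolation. The Python sketch below shows how the key from this message is percent-encoded with and without treating "/" as safe. The collection name, the resource name `english`, and the path layout are assumptions for illustration, and whether Solr then accepts an encoded slash in the path is exactly the open question here, so this does not claim to fix the problem.

```python
from urllib.parse import quote

key = "Hot/Cold Pack"

# Default: "/" is treated as safe and left alone, so it reads as a path separator.
default_encoded = quote(key)

# safe="" forces the slash to be encoded as %2F as well.
fully_encoded = quote(key, safe="")

# Hypothetical collection and managed resource name.
path = f"/solr/mycollection/schema/analysis/synonyms/english/{fully_encoded}"

print(default_encoded)
print(fully_encoded)
print(path)
```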
Vicenzo,
Thank you for the tip. I restarted Solr and it worked.
-Ennio
--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html
Have you reloaded the core (or restarted Solr) after the change in the synonyms
file?
Ciao,
Vincenzo
--
mobile: 3498513251
skype: free.dev
> On 17 Jul 2018, at 20:04, ennio wrote:
>
> No not using SolrCloud.
>
>
>
> --
> Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html
No not using SolrCloud.
--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html
>
> ignoreCase="true"
> words="lang/stopwords_en.txt"
> />
>
>
> protected="protwords.txt"/>
>
>
>
>
> synonyms="
Erick,
I'm invoking the synonym at query time.
Here is my fieldType definition.
--
Sent from: http://lucene.472066.n3.nabble.com/Sol
Hi Ennio,
could you please share:
* your configuration (specifically the field type declaration in your
schema)
* the query (please add debug=true) and the corresponding query response
Best,
Andrea
On 17/07/18 17:35, Ennio Bozzetti wrote:
I'm trying to get my synonyms to work, bu
before indexing and/or querying?
The admin UI>>collection>>analysis page is very helpful.
Best,
Erick
On Tue, Jul 17, 2018 at 8:35 AM, Ennio Bozzetti wrote:
> I'm trying to get my synonyms to work, but for this one keyword I cannot get
> it to work.
>
> I added the
I'm trying to get my synonyms to work, but for this one keyword I cannot get it
to work.
I added the following to my synonyms file.
fiber,fibre
But when I search for fiber or fibre it does not work.
Fiber is the American English spelling and Fibre is the British English
spelling.
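For readers unsure what a line such as `fiber,fibre` means, here is a toy Python parser for the two common line shapes in a Solr synonyms file: comma-separated equivalents and explicit `=>` mappings. It is a sketch for illustration only. It ignores `ignoreCase`, escaping, and the `expand` option, and `parse_synonym_line` is a made-up helper, not part of Solr.

```python
def parse_synonym_line(line):
    """Return {source term: [replacement terms]} for one synonyms-file line."""
    line = line.strip()
    if not line or line.startswith("#"):
        return {}  # blank line or comment
    if "=>" in line:
        # Explicit mapping: every left-hand term maps to the right-hand terms.
        left, right = line.split("=>", 1)
        targets = [t.strip() for t in right.split(",") if t.strip()]
        return {s.strip(): targets for s in left.split(",") if s.strip()}
    # Equivalent terms: each term expands to the whole group.
    terms = [t.strip() for t in line.split(",") if t.strip()]
    return {t: terms for t in terms}


print(parse_synonym_line("fiber,fibre"))
print(parse_synonym_line("LCD => liquid crystal display"))
```

If the Analysis page shows no such expansion for the field, the usual suspects raised in this thread are a core that was not reloaded after the file changed, or a filter order that alters the term before the synonym filter sees it.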
My
Thanks for the solution, it's working fine for me.
I did the same configuration but missed the
tokenizerFactory="solr.KeywordTokenizerFactory" in the filter tag. That's
great.
--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html
at 10:14 PM howed wrote:
> Finally got back to looking at this, and found that the solution was to
> switch to the unified
> <
> https://lucene.apache.org/solr/guide/7_2/highlighting.html#choosing-a-highlighter>
>
> highlighter which doesn't seem to have the same
Finally got back to looking at this, and found that the solution was to
switch to the unified
<https://lucene.apache.org/solr/guide/7_2/highlighting.html#choosing-a-highlighter>
highlighter which doesn't seem to have the same problem with my complex
synonyms. This required some
Thank you,
Elizabeth
On Wed, Apr 18, 2018 at 12:38 PM, Elizabeth Haubert <
ehaub...@opensourceconnections.com> wrote:
> I'm seeing pf and pf3 clauses fail to generate in long queries containing
> synonyms. Wondering if anyone else has run into this, or if it needs to be
>
I'm seeing pf and pf3 clauses fail to generate in long queries containing
synonyms. Wondering if anyone else has run into this, or if it needs to be
submitted as a bug in Jira. It is a showstopper problem for the current
project, as the pf and pf3 were pretty heavily tuned.
Using Solr 7.1
David
Yes, highlighting is tricky, especially with synonyms. Sorry, I would need to
see a bit more of your config before saying more about it.
Thanks -- Rick
--
Sorry for being brief. Alternate email is rickleir at yahoo dot com
he text "19
some st lcx__balmoral__cannum__clear_lake__lower_norton vic 3456" into a Solr
field that has our custom synonym filter. This then causes the synonyms for
the locality "cannum" to be generated, and if we search for "19 some st
balmoral" we will still get a match on the locality component of t
David
When you have "lcx__balmoral__cannum__clear_lake__lower_norton" in a field,
would you search for *cannum* ? That might not perform well.
Why not have a multivalue field for this information?
It could be that you have a good reason for this, and I just do not understand.
Cheers -- Rick
--
Hi,
Is it possible to specify the synonyms file as a variable, set a default
synonym file and passing the file name from the request? If so, is there an
example of this?
Such as,
Thanks,
Roopa
I see okay, thank you.
On Wed, Feb 14, 2018 at 10:34 AM, Alessandro Benedetti wrote:
> I see,
> According to what I know it is not possible to run for the same field
> different query time analysis.
>
> Not sure if anyone was working on that.
>
> Regards
>
>
>
> -
> ---
> Alessan
I see,
According to what I know it is not possible to run for the same field
different query time analysis.
Not sure if anyone was working on that.
Regards
-
---
Alessandro Benedetti
Search Consultant, R&D Software Engineer, Director
Sease Ltd. - www.sease.io
--
Sent from: http
need not be a copy field which holds data, since we use it only
for getting the analysis.
Get the synonyms for the user query as tokens.
Create an edismax query based on the query tokens.
Return the score
This custom parser would be called in LTR as a scalar feature.
I am at the stage I
name, field to run the analysis on. This field
need not be a copy field which holds data, since we use it only
for getting the analysis.
Get the synonyms for the user query as tokens.
Create an edismax query based on the query tokens.
Return the score
This custom parser would be called in LTR
"I can go with the "title" field and have that include the synonyms in
analysis. Only problem is that the number of fields and number of synonyms
files are quite a lot (~ 8 synonyms files) due to different weightage and
type of expansion (exact vs partial) based on these. Hence
Thank you, Alessandro,
I was trying these options before replying.
Yes, I am looking to generate a score for a query with synonym expansion
(not binary feature)
I can go with the "title" field and have that include the synonyms in
analysis. Only problem is that the number of fields
In the end a feature will just be a numerical value.
How do you plan to use synonyms in a field to generate a numerical feature ?
Are you planning to define a binary feature for a field, in case there is a
match on the synonyms ?
Or a feature which contains a score for a query ( with synonyms
Steve,
According to your comment, I made this test :
1/ put the SynonymGraphFilterFactory after the StopFilterFactory in the
query-time analysis chain
2/ remove the stop word in the synonyms file
om, olympique marseille
The parsed query
me_text_gp:olympiqu +name_text_gp:de
> +name_text_gp:marseil) name_text_gp:om)))
>
> (btw my stop list only has “de” on it)
>
> Thanks,
>
> --
> Steve
> www.lucidworks.com
>
> > On Feb 10, 2018, at 2:12 AM, Dominique Bejean
> wrote:
> >
> > Hi,
i,
>
> More info.
>
> When I test the analysis for the field type the synonyms are correctly
> expanded for both expressions
>
> om maillot
> maillot om
> olympique de marseille maillot
> maillot olympique de marseille
>
> resulting outputs always include the
Hi,
More info.
When I test the analysis for the field type the synonyms are correctly
expanded for both expressions
om maillot
maillot om
olympique de marseille maillot
maillot olympique de marseille
resulting outputs always include the following terms (obviously not always
in the same order