[ https://issues.apache.org/jira/browse/SOLR-14436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Atin updated SOLR-14436:
------------------------
Priority: Major (was: Minor)
> When using Synonym Graph Filter, Solr does not tokenize query-string if it
> has multi-word synonym
> -------------------------------------------------------------------------------------------------
>
> Key: SOLR-14436
> URL: https://issues.apache.org/jira/browse/SOLR-14436
> Project: Solr
> Issue Type: Bug
> Security Level: Public(Default Security Level. Issues are Public)
> Components: query, Schema and Analysis
> Affects Versions: 8.3.1
> Reporter: Atin
> Priority: Major
> Attachments: Scenario1.png, Scenario2.png
>
>
>
> When the Synonym Graph Filter is used and the query string contains a
> multi-word synonym, Solr treats that multi-word synonym as a single term and
> does not tokenize it further.
>
> For example, *soap powder* is a search *query* that is also a _multi-word
> synonym_ in the synonym file:
> {quote}s(104254535,1,'soap powder',n,1,1).
> s(104254535,2,'built-soap powder',n,1,0).
> s(104254535,3,'washing powder',n,1,0).{quote}
>
> There are 2 documents; _soap_ occurs in both (2) and _powder_ in only one (1):
> doc1: "Sunny Berlin breast tumors soap powder"
> doc2: "She is in soap Berlin today"
>
>
> +Scenario 1 (screenshot attached)+
> *without* Synonym Graph Filter => 2 docs returned, because *"soap"* and
> *"powder"* are matched separately.
>
> +Scenario 2 (screenshot attached)+
> *with* Synonym Graph Filter => only 1 doc returned, but 2 were expected. Here
> only *"soap powder"* is matched; it is not tokenized into "soap" and "powder"
> and searched further.
>
> Is it possible to expand the query string *soap powder* as:
> Synonym(soap powder) + Synonym(soap) + Synonym(powder)
>
> Thank You.
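
The expansion requested above can be illustrated with a small Python sketch. This is not Solr code and does not reflect how the Synonym Graph Filter works internally. It only models the desired behavior on the two example documents. The names SYNONYMS, DOCS, expand_query, matches and search are hypothetical, and the synonym table is a simplified stand-in for the WordNet-format entries in the report.

```python
# Hypothetical, simplified synonym table based on the entries in the report.
SYNONYMS = {
    "soap powder": ["built-soap powder", "washing powder"],
}

# The two example documents from the report.
DOCS = {
    "doc1": "Sunny Berlin breast tumors soap powder",
    "doc2": "She is in soap Berlin today",
}


def expand_query(query, synonyms):
    """Return the phrase, each of its tokens, and the synonyms of all of them."""
    query = " ".join(query.lower().split())
    tokens = query.split()
    # Synonym(soap powder) + Synonym(soap) + Synonym(powder)
    candidates = [query] + tokens if len(tokens) > 1 else [query]
    expanded = []
    for term in candidates:
        for variant in [term] + synonyms.get(term, []):
            if variant not in expanded:
                expanded.append(variant)
    return expanded


def matches(text, terms):
    """True if any term occurs in the text on whitespace boundaries."""
    padded = " " + " ".join(text.lower().split()) + " "
    return any(" " + term + " " in padded for term in terms)


def search(query, docs=DOCS, synonyms=SYNONYMS):
    """Return the ids of documents matching any expanded term."""
    terms = expand_query(query, synonyms)
    return [doc_id for doc_id, text in docs.items() if matches(text, terms)]


if __name__ == "__main__":
    print(expand_query("soap powder", SYNONYMS))
    print(search("soap powder"))
```

With this expansion both example documents match, which is the result expected in Scenario 2. Matching only the phrase and its synonyms would leave doc2 out.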