Re: SynonimGraphFilter expands wrong synonims

2018-09-07 Thread Andrea Gazzarini
And as you probably already checked, inserting the proper *tokenizerFactory* also expands the right synonym line: q = (body:"Cytosolic 5'-nucleotidase II"  OR body:"EC 3.1.3.5") parsedQuery = SpanOrQuery(spanOr([body:p49902, spanNear([body:cytosol, body:purin, body:5, body:nucleotidas], 0, tru

Re: SynonimGraphFilter expands wrong synonims

2018-09-05 Thread Andrea Gazzarini
You're right, my answer forgot to mention the *tokenizerFactory* parameter that you can add in the filter declaration. But, contrary to what you think, the default tokenizer used for parsing the synonyms _is not_ the tokenizer of the current analyzer (StandardTokenizer in your example) but
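
The tokenizer mismatch described in this message can be illustrated with a small sketch. It rests on two assumptions that are not taken from the thread: that the synonyms file is split with a whitespace-style tokenizer, and that the query text goes through something resembling StandardTokenizer. The function `standard_like_tokens` is a hypothetical rough approximation written for this example, not Lucene's actual implementation.

```python
import re

def whitespace_tokens(text):
    # Assumption: synonym entries are split on whitespace only,
    # so punctuation such as ' and - stays inside the token.
    return text.lower().split()

def standard_like_tokens(text):
    # Hypothetical approximation of a StandardTokenizer-style split:
    # keep runs of letters/digits, and keep dotted numbers such as 3.1.3.5 together.
    return re.findall(r"[a-z0-9]+(?:\.[0-9]+)*", text.lower())

synonym_entry = "Cytosolic 5'-nucleotidase II"

from_synonym_file = whitespace_tokens(synonym_entry)
from_query = standard_like_tokens(synonym_entry)

# The two token sequences differ, so a synonym rule built from the first
# cannot be matched by tokens produced the second way.
print(from_synonym_file)
print(from_query)
print(from_synonym_file == from_query)
```

If both sides used the same tokenization, the two sequences would be identical, which is the effect that setting *tokenizerFactory* on the filter is meant to achieve.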

Re: SynonimGraphFilter expands wrong synonims

2018-09-05 Thread Danilo Tomasoni
Hi Andrea, thank you for your answer. About the second question: the StandardTokenizer should also be applied to the phrase query, so the ' and - symbols should be removed there too, and this should allow a match in the synonym file, shouldn't it? With an example: in phrase query: "Cytosolic

Re: SynonimGraphFilter expands wrong synonims

2018-09-05 Thread Andrea Gazzarini
Hi Danilo, let's see if this can help you (I'm sorry for the poor debugging, I'm reading & writing from my mobile): the first issue should have something to do with synonym overlapping and, since I'm very curious about what is happening, I will be more precise when I am in front of a lap

SynonimGraphFilter expands wrong synonims

2018-09-05 Thread Danilo Tomasoni
Hello all, I have an issue related to SynonymGraphFilter expanding the wrong synonyms for a phrase term at query time. I have a dictionary with the following lines: P49902,Cytosolic purine 5'-nucleotidase,EC 3.1.3.5,Cytosolic 5'-nucleotidase II A8K9N1,Glucosidase\, beta\, acid 3,Cytosolic,G