Dobry den Mikhail,
So I searched for "banka" which means "bank" at my language. Below is
highlighted fragments of a matched document. You can see from mark tags that
"Bankalar", "banka", "bankaya", "bankalar" terms exist in document,
"highlighting":{
"/var/www/vhosts/deneme.biz/httpdocs/kho3/ibb/files/7d-2000-4267.htm":{
"content":["Anlamda <mark>Bankalar</mark> Arası Mevduat Sayılamayacağı )
\n\n • TÜRKİYE'DEKİ BİR <mark>BANKANIN</mark> YURTDIŞINDAKİ
<mark>BANKAYA</mark> PARA YATIRMASI ( <mark>Banka</mark> ve Sigorta ",
"anlamında kurulmuş bir <mark>banka</mark> olarak değerlendirilmesine
ve davacı <mark>Banka</mark> tarafından yurt dışındaki <mark>bankaya</mark>
yatırılan mevduatın da <mark>bankalar</mark> arası mevduat ",
"anlamında kurulmuş bir <mark>banka</mark> olarak değerlendirilmesine
ve davacı <mark>Banka</mark> tarafından yurt dışındaki <mark>bankaya</mark>
yatırılan mevduatın da <mark>bankalar</mark> arası mevduat "]},
Below is debug-explain part of the response about the same document, how or
where should I read the variations matched term "banka" ? ("Bankalar",
"bankaya" ....)
"explain":{
"/var/www/vhosts/deneme.biz/httpdocs/kho3/ibb/files/7d-2000-4267.htm":{
"match":true,
"value":2.6295655,
"description":"max of:",
"details":[{
"match":true,
"value":2.6295655,
"description":"weight(content:banka in 7179) [SchemaSimilarity],
result of:",
"details":[{
"match":true,
"value":2.6295655,
"description":"score(freq=58.0), computed as boost * idf * tf
from:",
"details":[{
"match":true,
"value":2.6807382,
"description":"idf, computed as log(1 + (N - n + 0.5) / (n
+ 0.5)) from:",
"details":[{
"match":true,
"value":3361,
"description":"n, number of documents containing term"},
{
"match":true,
"value":49063,
"description":"N, total number of documents with
field"}]},
{
"match":true,
"value":0.980911,
"description":"tf, computed as freq / (freq + k1 * (1 - b +
b * dl / avgdl)) from:",
"details":[{
"match":true,
"value":58.0,
"description":"freq, occurrences of term within
document"},
{
"match":true,
"value":1.2,
"description":"k1, term saturation parameter"},
{
"match":true,
"value":0.75,
"description":"b, length normalization parameter"},
{
"match":true,
"value":664.0,
"description":"dl, length of field (approximate)"},
{
"match":true,
"value":721.1222,
"description":"avgdl, average length of field"}]}]}]}]},
-----Original Message-----
From: Mikhail Khludnev [mailto:[email protected]]
Sent: Wednesday, June 3, 2020 4:39 PM
To: solr-user
Subject: Re: which terms are used at the matched document?
Hi,
debugQuery response contains matched terms as well. It's just a little bit
hard to read.
On Wed, Jun 3, 2020 at 3:55 PM Serkan KAZANCI <[email protected]> wrote:
> Hi,
>
>
>
> Is it possible to retrieve the terms that are used to match the document?
> (Keyword term itself, stemmed versions of term, term matched from
> synonyms.txt)
>
>
>
> Example: search keyword "heaven"
>
>
>
> Found in document1 via "heavens" and "heaven", found in document2 via
> "heavenly" , found in document3 via "paradise" (because of synonyms.txt)
>
>
>
> I looked into debug mode but I believe it returns information about the
> ranking calculation.
>
>
>
> Thanks,
>
>
>
> Serkan
>
>
>
>
>
>
>
>
>
>
--
Sincerely yours
Mikhail Khludnev