RE: which terms are used at the matched document?

Serkan KAZANCI Wed, 03 Jun 2020 07:15:17 -0700

Dobry den Mikhail,

So I searched for "banka" which means "bank" at my language. Below is 
highlighted fragments of a matched document. You can see from mark tags that 
"Bankalar", "banka", "bankaya", "bankalar" terms exist in document,



"highlighting":{
    "/var/www/vhosts/deneme.biz/httpdocs/kho3/ibb/files/7d-2000-4267.htm":{
      "content":["Anlamda <mark>Bankalar</mark> Arası Mevduat Sayılamayacağı ) 
\n\n • TÜRKİYE&#x27;DEKİ BİR <mark>BANKANIN</mark> YURTDIŞINDAKİ 
<mark>BANKAYA</mark> PARA YATIRMASI ( <mark>Banka</mark> ve Sigorta ",
        "anlamında kurulmuş bir <mark>banka</mark> olarak değerlendirilmesine 
ve davacı <mark>Banka</mark> tarafından yurt dışındaki <mark>bankaya</mark> 
yatırılan mevduatın da <mark>bankalar</mark> arası mevduat ",
        "anlamında kurulmuş bir <mark>banka</mark> olarak değerlendirilmesine 
ve davacı <mark>Banka</mark> tarafından yurt dışındaki <mark>bankaya</mark> 
yatırılan mevduatın da <mark>bankalar</mark> arası mevduat "]},



Below is debug-explain part of the response about the same document, how or 
where should I read the variations matched term "banka" ? ("Bankalar", 
"bankaya" ....)


"explain":{
      "/var/www/vhosts/deneme.biz/httpdocs/kho3/ibb/files/7d-2000-4267.htm":{
        "match":true,
        "value":2.6295655,
        "description":"max of:",
        "details":[{
            "match":true,
            "value":2.6295655,
            "description":"weight(content:banka in 7179) [SchemaSimilarity], 
result of:",
            "details":[{
                "match":true,
                "value":2.6295655,
                "description":"score(freq=58.0), computed as boost * idf * tf 
from:",
                "details":[{
                    "match":true,
                    "value":2.6807382,
                    "description":"idf, computed as log(1 + (N - n + 0.5) / (n 
+ 0.5)) from:",
                    "details":[{
                        "match":true,
                        "value":3361,
                        "description":"n, number of documents containing term"},
                      {
                        "match":true,
                        "value":49063,
                        "description":"N, total number of documents with 
field"}]},
                  {
                    "match":true,
                    "value":0.980911,
                    "description":"tf, computed as freq / (freq + k1 * (1 - b + 
b * dl / avgdl)) from:",
                    "details":[{
                        "match":true,
                        "value":58.0,
                        "description":"freq, occurrences of term within 
document"},
                      {
                        "match":true,
                        "value":1.2,
                        "description":"k1, term saturation parameter"},
                      {
                        "match":true,
                        "value":0.75,
                        "description":"b, length normalization parameter"},
                      {
                        "match":true,
                        "value":664.0,
                        "description":"dl, length of field (approximate)"},
                      {
                        "match":true,
                        "value":721.1222,
                        "description":"avgdl, average length of field"}]}]}]}]},


-----Original Message-----
From: Mikhail Khludnev [mailto:[email protected]] 
Sent: Wednesday, June 3, 2020 4:39 PM
To: solr-user
Subject: Re: which terms are used at the matched document?

Hi,
debugQuery response contains matched terms as well. It's just a little bit
hard to read.

On Wed, Jun 3, 2020 at 3:55 PM Serkan KAZANCI <[email protected]> wrote:

> Hi,
>
>
>
> Is it possible to retrieve the terms that are used to match the document?
> (Keyword term itself, stemmed versions of term, term matched from
> synonyms.txt)
>
>
>
> Example:  search keyword "heaven"
>
>
>
> Found in document1 via "heavens" and "heaven", found in document2 via
> "heavenly" , found in document3 via "paradise" (because of synonyms.txt)
>
>
>
> I looked into debug mode but I believe it returns information about the
> ranking calculation.
>
>
>
> Thanks,
>
>
>
> Serkan
>
>
>
>
>
>
>
>
>
>

-- 
Sincerely yours
Mikhail Khludnev

RE: which terms are used at the matched document?

Reply via email to