hi Ahmet,

thank you for the reply.


i'd give it a try with some sample docs tomorrow.


thank you!


Cq








At 2014-03-08 00:52:01,"Ahmet Arslan" <iori...@yahoo.com> wrote:
>
>Hi,
>
>Looks like totaltermfreq (ttf) is equals to collection frequency. 
>Please see other relevancy functions : 
>http://wiki.apache.org/solr/FunctionQuery#Relevance_Functions
>
>Ahmet
>
>On Friday, March 7, 2014 6:38 PM, cqlangyi <cqlan...@163.com> wrote:
>
>hi Ahmet,
>
>thank you, quite clear!!!
>
>so now i could get 'df' via 'LukeRequestHandler', but
>how about 'cf', could i get it somehow?
>
>
>thanks!
>
>Cq
>
>
>
>
>
>
>
>At 2014-03-08 00:06:50,"Ahmet Arslan" <iori...@yahoo.com> wrote:
>>Hi,
>>
>>You already gave examples, using your example documents :
>>
>>>1. "fox jump over the gray dog, fox gone"
>>>2. "fox is a kind of animal, dog also is"
>>>3. "i like red fox"
>>
>>collection frequency : cf('fox') = 4 = tf('fox',d1) + tf('fox',d2) + 
>>tf('fox',d3) = 2 + 1 + 1 = 4
>>document frequency   : df('fox') = 3 = (how many documents contain term fox?)
>>
>>Ahmet
>>
>>
>>
>>
>>
>>On Friday, March 7, 2014 5:25 PM, cqlangyi <cqlan...@163.com> wrote:
>>
>>hi Ahmet,
>>
>>thank you very much for the reply.
>>
>>i'm just a little bit confused about "collection frequency" & "document 
>>frequency",
>>would you mind help me out with these 2 phrases?
>>
>>thank you!
>>
>>
>>Cq
>>
>>
>>
>>
>>
>>
>>
>>At 2014-03-07 22:43:34,"Ahmet Arslan" <iori...@yahoo.com> wrote:
>>>Hi,
>>>
>>>Thats collection frequency (cf). TermsComponent could be modified to report 
>>>cf instead of document frequency(df).
>>>
>>>Ahmet
>>>
>>>
>>>
>>>On Friday, March 7, 2014 10:49 AM, cqlangyi <cqlan...@163.com> wrote:
>>>hi there,
>>>
>>>
>>>i have a question with following example.
>>>
>>>
>>>say i have only 3 documents indexd,
>>>
>>>
>>>1. "fox jump over the gray dog, fox gone"
>>>2. "fox is a kind of animal, dog also is"
>>>3. "i like red fox"
>>>
>>>
>>>with query "http://localhost/solr/admin/luke?fl=myfield&numTerms=5
>>>solr give back the "top terms" as:
>>>fox    3
>>>dog   2
>>>...
>>>
>>>
>>>i guess Luke request handler means there's 3 documents have the term “fox", 
>>>2 documents have the term "dog"
>>>but what want to have is:
>>>fox   4        ----- which means "fox" totally appeared 4 times in all 
>>>document
>>>dog 2        ----- which means "dog" appeared 2 times totally.
>>>
>>>
>>>so is there any methods to do that?
>>>
>>>
>>>thanks a lot
>>>
>>>
>>>Cq
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>At 2014-03-07 16:12:17,solr-user-h...@lucene.apache.org wrote:
>>>>Hi! This is the ezmlm program. I'm managing the
>>>>solr-user@lucene.apache.org mailing list.
>>>>
>>>>I'm working for my owner, who can be reached
>>>>at solr-user-ow...@lucene.apache.org.
>>>>
>>>>Acknowledgment: I have added the address
>>>>
>>>>  cqlan...@163.com
>>>>
>>>>to the solr-user mailing list.
>>>>
>>>>Welcome to solr-user@lucene.apache.org!
>>>>
>>>>Please save this message so that you know the address you are
>>>>subscribed under, in case you later want to unsubscribe or change your
>>>>subscription address.
>>>>
>>>>
>>>>--- Administrative commands for the solr-user list ---
>>>>
>>>>I can handle administrative requests automatically. Please
>>>>do not send them to the list address! Instead, send
>>>>your message to the correct command address:
>>>>
>>>>To subscribe to the list, send a message to:
>>>>   <solr-user-subscr...@lucene.apache.org>
>>>>
>>>>To remove your address from the list, send a message to:
>>>>   <solr-user-unsubscr...@lucene.apache.org>
>>>>
>>>>Send mail to the following for info and FAQ for this list:
>>>>   <solr-user-i...@lucene.apache.org>
>>>>   <solr-user-...@lucene.apache.org>
>>>>
>>>>Similar addresses exist for the digest list:
>>>>   <solr-user-digest-subscr...@lucene.apache.org>
>>>>   <solr-user-digest-unsubscr...@lucene.apache.org>
>>>>
>>>>To get messages 123 through 145 (a maximum of 100 per request), mail:
>>>>   <solr-user-get.123_...@lucene.apache.org>
>>>>
>>>>To get an index with subject and author for messages 123-456 , mail:
>>>>   <solr-user-index.123_...@lucene.apache.org>
>>>>
>>>>They are always returned as sets of 100, max 2000 per request,
>>>>so you'll actually get 100-499.
>>>>
>>>>To receive all messages with the same subject as message 12345,
>>>>send a short message to:
>>>>   <solr-user-thread.12...@lucene.apache.org>
>>>>
>>>>The messages should contain one line or word of text to avoid being
>>>>treated as sp@m, but I will ignore their content.
>>>>Only the ADDRESS you send to is important.
>>>>
>>>>You can start a subscription for an alternate address,
>>>>for example "john@host.domain", just add a hyphen and your
>>>>address (with '=' instead of '@') after the command word:
>>>><solr-user-subscribe-john=host.dom...@lucene.apache.org>
>>>>
>>>>To stop subscription for this address, mail:
>>>><solr-user-unsubscribe-john=host.dom...@lucene.apache.org>
>>>>
>>>>In both cases, I'll send a confirmation message to that address. When
>>>>you receive it, simply reply to it to complete your subscription.
>>>>
>>>>If despite following these instructions, you do not get the
>>>>desired results, please contact my owner at
>>>>solr-user-ow...@lucene.apache.org. Please be patient, my owner is a
>>>>lot slower than I am ;-)
>>>>
>>>>--- Enclosed is a copy of the request I received.
>>>>
>>>>Return-Path: <cqlan...@163.com>
>>>>Received: (qmail 15386 invoked by uid 99); 7 Mar 2014 08:12:16 -0000
>>>>Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136)
>>>>    by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 07 Mar 2014 08:12:16 +0000
>>>>X-ASF-Spam-Status: No, hits=4.9 required=5.0
>>>>    tests=HTML_MESSAGE,RCVD_IN_PSBL,SPF_PASS
>>>>X-Spam-Check-By: apache.org
>>>>Received-SPF: pass (athena.apache.org: domain of cqlan...@163.com 
>>>>designates 220.181.13.59 as permitted sender)
>>>>Received: from [220.181.13.59] (HELO m13-59.163.com) (220.181.13.59)
>>>>    by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 07 Mar 2014 08:12:10 +0000
>>>>DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=163.com;
>>>>    s=s110527; h=Date:From:Subject:MIME-Version:Message-ID; bh=KIKmb
>>>>    puxu1huGSa5A5RUYvBKNt2RimeBgObxnp/l7gM=; b=N9yyj5qhfT8TXAwfhcRlY
>>>>    mjX4dgzti8JvVtAoO2k69n0r6alQMYT2HiOlNtjTL2XXTiJqreBx4LW07HvP5qIK
>>>>    GRbHPusNhK0s2edW9nRzffFZELJ+wfKwOpB/WLNHQXZqlAKyGP3w5civwG+rprB0
>>>>    vaXbO9dYxInWKc80ZIU5Hc=
>>>>Received: from cqlangyi$163.com ( [222.129.238.198] ) by
>>>> ajax-webmail-wmsvr59 (Coremail) ; Fri, 7 Mar 2014 16:11:45 +0800 (CST)
>>>>X-Originating-IP: [222.129.238.198]
>>>>Date: Fri, 7 Mar 2014 16:11:45 +0800 (CST)
>>>>From: cqlangyi  <cqlan...@163.com>
>>>>To: 
>>>>    
>>>> solr-user-sc.1394177943.kmfejmmdgfggfaeokajb-cqlangyi=163....@lucene.apache.org
>>>>Subject: Re:confirm subscribe to solr-user@lucene.apache.org
>>>>X-Priority: 3
>>>>X-Mailer: Coremail Webmail Server Version SP_ntes V3.5 build
>>>> 20131204(24406.5820.5783) Copyright (c) 2002-2014 www.mailtech.cn 163com
>>>>In-Reply-To: <1394177943.74586.ez...@lucene.apache.org>
>>>>References: <1394177943.74586.ez...@lucene.apache.org>
>>>>X-CM-CTRLDATA: 2T34YmZvb3Rlcl9odG09OTE2NDo4MQ==
>>>>Content-Type: multipart/alternative; 
>>>>    boundary="----=_Part_174263_595565442.1394179905833"
>>>>MIME-Version: 1.0
>>>>Message-ID: <77b43682.ba9b.1449b991929.coremail.cqlan...@163.com>
>>>>X-CM-TRANSID:O8GowADX389DfxlTrCkLAA--.29605W
>>>>X-CM-SenderInfo: pftot0xj1lqiywtou0bp/1tbiGBtHvFEAKkKiBQABsn
>>>>X-Coremail-Antispam: 1U5529EdanIXcx71UUUUU7vcSsGvfC2KfnxnUU==
>>>>X-Virus-Checked: Checked by ClamAV on apache.org
>>>>

Reply via email to