hi Ahmet,
thank you for the reply. i'd give it a try with some sample docs tomorrow. thank you! Cq At 2014-03-08 00:52:01,"Ahmet Arslan" <[email protected]> wrote: > >Hi, > >Looks like totaltermfreq (ttf) is equals to collection frequency. >Please see other relevancy functions : >http://wiki.apache.org/solr/FunctionQuery#Relevance_Functions > >Ahmet > >On Friday, March 7, 2014 6:38 PM, cqlangyi <[email protected]> wrote: > >hi Ahmet, > >thank you, quite clear!!! > >so now i could get 'df' via 'LukeRequestHandler', but >how about 'cf', could i get it somehow? > > >thanks! > >Cq > > > > > > > >At 2014-03-08 00:06:50,"Ahmet Arslan" <[email protected]> wrote: >>Hi, >> >>You already gave examples, using your example documents : >> >>>1. "fox jump over the gray dog, fox gone" >>>2. "fox is a kind of animal, dog also is" >>>3. "i like red fox" >> >>collection frequency : cf('fox') = 4 = tf('fox',d1) + tf('fox',d2) + >>tf('fox',d3) = 2 + 1 + 1 = 4 >>document frequency : df('fox') = 3 = (how many documents contain term fox?) >> >>Ahmet >> >> >> >> >> >>On Friday, March 7, 2014 5:25 PM, cqlangyi <[email protected]> wrote: >> >>hi Ahmet, >> >>thank you very much for the reply. >> >>i'm just a little bit confused about "collection frequency" & "document >>frequency", >>would you mind help me out with these 2 phrases? >> >>thank you! >> >> >>Cq >> >> >> >> >> >> >> >>At 2014-03-07 22:43:34,"Ahmet Arslan" <[email protected]> wrote: >>>Hi, >>> >>>Thats collection frequency (cf). TermsComponent could be modified to report >>>cf instead of document frequency(df). >>> >>>Ahmet >>> >>> >>> >>>On Friday, March 7, 2014 10:49 AM, cqlangyi <[email protected]> wrote: >>>hi there, >>> >>> >>>i have a question with following example. >>> >>> >>>say i have only 3 documents indexd, >>> >>> >>>1. "fox jump over the gray dog, fox gone" >>>2. "fox is a kind of animal, dog also is" >>>3. "i like red fox" >>> >>> >>>with query "http://localhost/solr/admin/luke?fl=myfield&numTerms=5 >>>solr give back the "top terms" as: >>>fox 3 >>>dog 2 >>>... >>> >>> >>>i guess Luke request handler means there's 3 documents have the term “fox", >>>2 documents have the term "dog" >>>but what want to have is: >>>fox 4 ----- which means "fox" totally appeared 4 times in all >>>document >>>dog 2 ----- which means "dog" appeared 2 times totally. >>> >>> >>>so is there any methods to do that? >>> >>> >>>thanks a lot >>> >>> >>>Cq >>> >>> >>> >>> >>> >>> >>> >>> >>> >>> >>>At 2014-03-07 16:12:17,[email protected] wrote: >>>>Hi! This is the ezmlm program. I'm managing the >>>>[email protected] mailing list. >>>> >>>>I'm working for my owner, who can be reached >>>>at [email protected]. >>>> >>>>Acknowledgment: I have added the address >>>> >>>> [email protected] >>>> >>>>to the solr-user mailing list. >>>> >>>>Welcome to [email protected]! >>>> >>>>Please save this message so that you know the address you are >>>>subscribed under, in case you later want to unsubscribe or change your >>>>subscription address. >>>> >>>> >>>>--- Administrative commands for the solr-user list --- >>>> >>>>I can handle administrative requests automatically. Please >>>>do not send them to the list address! Instead, send >>>>your message to the correct command address: >>>> >>>>To subscribe to the list, send a message to: >>>> <[email protected]> >>>> >>>>To remove your address from the list, send a message to: >>>> <[email protected]> >>>> >>>>Send mail to the following for info and FAQ for this list: >>>> <[email protected]> >>>> <[email protected]> >>>> >>>>Similar addresses exist for the digest list: >>>> <[email protected]> >>>> <[email protected]> >>>> >>>>To get messages 123 through 145 (a maximum of 100 per request), mail: >>>> <[email protected]> >>>> >>>>To get an index with subject and author for messages 123-456 , mail: >>>> <[email protected]> >>>> >>>>They are always returned as sets of 100, max 2000 per request, >>>>so you'll actually get 100-499. >>>> >>>>To receive all messages with the same subject as message 12345, >>>>send a short message to: >>>> <[email protected]> >>>> >>>>The messages should contain one line or word of text to avoid being >>>>treated as sp@m, but I will ignore their content. >>>>Only the ADDRESS you send to is important. >>>> >>>>You can start a subscription for an alternate address, >>>>for example "[email protected]", just add a hyphen and your >>>>address (with '=' instead of '@') after the command word: >>>><[email protected]> >>>> >>>>To stop subscription for this address, mail: >>>><[email protected]> >>>> >>>>In both cases, I'll send a confirmation message to that address. When >>>>you receive it, simply reply to it to complete your subscription. >>>> >>>>If despite following these instructions, you do not get the >>>>desired results, please contact my owner at >>>>[email protected]. Please be patient, my owner is a >>>>lot slower than I am ;-) >>>> >>>>--- Enclosed is a copy of the request I received. >>>> >>>>Return-Path: <[email protected]> >>>>Received: (qmail 15386 invoked by uid 99); 7 Mar 2014 08:12:16 -0000 >>>>Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) >>>> by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 07 Mar 2014 08:12:16 +0000 >>>>X-ASF-Spam-Status: No, hits=4.9 required=5.0 >>>> tests=HTML_MESSAGE,RCVD_IN_PSBL,SPF_PASS >>>>X-Spam-Check-By: apache.org >>>>Received-SPF: pass (athena.apache.org: domain of [email protected] >>>>designates 220.181.13.59 as permitted sender) >>>>Received: from [220.181.13.59] (HELO m13-59.163.com) (220.181.13.59) >>>> by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 07 Mar 2014 08:12:10 +0000 >>>>DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=163.com; >>>> s=s110527; h=Date:From:Subject:MIME-Version:Message-ID; bh=KIKmb >>>> puxu1huGSa5A5RUYvBKNt2RimeBgObxnp/l7gM=; b=N9yyj5qhfT8TXAwfhcRlY >>>> mjX4dgzti8JvVtAoO2k69n0r6alQMYT2HiOlNtjTL2XXTiJqreBx4LW07HvP5qIK >>>> GRbHPusNhK0s2edW9nRzffFZELJ+wfKwOpB/WLNHQXZqlAKyGP3w5civwG+rprB0 >>>> vaXbO9dYxInWKc80ZIU5Hc= >>>>Received: from cqlangyi$163.com ( [222.129.238.198] ) by >>>> ajax-webmail-wmsvr59 (Coremail) ; Fri, 7 Mar 2014 16:11:45 +0800 (CST) >>>>X-Originating-IP: [222.129.238.198] >>>>Date: Fri, 7 Mar 2014 16:11:45 +0800 (CST) >>>>From: cqlangyi <[email protected]> >>>>To: >>>> >>>> solr-user-sc.1394177943.kmfejmmdgfggfaeokajb-cqlangyi=163....@lucene.apache.org >>>>Subject: Re:confirm subscribe to [email protected] >>>>X-Priority: 3 >>>>X-Mailer: Coremail Webmail Server Version SP_ntes V3.5 build >>>> 20131204(24406.5820.5783) Copyright (c) 2002-2014 www.mailtech.cn 163com >>>>In-Reply-To: <[email protected]> >>>>References: <[email protected]> >>>>X-CM-CTRLDATA: 2T34YmZvb3Rlcl9odG09OTE2NDo4MQ== >>>>Content-Type: multipart/alternative; >>>> boundary="----=_Part_174263_595565442.1394179905833" >>>>MIME-Version: 1.0 >>>>Message-ID: <[email protected]> >>>>X-CM-TRANSID:O8GowADX389DfxlTrCkLAA--.29605W >>>>X-CM-SenderInfo: pftot0xj1lqiywtou0bp/1tbiGBtHvFEAKkKiBQABsn >>>>X-Coremail-Antispam: 1U5529EdanIXcx71UUUUU7vcSsGvfC2KfnxnUU== >>>>X-Virus-Checked: Checked by ClamAV on apache.org >>>>
