hi Ahmet,
thank you for the reply. i'll give it a try with some sample docs tomorrow.

thank you!

Cq

At 2014-03-08 00:52:01, "Ahmet Arslan" <iori...@yahoo.com> wrote:
>
>Hi,
>
>Looks like totaltermfreq (ttf) is equal to collection frequency.
>Please see other relevancy functions:
>http://wiki.apache.org/solr/FunctionQuery#Relevance_Functions
>
>Ahmet
>
>On Friday, March 7, 2014 6:38 PM, cqlangyi <cqlan...@163.com> wrote:
>
>hi Ahmet,
>
>thank you, quite clear!!!
>
>so now i can get 'df' via 'LukeRequestHandler', but
>how about 'cf', can i get it somehow?
>
>thanks!
>
>Cq
>
>At 2014-03-08 00:06:50, "Ahmet Arslan" <iori...@yahoo.com> wrote:
>>Hi,
>>
>>You already gave examples, using your example documents:
>>
>>>1. "fox jump over the gray dog, fox gone"
>>>2. "fox is a kind of animal, dog also is"
>>>3. "i like red fox"
>>
>>collection frequency: cf('fox') = 4 = tf('fox',d1) + tf('fox',d2) +
>>tf('fox',d3) = 2 + 1 + 1 = 4
>>document frequency: df('fox') = 3 (how many documents contain the term fox?)
>>
>>Ahmet
>>
>>On Friday, March 7, 2014 5:25 PM, cqlangyi <cqlan...@163.com> wrote:
>>
>>hi Ahmet,
>>
>>thank you very much for the reply.
>>
>>i'm just a little bit confused about "collection frequency" & "document
>>frequency",
>>would you mind helping me out with these 2 phrases?
>>
>>thank you!
>>
>>Cq
>>
>>At 2014-03-07 22:43:34, "Ahmet Arslan" <iori...@yahoo.com> wrote:
>>>Hi,
>>>
>>>That's collection frequency (cf). TermsComponent could be modified to report
>>>cf instead of document frequency (df).
>>>
>>>Ahmet
>>>
>>>On Friday, March 7, 2014 10:49 AM, cqlangyi <cqlan...@163.com> wrote:
>>>hi there,
>>>
>>>i have a question with the following example.
>>>
>>>say i have only 3 documents indexed,
>>>
>>>1. "fox jump over the gray dog, fox gone"
>>>2. "fox is a kind of animal, dog also is"
>>>3. "i like red fox"
>>>
>>>with the query "http://localhost/solr/admin/luke?fl=myfield&numTerms=5"
>>>solr gives back the "top terms" as:
>>>fox 3
>>>dog 2
>>>...
>>>
>>>i guess the Luke request handler means there are 3 documents that have the term "fox",
>>>and 2 documents that have the term "dog",
>>>but what i want to have is:
>>>fox 4 ----- which means "fox" appeared 4 times in total across all documents
>>>dog 2 ----- which means "dog" appeared 2 times in total.
>>>
>>>so is there any method to do that?
>>>
>>>thanks a lot
>>>
>>>Cq
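
For reference, the df/cf arithmetic from Ahmet's explanation can be checked outside Solr with a small script. This is only a sketch under stated assumptions: the `tokenize` helper is a hypothetical, naive lowercase word splitter, not Solr's analyzer chain, so counts on a real index may differ depending on the field type.

```python
import re
from collections import Counter

# The three example documents from the thread.
DOCS = [
    "fox jump over the gray dog, fox gone",
    "fox is a kind of animal, dog also is",
    "i like red fox",
]


def tokenize(text):
    """Naive lowercase word split (assumption: stands in for the Solr analyzer)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def frequencies(docs):
    """Return (df, cf) counters for a list of document strings.

    df[term] = number of documents that contain the term at least once
    cf[term] = total number of occurrences of the term across all documents
    """
    df = Counter()
    cf = Counter()
    for doc in docs:
        tokens = tokenize(doc)
        cf.update(tokens)       # every occurrence counts toward cf
        df.update(set(tokens))  # each document counts at most once toward df
    return df, cf


df, cf = frequencies(DOCS)
for term in ("fox", "dog"):
    print(term, "df =", df[term], "cf =", cf[term])
```

For 'fox' this gives df = 3 and cf = 4, matching the numbers above. On a live index, the totaltermfreq (ttf) function from the Relevance Functions page Ahmet linked should report the cf figure, assuming the field is analyzed in a comparable way.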