hi Ahmet,
thank you, quite clear!!! so now i could get 'df' via 'LukeRequestHandler', but how about 'cf', could i get it somehow? thanks! Cq At 2014-03-08 00:06:50,"Ahmet Arslan" <iori...@yahoo.com> wrote: >Hi, > >You already gave examples, using your example documents : > >>1. "fox jump over the gray dog, fox gone" >>2. "fox is a kind of animal, dog also is" >>3. "i like red fox" > >collection frequency : cf('fox') = 4 = tf('fox',d1) + tf('fox',d2) + >tf('fox',d3) = 2 + 1 + 1 = 4 >document frequency : df('fox') = 3 = (how many documents contain term fox?) > >Ahmet > > > > > >On Friday, March 7, 2014 5:25 PM, cqlangyi <cqlan...@163.com> wrote: > >hi Ahmet, > >thank you very much for the reply. > >i'm just a little bit confused about "collection frequency" & "document >frequency", >would you mind help me out with these 2 phrases? > >thank you! > > >Cq > > > > > > > >At 2014-03-07 22:43:34,"Ahmet Arslan" <iori...@yahoo.com> wrote: >>Hi, >> >>Thats collection frequency (cf). TermsComponent could be modified to report >>cf instead of document frequency(df). >> >>Ahmet >> >> >> >>On Friday, March 7, 2014 10:49 AM, cqlangyi <cqlan...@163.com> wrote: >>hi there, >> >> >>i have a question with following example. >> >> >>say i have only 3 documents indexd, >> >> >>1. "fox jump over the gray dog, fox gone" >>2. "fox is a kind of animal, dog also is" >>3. "i like red fox" >> >> >>with query "http://localhost/solr/admin/luke?fl=myfield&numTerms=5 >>solr give back the "top terms" as: >>fox 3 >>dog 2 >>... >> >> >>i guess Luke request handler means there's 3 documents have the term “fox", 2 >>documents have the term "dog" >>but what want to have is: >>fox 4 ----- which means "fox" totally appeared 4 times in all >>document >>dog 2 ----- which means "dog" appeared 2 times totally. >> >> >>so is there any methods to do that? >> >> >>thanks a lot >> >> >>Cq >> >> >> >> >> >> >> >> >> >> >>At 2014-03-07 16:12:17,solr-user-h...@lucene.apache.org wrote: >>>Hi! This is the ezmlm program. I'm managing the >>>solr-user@lucene.apache.org mailing list. >>> >>>I'm working for my owner, who can be reached >>>at solr-user-ow...@lucene.apache.org. >>> >>>Acknowledgment: I have added the address >>> >>> cqlan...@163.com >>> >>>to the solr-user mailing list. >>> >>>Welcome to solr-user@lucene.apache.org! >>> >>>Please save this message so that you know the address you are >>>subscribed under, in case you later want to unsubscribe or change your >>>subscription address. >>> >>> >>>--- Administrative commands for the solr-user list --- >>> >>>I can handle administrative requests automatically. Please >>>do not send them to the list address! Instead, send >>>your message to the correct command address: >>> >>>To subscribe to the list, send a message to: >>> <solr-user-subscr...@lucene.apache.org> >>> >>>To remove your address from the list, send a message to: >>> <solr-user-unsubscr...@lucene.apache.org> >>> >>>Send mail to the following for info and FAQ for this list: >>> <solr-user-i...@lucene.apache.org> >>> <solr-user-...@lucene.apache.org> >>> >>>Similar addresses exist for the digest list: >>> <solr-user-digest-subscr...@lucene.apache.org> >>> <solr-user-digest-unsubscr...@lucene.apache.org> >>> >>>To get messages 123 through 145 (a maximum of 100 per request), mail: >>> <solr-user-get.123_...@lucene.apache.org> >>> >>>To get an index with subject and author for messages 123-456 , mail: >>> <solr-user-index.123_...@lucene.apache.org> >>> >>>They are always returned as sets of 100, max 2000 per request, >>>so you'll actually get 100-499. >>> >>>To receive all messages with the same subject as message 12345, >>>send a short message to: >>> <solr-user-thread.12...@lucene.apache.org> >>> >>>The messages should contain one line or word of text to avoid being >>>treated as sp@m, but I will ignore their content. >>>Only the ADDRESS you send to is important. >>> >>>You can start a subscription for an alternate address, >>>for example "john@host.domain", just add a hyphen and your >>>address (with '=' instead of '@') after the command word: >>><solr-user-subscribe-john=host.dom...@lucene.apache.org> >>> >>>To stop subscription for this address, mail: >>><solr-user-unsubscribe-john=host.dom...@lucene.apache.org> >>> >>>In both cases, I'll send a confirmation message to that address. When >>>you receive it, simply reply to it to complete your subscription. >>> >>>If despite following these instructions, you do not get the >>>desired results, please contact my owner at >>>solr-user-ow...@lucene.apache.org. Please be patient, my owner is a >>>lot slower than I am ;-) >>> >>>--- Enclosed is a copy of the request I received. >>> >>>Return-Path: <cqlan...@163.com> >>>Received: (qmail 15386 invoked by uid 99); 7 Mar 2014 08:12:16 -0000 >>>Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) >>> by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 07 Mar 2014 08:12:16 +0000 >>>X-ASF-Spam-Status: No, hits=4.9 required=5.0 >>> tests=HTML_MESSAGE,RCVD_IN_PSBL,SPF_PASS >>>X-Spam-Check-By: apache.org >>>Received-SPF: pass (athena.apache.org: domain of cqlan...@163.com designates >>>220.181.13.59 as permitted sender) >>>Received: from [220.181.13.59] (HELO m13-59.163.com) (220.181.13.59) >>> by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 07 Mar 2014 08:12:10 +0000 >>>DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=163.com; >>> s=s110527; h=Date:From:Subject:MIME-Version:Message-ID; bh=KIKmb >>> puxu1huGSa5A5RUYvBKNt2RimeBgObxnp/l7gM=; b=N9yyj5qhfT8TXAwfhcRlY >>> mjX4dgzti8JvVtAoO2k69n0r6alQMYT2HiOlNtjTL2XXTiJqreBx4LW07HvP5qIK >>> GRbHPusNhK0s2edW9nRzffFZELJ+wfKwOpB/WLNHQXZqlAKyGP3w5civwG+rprB0 >>> vaXbO9dYxInWKc80ZIU5Hc= >>>Received: from cqlangyi$163.com ( [222.129.238.198] ) by >>> ajax-webmail-wmsvr59 (Coremail) ; Fri, 7 Mar 2014 16:11:45 +0800 (CST) >>>X-Originating-IP: [222.129.238.198] >>>Date: Fri, 7 Mar 2014 16:11:45 +0800 (CST) >>>From: cqlangyi <cqlan...@163.com> >>>To: >>> >>> solr-user-sc.1394177943.kmfejmmdgfggfaeokajb-cqlangyi=163....@lucene.apache.org >>>Subject: Re:confirm subscribe to solr-user@lucene.apache.org >>>X-Priority: 3 >>>X-Mailer: Coremail Webmail Server Version SP_ntes V3.5 build >>> 20131204(24406.5820.5783) Copyright (c) 2002-2014 www.mailtech.cn 163com >>>In-Reply-To: <1394177943.74586.ez...@lucene.apache.org> >>>References: <1394177943.74586.ez...@lucene.apache.org> >>>X-CM-CTRLDATA: 2T34YmZvb3Rlcl9odG09OTE2NDo4MQ== >>>Content-Type: multipart/alternative; >>> boundary="----=_Part_174263_595565442.1394179905833" >>>MIME-Version: 1.0 >>>Message-ID: <77b43682.ba9b.1449b991929.coremail.cqlan...@163.com> >>>X-CM-TRANSID:O8GowADX389DfxlTrCkLAA--.29605W >>>X-CM-SenderInfo: pftot0xj1lqiywtou0bp/1tbiGBtHvFEAKkKiBQABsn >>>X-Coremail-Antispam: 1U5529EdanIXcx71UUUUU7vcSsGvfC2KfnxnUU== >>>X-Virus-Checked: Checked by ClamAV on apache.org >>>