Re: term frequency solrj

2017-01-09 Thread Shawn Heisey
On 1/9/2017 6:31 AM, huda barakat wrote: > Can anybody help me, I need to get term frequency for a specific > filed, I use the techproduct example and I use this code: The variable "terms" is null on line 29, which is why you are getting NullPointerException. > query.setRequestHandler("terms");

Re: term frequency solrj

2017-01-09 Thread Mikhail Khludnev
Hello Huda, Try to check this https://github.com/apache/lucene-solr/blob/master/solr/solrj/src/test/org/apache/solr/client/solrj/response/TermsResponseTest.java On Mon, Jan 9, 2017 at 4:31 PM, huda barakat wrote: > Hi, > Can anybody help me, I need to get term frequency for a specific filed,

Re: term frequency with stemming

2015-07-27 Thread Aki Balogh
Hi Alessandro, I'm counting word frequencies on a site. All I want to do is, I want to count "running" and "run" as the same topic. It's not really fuzzy matching I believe -- i.e. I wouldn't want to match "running" and "sprinting". I think stemming should be it.. seems to work fine now.. TY, A

Re: term frequency with stemming

2015-07-27 Thread Alessandro Benedetti
A part the funny "crypted" message by Darin xD I would like to focus on the initial user requirement : "get term frequencies with fuzzy matching" Solr/Lucene offer you the support for fuzzy query independently of the way you token filter your terms at analysis time. You can run fuzzy queries with

Re: term frequency with stemming

2015-07-25 Thread Aki Balogh
I believe I found a solution: use a third-party stemmer to stem the term first, then pass it to termfreq. The only trick is, each term in a phrase has to be stemmed separately (i.e. "end-user experience" has to be broken down into "end-user" -> "end-us" and "experience" -> "experi") before being p

Re: term frequency with stemming

2015-07-24 Thread Darin Amos
Hi Dale, I would think the coffee shop is better, I have in-laws visiting at home. Thanks Darin > On Jul 24, 2015, at 12:04 PM, Aki Balogh wrote: > > Hi All, > > I'm using TermVectorComponent and stemming (Porter) in order to get term > frequencies with fuzzy matching. I'm stemming at index

Re: Term Frequency Calculation - Clarification

2015-05-20 Thread ariya bala
Please ignore. On Wed, May 20, 2015 at 2:45 PM, ariya bala wrote: > Thanks Jack. > In my case there is only one document - Foo Foo is in bar > As per your comment, I should expect TF to be 2. > But I am getting one. > Is there any check where if one match is a subset of other, is calculated > o

Re: Term Frequency Calculation - Clarification

2015-05-20 Thread ariya bala
Thanks Jack. In my case there is only one document - Foo Foo is in bar As per your comment, I should expect TF to be 2. But I am getting one. Is there any check where if one match is a subset of other, is calculated once? My class extends DefaultSimilarity. Cheers Ariya Bala S On Wed, May 20, 201

Re: Term Frequency Calculation - Clarification

2015-05-20 Thread Jack Krupansky
Yes. tf is both 1 and 2 - tf is per document, which is 1 for the first document and 2 for the second document. See: http://lucene.apache.org/core/5_1_0/core/org/apache/lucene/search/similarities/TFIDFSimilarity.html -- Jack Krupansky On Wed, May 20, 2015 at 6:13 AM, ariya bala wrote: > Hi, >

Re: term frequency outweighs exact phrase match

2012-04-13 Thread alxsss
0&indent=on&qt=search&debugQuery=true Thanks. Alex. -Original Message- From: Chris Hostetter To: solr-user Sent: Thu, Apr 12, 2012 7:43 pm Subject: Re: term frequency outweighs exact phrase match : I use solr 3.5 with edismax. I have the following issue with phrase : search.

Re: term frequency outweighs exact phrase match

2012-04-12 Thread Chris Hostetter
: I use solr 3.5 with edismax. I have the following issue with phrase : search. For example if I have three documents with content like : : 1.apache apache : 2. solr solr : 3.apache solr : : then search for apache solr displays documents in the order 1,.2,3 : instead of 3, 2, 1 because term fr

Re: term frequency outweighs exact phrase match

2012-04-12 Thread alxsss
. I see omitTermFreqAndPositions="true" but not sure if it is the setting I need, because its description is too vague. Thanks. Alex. -Original Message- From: Erick Erickson To: solr-user Sent: Wed, Apr 11, 2012 8:23 am Subject: Re: term frequency outweighs exact ph

Re: term frequency outweighs exact phrase match

2012-04-11 Thread Erick Erickson
Consider boosting on phrase with a SHOULD clause, something like field:"apache solr"^2.. Best Erick On Tue, Apr 10, 2012 at 12:46 PM, wrote: > Hello, > > I use solr 3.5 with edismax. I have the following issue with phrase search. > For example if I have three documents with content like > > 1

Re: Term frequency question

2011-11-07 Thread Chris Hostetter
: ./NoLengthNormAndTfSimilarity.java:7: error: lengthNorm(String,int) in : NoLengthNormAndTfSimilarity cannot override lengthNorm(String,int) in : Similarity : public float lengthNorm(String fieldName, int numTerms) { : ^ : overridden method is final : 1 error : - : What am I doing

Re: Term frequency across multiple documents

2011-01-13 Thread Ahmet Arslan
So you are interested in collection frequency of words. TermsComponent gives you document frequency of terms. You can modify it to give collection frequency info. http://search-lucene.com/m/of5Fn1PUOHU/ --- On Wed, 1/12/11, Juan Grande wrote: > From: Juan Grande > Subject: Re: Term fre

Re: Term frequency across multiple documents

2011-01-12 Thread Juan Grande
Maybe there is a better solution, but I think that you can solve this problem using facets. You will get the number of documents where each term appears. Also, you can filter a specific set of terms by entering a query like "+field:term1 OR +field:term2 OR ...", or using the facet.query parameter.

Re: term frequency vector access?

2010-02-11 Thread Andrzej Bialecki
On 2010-02-11 17:04, Mike Perham wrote: In an UpdateRequestProcessor (processing an AddUpdateCommand), I have a SolrInputDocument with a field 'content' that has termVectors="true" in schema.xml. Is it possible to get access to that field's term vector in the URP? No, term vectors are created

Re: term frequency vector access?

2010-02-11 Thread Koji Sekiguchi
Mike Perham wrote: In an UpdateRequestProcessor (processing an AddUpdateCommand), I have a SolrInputDocument with a field 'content' that has termVectors="true" in schema.xml. Is it possible to get access to that field's term vector in the URP? You cannot get term vector info of a document b

Re: Term frequency

2008-03-29 Thread Chris Hostetter
: is there a way to get the term frequency per found result back from Solr ? this info is in the "explain" section of the debugQuery output, see this recent post about a similar question... http://www.nabble.com/Highlight---get-terms-used-by-lucene-to16276184.html#a16323025 -Hoss