On 1/9/2017 6:31 AM, huda barakat wrote:
> Can anybody help me? I need to get the term frequency for a specific
> field. I use the techproducts example and this code:
The variable "terms" is null on line 29, which is why you are getting
NullPointerException.
> query.setRequestHandler("terms");
Hello Huda,
Take a look at this:
https://github.com/apache/lucene-solr/blob/master/solr/solrj/src/test/org/apache/solr/client/solrj/response/TermsResponseTest.java
On Mon, Jan 9, 2017 at 4:31 PM, huda barakat wrote:
> Hi,
> Can anybody help me? I need to get the term frequency for a specific field,
Hi Alessandro,
I'm counting word frequencies on a site. All I want to do is count
"running" and "run" as the same topic.
It's not really fuzzy matching I believe -- i.e. I wouldn't want to match
"running" and "sprinting".
I think stemming should do it. It seems to work fine now.
TY,
A
Apart from the funny "encrypted" message by Darin xD,
I would like to focus on the initial user requirement:
"get term
frequencies with fuzzy matching"
Solr/Lucene support fuzzy queries independently of how you token-filter
your terms at analysis time.
You can run fuzzy queries with
I believe I found a solution: use a third-party stemmer to stem the term
first, then pass it to termfreq.
The only trick is, each term in a phrase has to be stemmed separately (i.e.
"end-user experience" has to be broken down into "end-user" -> "end-us" and
"experience" -> "experi") before being p
Hi Dale,
I would think the coffee shop is better, I have in-laws visiting at home.
Thanks
Darin
> On Jul 24, 2015, at 12:04 PM, Aki Balogh wrote:
>
> Hi All,
>
> I'm using TermVectorComponent and stemming (Porter) in order to get term
> frequencies with fuzzy matching. I'm stemming at index
Please ignore.
On Wed, May 20, 2015 at 2:45 PM, ariya bala wrote:
> Thanks Jack.
> In my case there is only one document - Foo Foo is in bar
> As per your comment, I should expect TF to be 2.
> But I am getting one.
> Is there any check where if one match is a subset of other, is calculated
> o
Thanks Jack.
In my case there is only one document - Foo Foo is in bar
As per your comment, I should expect TF to be 2.
But I am getting one.
Is there a check so that, if one match is a subset of another, it is
counted only once?
My class extends DefaultSimilarity.
Cheers
Ariya Bala S
On Wed, May 20, 201
Yes.
tf is both 1 and 2 - tf is per document, which is 1 for the first document
and 2 for the second document.
See:
http://lucene.apache.org/core/5_1_0/core/org/apache/lucene/search/similarities/TFIDFSimilarity.html
-- Jack Krupansky
On Wed, May 20, 2015 at 6:13 AM, ariya bala wrote:
> Hi,
>
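
To illustrate Jack's point that tf is counted per document, here is a small sketch using raw token counts. The two sample documents are assumptions chosen to give counts of 1 and 2; whitespace tokenization and lowercasing are simplifications, and a Similarity implementation may transform the raw count before it is used in scoring.

```python
from collections import Counter

def term_frequency(text: str, term: str) -> int:
    """Raw count of `term` within a single document's tokens."""
    tokens = text.lower().split()
    return Counter(tokens)[term.lower()]

# Hypothetical documents: the term occurs once in the first, twice in the second.
docs = ["Foo is in bar", "Foo Foo is in bar"]
tfs = [term_frequency(doc, "foo") for doc in docs]
print(tfs)  # [1, 2]
```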
0&indent=on&qt=search&debugQuery=true
Thanks.
Alex.
-----Original Message-----
From: Chris Hostetter
To: solr-user
Sent: Thu, Apr 12, 2012 7:43 pm
Subject: Re: term frequency outweighs exact phrase match
: I use solr 3.5 with edismax. I have the following issue with phrase
: search. For example if I have three documents with content like
:
: 1. apache apache
: 2. solr solr
: 3. apache solr
:
: then a search for apache solr displays documents in the order 1, 2, 3
: instead of 3, 2, 1 because term fr
.
I see omitTermFreqAndPositions="true", but I am not sure if it is the setting I
need, because its description is too vague.
Thanks.
Alex.
-----Original Message-----
From: Erick Erickson
To: solr-user
Sent: Wed, Apr 11, 2012 8:23 am
Subject: Re: term frequency outweighs exact ph
Consider boosting on the phrase with a SHOULD clause, something
like field:"apache solr"^2
Best
Erick
On Tue, Apr 10, 2012 at 12:46 PM, wrote:
> Hello,
>
> I use solr 3.5 with edismax. I have the following issue with phrase search.
> For example if I have three documents with content like
>
> 1
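
Erick's suggestion can be sketched as a small query-string helper. This is only an illustration: the field name "content" is hypothetical, and the added clause is optional (SHOULD) only on the assumption that the default operator is OR.

```python
def with_phrase_boost(user_query: str, field: str, boost: float = 2.0) -> str:
    """Append an optional, boosted phrase clause to the user's query."""
    # Escape backslashes and double quotes so the phrase stays well-formed.
    phrase = user_query.replace("\\", "\\\\").replace('"', '\\"')
    return f'{user_query} {field}:"{phrase}"^{boost:g}'

q = with_phrase_boost("apache solr", "content")
print(q)  # apache solr content:"apache solr"^2
```

Documents that contain the exact phrase match the extra clause and pick up the boost; documents that only repeat one term do not.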
: ./NoLengthNormAndTfSimilarity.java:7: error: lengthNorm(String,int) in
: NoLengthNormAndTfSimilarity cannot override lengthNorm(String,int) in
: Similarity
: public float lengthNorm(String fieldName, int numTerms) {
: ^
: overridden method is final
: 1 error
: -
: What am I doing
So you are interested in the collection frequency of words.
TermsComponent gives you the document frequency of terms. You can modify it to
give collection frequency information. http://search-lucene.com/m/of5Fn1PUOHU/
--- On Wed, 1/12/11, Juan Grande wrote:
> From: Juan Grande
> Subject: Re: Term fre
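
The difference between the two statistics mentioned above can be shown on a toy corpus. The documents below are illustrative assumptions, already tokenized; document frequency counts the documents that contain a term, while collection frequency counts every occurrence across all documents.

```python
from collections import Counter
from typing import Iterable, List

def document_frequency(docs: Iterable[List[str]]) -> Counter:
    """Number of documents in which each term appears at least once."""
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))  # a term counts once per document
    return df

def collection_frequency(docs: Iterable[List[str]]) -> Counter:
    """Total number of occurrences of each term across all documents."""
    cf = Counter()
    for tokens in docs:
        cf.update(tokens)  # every occurrence counts
    return cf

# Hypothetical three-document corpus.
docs = [["apache", "apache"], ["solr", "solr"], ["apache", "solr"]]
df = document_frequency(docs)
cf = collection_frequency(docs)
print(df["apache"], cf["apache"])  # 2 3
```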
Maybe there is a better solution, but I think that you can solve this
problem using facets. You will get the number of documents where each term
appears. Also, you can filter a specific set of terms by entering a query
like "+field:term1 OR +field:term2 OR ...", or using the facet.query
parameter.
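
A rough sketch of the request parameters for the faceting approach above. The field name "content" and the sample terms are assumptions; facet, facet.field, facet.query and rows are standard Solr request parameters, and rows=0 is used here only because the facet counts are the part of interest.

```python
from typing import List, Optional
from urllib.parse import urlencode

def facet_params(field: str, terms: Optional[List[str]] = None) -> str:
    """Build a query string that asks Solr for facet counts on `field`."""
    params = [
        ("q", "*:*"),            # match all documents
        ("rows", "0"),           # only the facet counts are needed
        ("facet", "true"),
        ("facet.field", field),  # document count for every term in the field
    ]
    # One facet.query per term restricts the counts to specific terms.
    for term in terms or []:
        params.append(("facet.query", f"{field}:{term}"))
    return urlencode(params)

qs = facet_params("content", ["apache", "solr"])
print(qs)
```

The resulting string would be appended to the select handler URL of whichever core is being queried.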
On 2010-02-11 17:04, Mike Perham wrote:
In an UpdateRequestProcessor (processing an AddUpdateCommand), I have
a SolrInputDocument with a field 'content' that has termVectors="true"
in schema.xml. Is it possible to get access to that field's term
vector in the URP?
No, term vectors are created
Mike Perham wrote:
In an UpdateRequestProcessor (processing an AddUpdateCommand), I have
a SolrInputDocument with a field 'content' that has termVectors="true"
in schema.xml. Is it possible to get access to that field's term
vector in the URP?
You cannot get term vector info of a document b
: is there a way to get the term frequency per found result back from Solr?
this info is in the "explain" section of the debugQuery output, see this
recent post about a similar question...
http://www.nabble.com/Highlight---get-terms-used-by-lucene-to16276184.html#a16323025
-Hoss
19 matches