Re: Flatten term frequency

2018-11-29 Thread Vincenzo D'Amore
Hi, an update: I've found another alternative while reading the valuable post written by Doug. In short, I should split the field "title" in two fields: title_notf and title_phrase. title_notf is without term frequency (with omitTermFreqAndPositions=true) for matching queries a

Re: Flatten term frequency

2018-11-29 Thread Vincenzo D'Amore
> > > > > > > > > > https://opensourceconnections.com/blog/2014/12/08/title-search-when-relevancy-is-only-skin-deep/ > > > > > > Doug > > > On Thu, Nov 29, 2018 at 7:59 AM Alexandre Rafalovitch < > arafa...@gmail.com> > > > wrote

Re: Flatten term frequency

2018-11-29 Thread Alexandre Rafalovitch
> > > Regards, > > >Alex. > > > On Thu, 29 Nov 2018 at 05:43, Vincenzo D'Amore > > wrote: > > > > > > > > Hi all, > > > > > > > > I have a relevancy problem, I suppose to know a solution for this > > p

Re: Flatten term frequency

2018-11-29 Thread Vincenzo D'Amore
I suppose to know a solution for this > problem > > > but I would like to know if in your experience there is a better one. > > > > > > For example I have two documents which have the "termA" in their field > > > "title", the former has t

Re: Flatten term frequency

2018-11-29 Thread Doug Turnbull
> > "title", the former has the "termA" repeated more times but the latter > has > > the term only once. When searching for "termA" the former has bigger > score > > due to TF/IDF. > > > > Both the documents are fairly similar so I

Re: Flatten term frequency

2018-11-29 Thread Alexandre Rafalovitch
I have two documents which have the "termA" in their field > "title", the former has the "termA" repeated more times but the latter has > the term only once. When searching for "termA" the former has bigger score > due to TF/IDF. > > Both th

Flatten term frequency

2018-11-29 Thread Vincenzo D'Amore
ated more times but the latter has the term only once. When searching for "termA" the former has bigger score due to TF/IDF. Both the documents are fairly similar so I don't want that term frequency in the title boosts the score. The only solution I know to flatten the score when ther

Re: term frequency solrj

2017-01-09 Thread Shawn Heisey
On 1/9/2017 6:31 AM, huda barakat wrote: > Can anybody help me, I need to get term frequency for a specific > field, I use the techproduct example and I use this code: The variable "terms" is null on line 29, which is why you are getting NullPointerException. > query.setRe

Re: term frequency solrj

2017-01-09 Thread Mikhail Khludnev
Hello Huda, Try to check this https://github.com/apache/lucene-solr/blob/master/solr/solrj/src/test/org/apache/solr/client/solrj/response/TermsResponseTest.java On Mon, Jan 9, 2017 at 4:31 PM, huda barakat wrote: > Hi, > Can anybody help me, I need to get term frequency for a specific

term frequency solrj

2017-01-09 Thread huda barakat
Hi, Can anybody help me, I need to get term frequency for a specific field, I use the techproduct example and I use this code: // import java.util.List; import org.apache.solr.client.solrj.SolrClient; import

Re: find documents based on specific term frequency

2015-08-26 Thread Chris Hostetter
: "Is there a way to search for documents that have a word appearing more : than a certain number of times? For example, I want to find documents : that only have more than 10 instances of the word "genetics" …" Try... q=text:genetics&fq={!frange+incl=false+l=10}termfreq('text','genetics') No
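
A minimal sketch of how the request parameters in this reply could be assembled, assuming only the Python standard library. The helper name termfreq_filter_params is hypothetical; the field and term are the ones from the question. The frange filter with incl=false and l=10 keeps documents whose termfreq value is strictly greater than 10.

```python
from urllib.parse import urlencode, parse_qs


def termfreq_filter_params(field, term, min_exclusive):
    """Build Solr request parameters (hypothetical helper) that keep only
    documents whose raw term frequency for `term` in `field` is strictly
    greater than `min_exclusive`."""
    return {
        "q": f"{field}:{term}",
        # {!frange l=N incl=false} filters on the value of the function
        # that follows it; incl=false makes the lower bound exclusive.
        "fq": f"{{!frange incl=false l={min_exclusive}}}termfreq('{field}','{term}')",
    }


params = termfreq_filter_params("text", "genetics", 10)
# urlencode escapes the braces and quotes and turns spaces into '+',
# which matches the '+' separators used in the reply above.
query_string = urlencode(params)
print(query_string)
```

The encoded string can be appended to a select URL; decoding it again should give back the same q and fq values as in the reply.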

find documents based on specific term frequency

2015-08-26 Thread Tang, Rebecca
Hi there, We have an index built on Solr 5.0. We received a user question: "Is there a way to search for documents that have a word appearing more than a certain number of times? For example, I want to find documents that only have more than 10 instances of the word "genetics" …" I'm not sure

Re: term frequency with stemming

2015-07-27 Thread Aki Balogh
. > > This will allow you to run your fuzzy query and leave your index terms as > you want ( without affecting in this way the term frequency) . > > Can you give us more details about your use of stemming ? > Usually stemming is something a little bit different from fuzzy search. >

Re: term frequency with stemming

2015-07-27 Thread Alessandro Benedetti
an run fuzzy queries with the edit distance (by default calculated over a Levenshtein automaton). This will allow you to run your fuzzy query and leave your index terms as you want (without affecting the term frequency in this way). Can you give us more details about your use of stemming?

Re: term frequency with stemming

2015-07-25 Thread Aki Balogh
etmuse.com/> On Fri, Jul 24, 2015 at 12:04 PM, Aki Balogh wrote: > Hi All, > > I'm using TermVectorComponent and stemming (Porter) in order to get term > frequencies with fuzzy matching. I'm stemming at index and query time. > > Is there a way to get term frequency fro

Re: term frequency with stemming

2015-07-24 Thread Darin Amos
tching. I'm stemming at index and query time. > > Is there a way to get term frequency from the index? > * termfreq doesn't support stemming or wildcards > * terms component doesn't allow additional filters > * I could use a copyfield to save a non-stemmed version at index

term frequency with stemming

2015-07-24 Thread Aki Balogh
Hi All, I'm using TermVectorComponent and stemming (Porter) in order to get term frequencies with fuzzy matching. I'm stemming at index and query time. Is there a way to get term frequency from the index? * termfreq doesn't support stemming or wildcards * terms component doesn

Re: Logic on Term Frequency Calculation : Bug or Functionality

2015-05-21 Thread Ahmet Arslan
Hi Ariya, DefaultSimilarity does not use raw term frequency, but instead it uses square root of raw term frequency. If you want to observe raw term frequency information in explain section, I suggest you to play with org.apache.lucene.search.similarities.SimilarityBase and its sub-classes

Logic on Term Frequency Calculation : Bug or Functionality

2015-05-21 Thread ariya bala
Hi, I am puzzled on the Term Frequency Behaviour of the DefaultSimilarity implementation I have suppressed the IDF by setting to 1. TF-IDF would inturn reflect the same value as in Term Frequency Below are the inferences: Red coloured are expected to give a hit count(Term Frequency) of 2 but was

Re: Term Frequency Calculation - Clarification

2015-05-20 Thread ariya bala
--- >> > *Document content:* Foo Foo is in bar >> > *Search query:* Foo bar >> > *slop:* 3 >> > >> > With Slop 3, There are two matches to the query >> > Foo is in bar >> > Foo Foo is in bar >> > >> > *Should the Term Frequency be 1 or 2? Also point to the explanation of >> the >> > logic implemented in Lucene/Solr.* >> > >> > -- >> > Cheers >> > *Ariya * >> > >> > > > > -- > *Ariya * > -- *Ariya *

Re: Term Frequency Calculation - Clarification

2015-05-20 Thread ariya bala
he TF part (achieved by > setting > > IDF=1). > > > > Question is: > > - > > *Document content:* Foo Foo is in bar > > *Search query:* Foo bar > > *slop:* 3 > > > > With Slop 3, There are two matches to the query > > Foo is in bar > > Foo

Re: Term Frequency Calculation - Clarification

2015-05-20 Thread Jack Krupansky
* Foo bar > *slop:* 3 > > With Slop 3, There are two matches to the query > Foo is in bar > Foo Foo is in bar > > *Should the Term Frequency be 1 or 2? Also point to the explanation of the > logic implemented in Lucene/Solr.* > > -- > Cheers > *Ariya * >

Term Frequency Calculation - Clarification

2015-05-20 Thread ariya bala
are two matches to the query Foo is in bar Foo Foo is in bar *Should the Term Frequency be 1 or 2? Also point to the explanation of the logic implemented in Lucene/Solr.* -- Cheers *Ariya *

Re: Can solr TermVectorComponent return term frequency for the term in my query?

2015-02-04 Thread Aki Balogh
Nevermind -- I found I can just add another fq, so I'm not getting the 0s back, which makes it quick to add it up on my end. So the solution is: collection1/query?q=crawl_id:40&fq=text:%22matched%20text%22&fl=termfreq(text,%27matched%20text%27)&rows=100&tv=false Thanks for your help! Akos (A
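
The client-side "add it up on my end" step might look like the sketch below. It assumes a JSON response whose documents carry the termfreq() value under a key equal to the function text from fl; the sample payload and the name sum_termfreq are made up for illustration and are not taken from the thread.

```python
import json


def sum_termfreq(response_json, pseudo_field):
    """Sum one per-document function value over all returned documents.
    `pseudo_field` is the key the value is reported under (assumed here
    to be the function text passed in fl)."""
    docs = json.loads(response_json)["response"]["docs"]
    return sum(int(doc.get(pseudo_field, 0)) for doc in docs)


# Hypothetical response body with three matching documents.
key = "termfreq(text,'matched text')"
sample = json.dumps(
    {"response": {"numFound": 3, "docs": [{key: 2}, {key: 1}, {key: 3}]}}
)

total = sum_termfreq(sample, key)
print(total)
```

Note that the total only covers the documents actually returned, so rows has to be large enough to include every match.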

Re: Can solr TermVectorComponent return term frequency for the term in my query?

2015-02-04 Thread Aki Balogh
PS - I found that termfreq() actually returns the raw tf, i.e. an integer for each document. However, I have to get the request and add them up on my end. Unfortunately totaltermfreq() sums the similarity-modified tf values. Is there a way to just get the sum of the termfreq() values? Akos (Aki

Re: Can solr TermVectorComponent return term frequency for the term in my query?

2015-02-04 Thread Aki Balogh
Is there a way to set solr to only return raw tf (i.e. by maybe turning off the DefaultSimilarity), so I could use ttf() to get the sum of raw tf values? Or do I need to parse each tf value, square it and add them up in post-processing? Thx, Aki On Wed, Feb 4, 2015 at 4:39 PM, Ahmet Arslan wro

Re: Can solr TermVectorComponent return term frequency for the term in my query?

2015-02-04 Thread Ahmet Arslan
Hi, So you want raw tf. tf method implemented as square root of raw tf. So you can re-obtain it by reverse operation. 1.424 * 1.424 = 2.02 = int = 2 Ahmet On Wednesday, February 4, 2015 11:31 PM, Aki Balogh wrote: Hi Ahmet, Thank you for your idea, very helpful. I can indeed get tf value
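
The reverse operation described here can be sketched in a few lines. This assumes, as the reply states, that the reported tf is the square root of the raw count; raw_tf is a hypothetical helper name, and the extra values in the list are made-up examples.

```python
def raw_tf(similarity_tf):
    """Recover the integer term count from a square-root-scaled tf value
    by squaring it and rounding to the nearest integer."""
    return round(similarity_tf ** 2)


# The value from the thread: 1.424 * 1.424 is about 2.03, so the raw count is 2.
dog_count = raw_tf(1.424)

# Summing raw counts over several documents (illustrative values only).
total = sum(raw_tf(v) for v in [1.424, 1.0, 1.732])
print(dog_count, total)
```

Rounding rather than truncating guards against values that land just below the integer after squaring, such as 1.732 squared.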

Re: Can solr TermVectorComponent return term frequency for the term in my query?

2015-02-04 Thread Aki Balogh
Hi Ahmet, Thank you for your idea, very helpful. I can indeed get tf values through the tf and ttf function queries. Since tf uses Similarity, I'm getting back some floats (i.e. "dog occurs 1.424 times"), when I was expecting ints. Is there a way to get back ints (simple word count)? Thanks, Aki

Re: Can solr TermVectorComponent return term frequency for the term in my query?

2015-02-04 Thread Ahmet Arslan
Hi Aki, How about tf function query? https://cwiki.apache.org/confluence/display/solr/Function+Queries Ahmet On Wednesday, February 4, 2015 7:59 PM, Aki Balogh wrote: I'm using solr TermVectorComponent to get term frequencies for specific terms in a corpus. I.e. I query for "q=dog" and want t

Can solr TermVectorComponent return term frequency for the term in my query?

2015-02-04 Thread Aki Balogh
I'm using solr TermVectorComponent to get term frequencies for specific terms in a corpus. I.e. I query for "q=dog" and want to get back term frequencies for "dog" in the corpus. However, when I request term frequencies, I get back ALL term frequencies for ALL matching documents, which is generati

Re: Suggester: weight (term frequency) and 'mm' feasibility (allTermsRequired)

2014-12-30 Thread Boon Low
Now I have some questions: 1. Term frequency, weight/count The suggesters derive suggestions from a field in the index. What’s the feasibility of creating a custom dictionary that can automatically populate the weight/count field using term frequency (tf) during build time? Autosuggest in most

Suggester: weight (term frequency) and 'mm' feasibility

2014-12-10 Thread Boon Low
. Intrigued with the "finite-state machines" in the prefix/fuzzy suggesters too. Can’t wait to load test this properly. Now I have some questions: 1. Term frequency, weight/count The suggesters derive suggestions from a field in the index. What’s the feasibility of creating a custom dicti

Re: Total term frequency in solr includes deleted documents

2014-10-28 Thread Alexandre Rafalovitch
AM, nutchsolruser wrote: >> How can we get exact term frequency with excluding deleted documents term >> frequency, and that is without optimization because optimization is >> expensive in our case ? >> Is there any other way we can get term frequency for entire collection in >>

Re: Total term frequency in solr includes deleted documents

2014-10-28 Thread Shawn Heisey
On 10/28/2014 7:16 AM, nutchsolruser wrote: > How can we get exact term frequency with excluding deleted documents term > frequency, and that is without optimization because optimization is > expensive in our case ? > Is there any other way we can get term frequency for entire collecti

Total term frequency in solr includes deleted documents

2014-10-28 Thread nutchsolruser
Currently I am working on getting term frequency (not document frequency) of term in particular field for whole index. For that I am using function query ttf(field_name,'term'), This returns me total occurrences of term in that field. But It seems it is also considering deleted docum

How to get term frequency count grouped by date?

2014-06-13 Thread Dipanjan Kailthya
: 1. Faceting on the date field: I seem to get only document frequencies: http://localhost:8983/solr/select?q=text:the&facet=true&facet.field=date&rows=0 2. Use the term vector component or termfreq() Using this I was only able to get the term frequency counts on a per document bas

Re: get term frequency, just only keywords search

2014-04-26 Thread Jack Krupansky
You need to use a shingle filter at index time so that pairs of adjacent words get indexed as single terms, then you can do a term frequency for the shingled pair of terms ("Research Development" as a single term). Be sure to manually apply any other filters, such as lower case o
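
To illustrate what indexing adjacent word pairs as single terms means, here is a simplified stand-in in plain Python. It is not Solr's shingle filter, only a sketch of the idea: lower-case the text, split on whitespace, join each pair of neighbouring tokens, and count the pairs. The sample sentence is invented.

```python
from collections import Counter


def shingles(text, size=2):
    """Return word n-grams (shingles) of `size` adjacent tokens.
    Lower-casing stands in for the other filters that would have to be
    applied by hand to the query term."""
    tokens = text.lower().split()
    return [" ".join(tokens[i:i + size]) for i in range(len(tokens) - size + 1)]


doc = "Research Development funds research development teams"
counts = Counter(shingles(doc))
print(counts["research development"])
```

With the pair stored as one term, its count plays the role of a term frequency for the two-word keyword.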

Re: get term frequency, just only keywords search

2014-04-26 Thread ksmith
igher count document will display first in list. so how can i sort on that. can you please help me asap. Thanks. -- View this message in context: http://lucene.472066.n3.nabble.com/get-term-frequency-just-only-keywords-search-tp4084510p4133260.html Sent from the Solr - User mailing list archive at Nabble.com.

changing term frequency (tf) at query time

2014-01-15 Thread Karan jindal
Hi all, Can I change the value of term frequency for certain files while at query time? For ex: if there is a scenario in which I want to ignore term frequency of *title* field, can I modify solr/lucene code to always gives term frequency for title as 1. One possible way of doing it can be this

Re: get term frequency, just only keywords search

2013-08-21 Thread Jack Krupansky
es and write the code needed to ferret it out. -- Jack Krupansky -Original Message- From: danielitos85 Sent: Wednesday, August 21, 2013 4:41 AM To: solr-user@lucene.apache.org Subject: Re: get term frequency, just only keywords search Thanks a lot guys, @Jack in my search I use dis

Re: get term frequency, just only keywords search

2013-08-21 Thread danielitos85
suggest? Thanks a lot for support. -- View this message in context: http://lucene.472066.n3.nabble.com/get-term-frequency-just-only-keywords-search-tp4084510p4085831.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: get term frequency, just only keywords search

2013-08-19 Thread Jack Krupansky
The Lucene PhraseQuery goes through a lot of effort to calculate "phrase frequency" (phraseFreq) - but that is not the same as term frequency (don't confuse terms and phrases). Feel free to pick that number out of the debugQuery output, or from the XML variant of the explai

Re: get term frequency, just only keywords search

2013-08-19 Thread Erick Erickson
ery=on", in the explain I obtain termFreq=2.0 and it right. > > Is it possible to obtain that parameter? > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/get-term-frequency-just-only-keywords-search-tp4084510p4085464.html > Sent f

Re: get term frequency, just only keywords search

2013-08-19 Thread danielitos85
OK, I understand it (thanks), but if I search for a sentence and type "debugQuery=on", in the explain output I obtain termFreq=2.0, which is right. Is it possible to obtain that parameter? -- View this message in context: http://lucene.472066.n3.nabble.com/get-term-frequency-just-only-keywo

Re: get term frequency, just only keywords search

2013-08-19 Thread Jack Krupansky
"Term frequency" is about "terms", nothing else. So, by definition, a phrase or any other collection of terms does not have a "termfreq" - in Lucene. -- Jack Krupansky -Original Message- From: danielitos85 Sent: Monday, August 19, 2013 9:59 AM T

Re: get term frequency, just only keywords search

2013-08-19 Thread danielitos85
there isn't a way to get termFreq about a search like "french fries" (sentence)? -- View this message in context: http://lucene.472066.n3.nabble.com/get-term-frequency-just-only-keywords-search-tp4084510p4085454.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: get term frequency, just only keywords search

2013-08-19 Thread Jack Krupansky
user@lucene.apache.org Subject: Re: get term frequency, just only keywords search Thanks Jack, but if my keyword search are two words? for example "french fries" ? how is the right syntax? -- View this message in context: http://lucene.472066.n3.nabble.com/get-term-frequency-just-onl

Re: get term frequency, just only keywords search

2013-08-19 Thread danielitos85
Thanks Jack, but if my keyword search are two words? for example "french fries" ? how is the right syntax? -- View this message in context: http://lucene.472066.n3.nabble.com/get-term-frequency-just-only-keywords-search-tp4084510p4085399.html Sent from the Solr - User mailing list

Re: get term frequency, just only keywords search

2013-08-14 Thread Jack Krupansky
q="pizza"+"tomato"&fl=id,termfreq(myfield,'pizza'),termfreq(myfield,'tomato') -- Jack Krupansky -Original Message- From: danielitos85 Sent: Wednesday, August 14, 2013 1:22 PM To: solr-user@lucene.apache.org Subject: Re: get term frequency, j
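
For more than a couple of keywords, the fl value in this reply could be generated rather than typed by hand. A small sketch, with termfreq_fl as a hypothetical helper name and the field and terms taken from the example above:

```python
def termfreq_fl(field, terms, base_fields=("id",)):
    """Build an fl value that returns the base fields plus one
    termfreq(field,'term') function per keyword."""
    funcs = [f"termfreq({field},'{term}')" for term in terms]
    return ",".join(list(base_fields) + funcs)


fl = termfreq_fl("myfield", ["pizza", "tomato"])
print(fl)
```

Each function is evaluated per document, so the response carries one count per keyword for every returned document.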

Re: get term frequency, just only keywords search

2013-08-14 Thread danielitos85
t: http://lucene.472066.n3.nabble.com/get-term-frequency-just-only-keywords-search-tp4084510p4084643.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: get term frequency, just only keywords search

2013-08-14 Thread Jack Krupansky
- From: danielitos85 Sent: Wednesday, August 14, 2013 12:50 PM To: solr-user@lucene.apache.org Subject: Re: get term frequency, just only keywords search Thanks Jack, I'm tring to use tf function but I don't understand: why he returns a float value and not integer? At the start of th

Re: get term frequency, just only keywords search

2013-08-14 Thread danielitos85
Thanks Jack, I'm trying to use the tf function but I don't understand: why does it return a float value and not an integer? At the start of this topic I explained an example where I used term frequency, but it doesn't work how I need because it returns the term frequency for all the te

Re: get term frequency, just only keywords search

2013-08-14 Thread Jack Krupansky
You can use the termfreq or tf function query in your field list to return the term frequency for a term, like: fl=id,tf(foods,'pizza') -- Jack Krupansky -Original Message- From: danielitos85 Sent: Wednesday, August 14, 2013 5:29 AM To: solr-user@lucene.apache.org Subject

RE: get term frequency, just only keywords search

2013-08-14 Thread danielitos85
Thanks a lot -- View this message in context: http://lucene.472066.n3.nabble.com/SOLVED-get-term-frequency-just-only-keywords-search-tp4084510p4084530.html Sent from the Solr - User mailing list archive at Nabble.com.

RE: get term frequency, just only keywords search

2013-08-14 Thread danielitos85
thanks a lot Markus ;) If I use regex parameter it works -- View this message in context: http://lucene.472066.n3.nabble.com/get-term-frequency-just-only-keywords-search-tp4084510p4084525.html Sent from the Solr - User mailing list archive at Nabble.com.

RE: get term frequency, just only keywords search

2013-08-14 Thread Markus Jelsma
Why? Using terms.limit or a ^term$ regex should limit the response to the exact term right? -Original message- > From:danielitos85 > Sent: Wednesday 14th August 2013 12:20 > To: solr-user@lucene.apache.org > Subject: RE: get term frequency, just only keywords search >

RE: get term frequency, just only keywords search

2013-08-14 Thread danielitos85
66.n3.nabble.com/get-term-frequency-just-only-keywords-search-tp4084510p4084518.html Sent from the Solr - User mailing list archive at Nabble.com.

RE: get term frequency, just only keywords search

2013-08-14 Thread Markus Jelsma
Try the TermsComponent. It will return one or more terms and their counts for a given field only. -Original message- > From:danielitos85 > Sent: Wednesday 14th August 2013 11:30 > To: solr-user@lucene.apache.org > Subject: get term frequency, just only keywords search >

get term frequency, just only keywords search

2013-08-14 Thread danielitos85
ed keywords into field foods and not only for my keyword search ("pizza")? Please, have you any suggests? Thanks in advance -- View this message in context: http://lucene.472066.n3.nabble.com/get-term-frequency-just-only-keywords-search-tp4084510.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Total Term Frequency per ResultSet in Solr 4.3 ?

2013-07-04 Thread Tony Mullins
for the documents in your result set (note > that the rows parameter matters if you want results for the whole set, the > default is 10). TermVectors also must be stored for each field that you > want term frequency returned for. Suppose you have the query > http://localhost:8983/solr

Re: Total Term Frequency per ResultSet in Solr 4.3 ?

2013-07-04 Thread P Williams
stored for each field that you want term frequency returned for. Suppose you have the query http://localhost:8983/solr/collection1/tvrh?q=cable&fl=includes&tv.tf=true on the example that comes packaged with Solr. Then part of the response is: id IW-02 9885A004 1 1 1 2 1 1

Re: Total Term Frequency per ResultSet in Solr 4.3 ?

2013-07-04 Thread Jack Krupansky
k Krupansky -Original Message- From: Tony Mullins Sent: Thursday, July 04, 2013 12:24 PM To: solr-user@lucene.apache.org Subject: Re: Total Term Frequency per ResultSet in Solr 4.3 ? So what is the workaround for this problem ? Can it be done without changing any source code ? Thanks,

Re: Total Term Frequency per ResultSet in Solr 4.3 ?

2013-07-04 Thread Tony Mullins
> The amazing spider man is amazing spider the > > spider > > 1 > > 1439641369145507840 > > > > 2 > > 3 > > > > > > 1 > >1 > > > > > > As you can see facet is actually just returning the no. of docs found > >

Re: Total Term Frequency per ResultSet in Solr 4.3 ?

2013-07-04 Thread Yonik Seeley
actual frequency. > Actual frequency is returned by the field 'amazing_freq' & 'spider_freq' ! > > So is there any workaround for this to get the total of term-frequency in > resultset without any modification to Solr source code ? > > > Thanks, > To

Re: Total Term Frequency per ResultSet in Solr 4.3 ?

2013-07-04 Thread Tony Mullins
ainst those keywords, not the actual frequency. Actual frequency is returned by the fields 'amazing_freq' & 'spider_freq'! So is there any workaround to get the total term frequency in the result set without any modification to the Solr source code? Thanks, Tony On Thu

Re: Total Term Frequency per ResultSet in Solr 4.3 ?

2013-07-04 Thread Yonik Seeley
know the occurrence of that same term in the result set of that 'X2' > search criteria. > > At the moment if I give termfreq(field,term) then it gives me the term > frequency per document and if I use totaltermfreq(field,term), it gives me > the total term frequency in entir

Re: Total Term Frequency per ResultSet in Solr 4.3 ?

2013-07-04 Thread Jack Krupansky
ew feature/improvement. -- Jack Krupansky -Original Message- From: Tony Mullins Sent: Thursday, July 04, 2013 9:45 AM To: solr-user@lucene.apache.org Subject: Total Term Frequency per ResultSet in Solr 4.3 ? Hi , I have lots of crawled data, indexed in my Solr (4.3.0) and lets say user

Total Term Frequency per ResultSet in Solr 4.3 ?

2013-07-04 Thread Tony Mullins
'X2' and he/she wants to know the occurrence of that same term in the result set of that 'X2' search criteria. At the moment if I give termfreq(field,term) then it gives me the term frequency per document and if I use totaltermfreq(field,term), it gives me the total term frequency in

Re: How to re-index Solr & get term frequency within documents

2013-07-03 Thread Otis Gospodnetic
Hi, Try this instead: http://wiki.apache.org/solr/DataImportHandler#SolrEntityProcessor Background info: https://issues.apache.org/jira/browse/SOLR-1499 Otis -- Solr & ElasticSearch Support -- http://sematext.com/ Performance Monitoring -- http://sematext.com/spm On Wed, Jul 3, 2013 at 2:50 AM

Re: How to re-index Solr & get term frequency within documents

2013-07-02 Thread Tony Mullins
Hi Otis, I am quite new to Solr, and I have looked at this link " http://search-lucene.com/jd/solr/solr-dataimporthandler/org/apache/solr/handler/dataimport/SolrEntityProcessor.html" but could not figure out how to use it to re-index all my data in Solr. Could you please explain in little detail t

Re: How to re-index Solr & get term frequency within documents

2013-07-02 Thread Otis Gospodnetic
indexing from scratch from the >> original data sources. >> >> -- Jack Krupansky >> >> -Original Message----- From: Otis Gospodnetic >> Sent: Monday, July 01, 2013 2:26 PM >> To: solr-user@lucene.apache.org >> Subject: Re: How to re-index S

Re: How to re-index Solr & get term frequency within documents

2013-07-01 Thread Tony Mullins
> -Original Message- From: Otis Gospodnetic > Sent: Monday, July 01, 2013 2:26 PM > To: solr-user@lucene.apache.org > Subject: Re: How to re-index Solr & get term frequency within documents > > > If all your fields are stored, you can do it with > http://search-

Re: How to re-index Solr & get term frequency within documents

2013-07-01 Thread Jack Krupansky
solr-user@lucene.apache.org Subject: Re: How to re-index Solr & get term frequency within documents If all your fields are stored, you can do it with http://search-lucene.com/?q=solrentityprocessor Otherwise, just reindex the same way you indexed in the first place. *Always* be ready to reindex from

Re: How to re-index Solr & get term frequency within documents

2013-07-01 Thread Otis Gospodnetic
>> -- Jack Krupansky >> >> -Original Message- From: Tony Mullins >> Sent: Monday, July 01, 2013 10:47 AM >> To: solr-user@lucene.apache.org >> Subject: How to re-index Solr & get term frequency within documents >> >> >> Hi, >> >>

Re: How to re-index Solr & get term frequency within documents

2013-07-01 Thread Tony Mullins
> Sounds like you want "termfreq": > > termfreq(field_arg,term) > > fl=id,a,b,c,termfreq(a,xyz) > > > -- Jack Krupansky > > -Original Message- From: Tony Mullins > Sent: Monday, July 01, 2013 10:47 AM > To: solr-user@lucene.apache.org > Subje

Re: How to re-index Solr & get term frequency within documents

2013-07-01 Thread Jack Krupansky
:47 AM To: solr-user@lucene.apache.org Subject: How to re-index Solr & get term frequency within documents Hi, I am using Solr 4.3.0. If I change my solr's schema.xml then do I need to re-index my solr ? And if yes , how to ? My 2nd question is I need to find the frequency of term per document in a

How to re-index Solr & get term frequency within documents

2013-07-01 Thread Tony Mullins
Hi, I am using Solr 4.3.0. If I change my solr's schema.xml then do I need to re-index my solr ? And if yes , how to ? My 2nd question is I need to find the frequency of term per document in all documents of search result. My field is And I am trying this query http://localhost:8080/solr/se

Re: Disabling tf (term frequency) during indexing and/or scoring

2013-05-13 Thread tasmaniski
This is an old post, now there is a solution in SOLR omitTermFreqAndPositions="true" http://wiki.apache.org/solr/SchemaXml#Data_Types -- View this message in context: http://lucene.472066.n3.nabble.com/Disabling-tf-term-frequency-during-indexing-and-or-scoring-tp502956p4062595

Re: Disable term frequency for some fields in solr

2013-01-16 Thread Amit Jha
It will affect the phrase queries. That is why I am not using suggest configuration. On Thu, Jan 17, 2013 at 7:20 AM, Chris Hostetter wrote: > > : Or there is some other way to do that? > > I'm late to this thread, but what was wrong with the simple suggestion of > omitTermFreqAndPositions="true"

Re: Disable term frequency for some fields in solr

2013-01-16 Thread Chris Hostetter
: Or there is some other way to do that? I'm late to this thread, but what was wrong with the simple suggestion of omitTermFreqAndPositions="true" ? -Hoss

Re: Disable term frequency for some fields in solr

2013-01-16 Thread Amit Jha
Please correct my understanding, Use one of the factory as global similarity. And extends org.apache.lucene.search.similarities.DefaultSimilarity to create custom sim. And add a similarity tag in field type definition for required fields. Or there is some other way to do that? Rgds AJ On 17-

RE: Disable term frequency for some fields in solr

2013-01-16 Thread Markus Jelsma
To: solr-user@lucene.apache.org > Subject: Re: Disable term frequency for some fields in solr > > There's gonna be two ways to do this - for yourself or for everyone. > > For yourself, you'll want to subclass > org.apache.lucene.sear

Re: Disable term frequency for some fields in solr

2013-01-16 Thread Upayavira
There's gonna be two ways to do this - for yourself or for everyone. For yourself, you'll want to subclass org.apache.lucene.search.similarities.DefaultSimilarity and org.apache.solr.search.similarities.DefaultSimilarityFactory. Alternatively, patch those two files to allow setting the TF or the

Re: Disable term frequency for some fields in solr

2013-01-16 Thread Amit Jha
I did the same thing in Solr 3.6 and it works, but in Solr 3.6 field-level similarity is not available, and Solr 4 has Similarity Factories, so I could not work out how to do it on Solr 4. Which class do I need to extend to move ahead? On Wed, Jan 16, 2013 at 4:44 PM, Upayavira wrote: > For someone ver

Re: Disable term frequency for some fields in solr

2013-01-16 Thread Upayavira
This involves taking a subclass of the DefaultSimilarity class, in Java, and adding that to your Solr setup. For someone versed in Java, this is relatively straight-forward. For others it is non-trivial. Upayavira On Wed, Jan 16, 2013, at 10:57 AM, Amit Jha wrote: > Hi, > > How can I do this in

Re: Disable term frequency for some fields in solr

2013-01-16 Thread Amit Jha
Hi, How can I do this in solr4. Amit On Thu, Dec 6, 2012 at 1:40 PM, Markus Jelsma wrote: > custom similarity for that field that returns 1 for

RE: Disable term frequency for some fields in solr

2012-12-06 Thread Markus Jelsma
> Sent: Thu 06-Dec-2012 08:13 > To: solr-user@lucene.apache.org > Subject: Disable term frequency for some fields in solr > > Hi, > > In my case I would like to disable term frequency for some fields. These > field should return a constant term frequency irrespective of how m

Re: get a list of terms sorted by total term frequency

2012-11-07 Thread Edward Garrett
i see... using the -t flag it would be cool if TermsComponent had an option to sort by total term frequency, something like terms.sort={count|index|ttf} surely that's a common enough use case On Wed, Nov 7, 2012 at 6:17 PM, Michael McCandless wrote: > Lucene's misc module has

Re: get a list of terms sorted by total term frequency

2012-11-07 Thread Michael McCandless
Lucene's misc module has HighFreqTerms tool. Mike McCandless http://blog.mikemccandless.com On Wed, Nov 7, 2012 at 1:15 PM, Edward Garrett wrote: > hi, > > is there a simple way to get a list of all terms that occur in a field > sorted by their total term frequency

get a list of terms sorted by total term frequency

2012-11-07 Thread Edward Garrett
hi, is there a simple way to get a list of all terms that occur in a field sorted by their total term frequency within that field? TermsComponent (http://wiki.apache.org/solr/TermsComponent) "provides fast field faceting over the whole index", but as counts it gives the number of docu

Re: minimum match and not matched words / term frequency in query result

2012-04-18 Thread Jan Høydahl
ore the term "purple", because we don't have any. > > Can you suggest how to approach the problem? > > I was thinking about the debugQuery output, but since I will not get details > about all the results I probably will miss something. > > I am trying to wr

minimum match and not matched words / term frequency in query result

2012-04-18 Thread giovanni.bricc...@banzai.it
l miss something. I am trying to write a new SearchComponent but I don't know how to get term frequency data from a ResponseBuilder object... I am new to solr/lucene programming. Thanks a lot

Re: term frequency outweighs exact phrase match

2012-04-13 Thread alxsss
0&indent=on&qt=search&debugQuery=true Thanks. Alex. -Original Message- From: Chris Hostetter To: solr-user Sent: Thu, Apr 12, 2012 7:43 pm Subject: Re: term frequency outweighs exact phrase match : I use solr 3.5 with edismax. I have the following issue with phrase : search.

Re: term frequency outweighs exact phrase match

2012-04-12 Thread Chris Hostetter
: I use solr 3.5 with edismax. I have the following issue with phrase : search. For example if I have three documents with content like : : 1.apache apache : 2. solr solr : 3.apache solr : : then search for apache solr displays documents in the order 1,.2,3 : instead of 3, 2, 1 because term

Re: term frequency outweighs exact phrase match

2012-04-12 Thread alxsss
In that case documents 1 and 2 will not be in the results. We need them also be shown in the results but be ranked after those docs with exact match. I think omitting term frequency in calculating ranking in phrase queries will solve this issue, but I do not see that such a parameter in configs

Re: term frequency outweighs exact phrase match

2012-04-11 Thread Erick Erickson
ents with content like > > 1.apache apache > 2. solr solr > 3.apache solr > > then search for apache solr displays documents in the order 1,.2,3 instead of > 3, 2, 1 because term frequency in the first and second documents is higher > than in the third document. We want result

term frequency outweighs exact phrase match

2012-04-10 Thread alxsss
Hello, I use solr 3.5 with edismax. I have the following issue with phrase search. For example if I have three documents with content like 1.apache apache 2. solr solr 3.apache solr then search for apache solr displays documents in the order 1,.2,3 instead of 3, 2, 1 because term frequency in

Re: Limiting term frequency in a document to a specific term

2012-01-24 Thread Erick Erickson
slan wrote: >>> Below is an example query to search for the term frequency >>> in a document, >>> but it is returning the frequency for all the terms. >>> >>> [ >>> > http://localhost:8983/solr/select/?fl=documentPageId&q=documentPageId:

Re: Limiting term frequency in a document to a specific term

2012-01-24 Thread solr user
hy I get a parse exception for the above syntax. On Monday, January 23, 2012, Ahmet Arslan wrote: >> Below is an example query to search for the term frequency >> in a document, >> but it is returning the frequency for all the terms. >> >> [ >> http://loc

Re: Ngram autocompleter and term frequency boosting

2012-01-23 Thread Cuong Hoang
rmance Monitoring SaaS for Solr - > http://sematext.com/spm/solr-performance-monitoring/index.html > > > > > > > > - Original Message - > >> From: Cuong Hoang > >> To: solr-user@lucene.apache.org > >> Cc: > >> Sent: Thursday,
