Re: Requiring multiple matches of a term

2011-08-22 Thread Simon Willnauer
On Mon, Aug 22, 2011 at 8:10 PM, Chris Hostetter wrote: > > : One simple way of doing this is maybe to write a wrapper for TermQuery > : that only returns docs with a Term Frequency  > X as far as I > : understand the question those terms don't have to be within a certain > : window right? > > I d

Re: Requiring multiple matches of a term

2011-08-22 Thread Chris Hostetter
: One simple way of doing this is maybe to write a wrapper for TermQuery : that only returns docs with a Term Frequency > X as far as I : understand the question those terms don't have to be within a certain : window right? I don't think you could do it as a Query Wrapper -- it would have to be

RE: Requiring multiple matches of a term

2011-08-21 Thread Michael Ryan
> One simple way of doing this is maybe to write a wrapper for TermQuery > that only returns docs with a Term Frequency > X as far as I > understand the question those terms don't have to be within a certain > window right? Correct. Terms can be anywhere in the document. I figured term frequencie

Re: Requiring multiple matches of a term

2011-08-21 Thread Simon Willnauer
On Fri, Aug 19, 2011 at 6:26 PM, Michael Ryan wrote: > Is there a way to specify in a query that a term must match at least X times > in a document, where X is some value greater than 1? > One simple way of doing this is maybe to write a wrapper for TermQuery that only returns docs with a Term F

Re: Requiring multiple matches of a term

2011-08-19 Thread Chris Hostetter
FWIW: i think this is a really cool and interesting question. : Is there a way to specify in a query that a term must match at least X : times in a document, where X is some value greater than 1? at the moment, i think your "phrase query" approach is really the only viable way (allthough it di

Requiring multiple matches of a term

2011-08-19 Thread Michael Ryan
Is there a way to specify in a query that a term must match at least X times in a document, where X is some value greater than 1? For example, I want to only get documents that contain the word "dog" three times. I've thought that using a proximity query with an arbitrary large distance value