2011/6/23 Anshum <ansh...@gmail.com> > Pooja, > You could use UIMA (or any other) Parts of Speech Tagger. You could read a > little more about it here. > > http://uima.apache.org/downloads/sandbox/hmmTaggerUsersGuide/hmmTaggerUsersGuide.html#sandbox.tagger.annotatorDescriptor > This would help you annotate and segregate nouns from verbs in the input. > You could then aptly form the query. > Perhaps this would take some effort but 'm assuming it'd work reasonably > well. >
I've done this recently using UIMA POS tagger and other annotators within a TokenFilter to add TypeAttribute and PayloadAttribute to each token and eventually filter/boost when searching. Regards, Tommaso > > -- > Anshum Gupta > http://ai-cafe.blogspot.com > > > On Thu, Jun 23, 2011 at 11:18 AM, Pooja Verlani <pooja.verl...@gmail.com > >wrote: > > > Hi, > > > > Say for example, a query like "mammohan singh dancing", I am preferring > to > > make a compulsory condition on nouns to be searched but any verb isnt > > important for me, I am preferring to extract results for manmohan singh > and > > not for dancing. If I can extract noun verb or can get to know that in my > > index I have a concept of "manmohan singh" or an identity if not concept, > I > > would like to define rules for doing a strict(compulsory) match of > > noun(concept) and loose match(non-compulsory boosting) for the verb. > > > > Basically, I want to avoid getting zero results for a compulsory match of > > the 3 tokens(in this case manmohan singh dancing) of the query and > instead > > I > > want to do a compulsory match on manmohan singh since that exists in my > > index and "dancing" shouldn't be a compulsory match for non-zero number > of > > results. > > > > Hope this explains. > > Any suggestions? > > > > Regards, > > Pooja > > > > > > On Thu, Jun 23, 2011 at 11:07 AM, Anshum <ansh...@gmail.com> wrote: > > > > > What would you mean by 'noun or some concept'. Would be better if you > > could > > > give a rather concrete example. > > > About detecting parts of speech, you could use a lot of libraries but I > > > didn't get about boosting terms from the Index. > > > > > > > > > -- > > > Anshum Gupta > > > http://ai-cafe.blogspot.com > > > > > > > > > On Thu, Jun 23, 2011 at 11:02 AM, Pooja Verlani < > pooja.verl...@gmail.com > > > >wrote: > > > > > > > Hi, > > > > > > > > At the query time, I want to make the lucene query such that it > should > > > > boost > > > > only the noun from the query or some concept existing in the index. > Are > > > > there any possibilities or any possible ideas that can be worked > > around? > > > > > > > > > > > > Regards, > > > > Pooja > > > > > > > > > >