Hi!
I'm facing a similar problem.
I'm dealing with Thesauri on an Oracle RDBMS and I'm trying to integrate
Solr in order to speed up search operations.
Did you succeed in this integration?
Thanks,
Laura
--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html
Hi Chris,
thanks for your description. I should think about this a little bit
more, then I will ask some details. The main problem is that Synonyms
are one kind of relations, and Thesaurus may contain 6-10 kinds of
relations. And it is depending on the user, which types of relations
he would like
: The question asked, in good faith, was does solr support or extend to
: implementing a thesaurus. It looks like it does not which is fine. It does
Well, my point was that "thesaurus" is not a feature description. it's a
data structure, and depending on your goals, the existing SynonymFilter
: My imaginative use case:
: - the user enters a term and maybe he turns on a flag to get not just
: the term, but all terms, which related somehow with this (usually the
: synonyms and narrower terms).
: - Solr first find the queried term(s) in the thesaurus, then finds the
: related terms, modif
Two Peters (or rather a stupid english bloke who can't work out how to type
fancy accents :-)
Sorry Péter (took me 10 minutes to work out i could cut and paste) my reply
was to the clustering post by Peter Sturge. Clustering sounds great but
being able to define a thesaurus scheme excatly would be
Hi Lee,
according to my vision the user could decide which relationship types
would he likes to attach to his search, and the application would call
his attention to other possibilities. So there would be no heuristic
method applied, because e.g. boarder terms would cause lots of
misleading result
Hi Peter,
Thats way to clever for me :-)
Discovering thesuarus relationships would be fantastic but its not clear
what heuristics you would need to use to discover broader, narrower, related
documents etc. Although I might be doing the clustering down i'm sceptical
about the accuracy.
cheers Lee
Hi Lee,
Perhaps Solr's clustering component might be helpful for your use case?
http://wiki.apache.org/solr/ClusteringComponent
On Fri, Dec 10, 2010 at 9:17 AM, lee carroll
wrote:
> Hi Chris,
>
> Its all a bit early in the morning for this mined :-)
>
> The question asked, in good faith, was
Hi Chris,
Its all a bit early in the morning for this mined :-)
The question asked, in good faith, was does solr support or extend to
implementing a thesaurus. It looks like it does not which is fine. It does
support synonyms and synonym rings which is again fine. The ski example was
an illustrat
I also try to define the problem.
In the library world there are some general and special thesaurus,
which reveal the relations between concepts. The relations have types
as Lee described: Prefered Term (PT), Broader Terms (BT), Narrower
Terms (NT) Related Terms (RT) and others. Some of these thes
: a term can have a Prefered Term (PT), many Broader Terms (BT), Many Narrower
: Terms (NT) Related Terms (RT) etc
...
: User supplied Term is say : Ski
:
: Prefered term: Skiing
: Broader terms could be : Ski and Snow Boarding, Mountain Sports, Sports
: Narrower terms: down hill skiing,
No, it doesn't. And it's not entirely clear what (if any) simple way
there is to use Solr to expose hieararchically related documents in a
way that preserves and usefully allows navigation of the relationships.
At least in general, for sophisticated stuff.
On 12/2/2010 3:55 AM, lee carroll w
Hi
Stephen, yes sorry should have been more plain
a term can have a Prefered Term (PT), many Broader Terms (BT), Many Narrower
Terms (NT) Related Terms (RT) etc
So
User supplied Term is say : Ski
Prefered term: Skiing
Broader terms could be : Ski and Snow Boarding, Mountain Sports, Sports
Narr
Hello Lee,
these bells sound like "SKOS" ;o)
AFAIK Solr does not support thesauri just plain flat synonym lists.
One could implement a thesaurus filter and put it into the end of the analyzer
chain of solr.
The filter would then do a thesaurus lookup for each token it receives and
possibly
*
Hi Lee,
Can you describe your thesaurus format (it's not exactly self-descriptive) and
how you would like it to be applied?
I gather you're referring to a thesaurus feature in another product (or product
class)? Maybe if you describe that it would help too.
Steve
> -Original Message-
15 matches
Mail list logo