Nutch
- Original Message
From: Walter Underwood
To: solr-user@lucene.apache.org
Sent: Wednesday, May 27, 2009 10:53:16 PM
Subject: Re: term vectors
If you really, really need to do XML-smart queries, go ahead and buy
MarkLogic. I've worked with the principle folk there and they are
re
> Sent: Wednesday, May 27, 2009 10:53:16 PM
> Subject: Re: term vectors
>
> If you really, really need to do XML-smart queries, go ahead and buy
> MarkLogic. I've worked with the principle folk there and they are
> really sharp. Their engine is awesome. XML search is hard, a
If you really, really need to do XML-smart queries, go ahead and buy
MarkLogic. I've worked with the principle folk there and they are
really sharp. Their engine is awesome. XML search is hard, and you
can't take a regular search engine, even a really good one, and make
it do full XML without tons
I've been experimenting with the XML + Solr combo too. What I've found to be
a good working solution is to:
pick out the nodes you want as solr documents (every div1 or div2 etc.)
index the text only (with lots of metadata fields)
add a field for either the xpath to that node, or
save the indivi
On May 27, 2009, at 4:56 PM, Yosvanys Aponte wrote:
i undestand what you say
but the problem i have is
user can make query like this:
//tei.2//p"[quijote"]
A couple of problems with this... for one, there's no query parser
that'll interpret that syntax as you mean it in Solr. And also,
i undestand what you say
but the problem i have is
user can make query like this:
//tei.2//p"[quijote"]
user want to find all paragraph that belong to tei.2 and have the word
"quijote"
then i have to search structure and content, because i have the and index
format to save the structure and the
Aponte - I'm not quite understanding your question. Could you provide
some detailed examples of what you're trying to accomplish?
Just guessing from your post, it seems what you are after is
flattening your structure so that it fits within Solr/Lucene's
document & field capabilities and th