Hi Jacob,

Thanks for the reply. I am still trying to nail down this problem with the best 
possible solution.
Yeah I had thought about these 2 approaches but both of them are gonna make my 
indexing slower.  Plus the fact that I will have atleast 5 rich text files 
associated with each document is not helping much either.

Anyways I will explore and see if I can come up with anything better (may be a 
separate index for rich text docs).

Thanks,
Venu



________________________________
From: Jacob Singh <jacobsi...@gmail.com>
To: solr-user@lucene.apache.org
Sent: Saturday, April 4, 2009 9:59:13 PM
Subject: Re: ExtractingRequestHandler Question

Hi TIA,

I have the same desired requirement.  If you look up in the archives,
you might find a similar thread between myself and the always super
helpful Erik Hatcher.  Basically, it can't be done (right now).

You can however use the "ExtractOnly" request handler, and just get
the extracted text back from solr, and then use xpath to get out the
attributes and then add them to your XML you are sending.

Not ideal because the file has to be transfered twice.

The only other option is to send the file as per the instructions via
POST with its attributes as POST fields.

Keep in mind that Solr documents are immutable, which means they
cannot change.  When you update a document with the same primary key,
it will simply delete the existing one and add the new one.

hth,
Jacob

On Sat, Apr 4, 2009 at 5:59 AM, Venu Mittal <metale...@yahoo.com> wrote:
> Hi,
>
> I am using ExtractingRequestHandler to index  rich text documents.
> The way I am doing it is I get some data related to the document from 
> database and then post an xml  (containing only this data ) to solr. Then I 
> make another call to solr, which sends the actual document to be indexed.
> But while doing so I am loosing out all the other data that is related to the 
> document.
>
> Is this the right way to do handle it or am I missing out on something.
>
> TIA
>
>
>
>



-- 

+1 510 277-0891 (o)
+91 9999 33 7458 (m)

web: http://pajamadesign.com

Skype: pajamadesign
Yahoo: jacobsingh
AIM: jacobsingh
gTalk: jacobsi...@gmail.com



      

Reply via email to