Try setting the log level to FINEST / DEBUG for the extract request handler
and the Tika packages, then post the relevant log lines.
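
As a rough sketch of what that could look like, assuming your Solr setup routes its logging through java.util.logging (that is where the FINEST level name comes from): the package names below are the real ones for the extracting request handler and for Tika, but the class and method names are made up for illustration. If your setup uses log4j instead, the same two package names would be set to DEBUG in its configuration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.logging.ConsoleHandler;
import java.util.logging.Handler;
import java.util.logging.Level;
import java.util.logging.Logger;

// Hypothetical helper, not part of Solr or Tika.
class VerboseExtractLogging {
    // Packages of Solr's extracting request handler and of Tika.
    static final String[] PACKAGES = {
        "org.apache.solr.handler.extraction",
        "org.apache.tika"
    };

    // Raises both packages to FINEST and returns the loggers.
    // The caller should keep the returned list, because java.util.logging
    // only holds weak references to loggers.
    static List<Logger> enableFinest() {
        List<Logger> loggers = new ArrayList<>();
        for (String name : PACKAGES) {
            Logger logger = Logger.getLogger(name);
            logger.setLevel(Level.FINEST);
            if (logger.getHandlers().length == 0) {
                // A ConsoleHandler defaults to INFO, so it has to be raised too.
                Handler handler = new ConsoleHandler();
                handler.setLevel(Level.FINEST);
                logger.addHandler(handler);
                // Avoid printing INFO and above twice via the root handler.
                logger.setUseParentHandlers(false);
            }
            loggers.add(logger);
        }
        return loggers;
    }

    public static void main(String[] args) {
        for (Logger logger : enableFinest()) {
            System.out.println(logger.getName() + " -> " + logger.getLevel());
        }
    }
}
```

With that in place, re-index coding.pdf and look for messages from those two packages around the time of the extract request.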
 On 11 Jan 2014 14:38, "sweety" <sweetyshind...@yahoo.com> wrote:

> Sorry that my question was not clear.
> Initially, when I indexed PDF files, the data within the PDF showed up in the
> contents field, as follows (this is the output for the initially indexed documents):
> <str name="contents">
> Cloud ctured As tale in size as well as complexity. We need a cloud based
> system that will solve this problem.  Provide interfaces to registeP CSS
> Client Measurements Benchmarkinse times by varying Number of documents
> fromnds to millions Nuervers from 1 to 5 Storage and search options as
> discussed abo
> </str>
>
> But for newly indexed documents, the contents field is empty.
> coding.pdf is actually 3 MB in size, but as shown in the output, the contents
> of this PDF are not extracted. Indexing extracts the metadata, but not the
> contents of the file, so
> the contents field is empty: <str name="contents"></str>
>
> What is the reason for this? Is it because some jar is missing?
>
>
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/using-extract-handler-data-not-extracted-tp4110850p4110873.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
