Try to set to FINEST / DEBUG level the extract request handler and Tika packages and post relevant log lines On 11 Jan 2014 14:38, "sweety" <sweetyshind...@yahoo.com> wrote:
> Sorry, that my question was not clear. > Initially when indexed pdf files it showed the data within this pdf in the > contents field.as follows:(this is output for initially indexed documents) > <str name="contents"> > Cloud ctured As tale in size as well as complexity. We need a cloud based > system that will solve this problem. Provide interfaces to registeP CSS > Client Measurements Benchmarkinse times by varying Number of documents > fromnds to millions Nuervers from 1 to 5 Storage and search options as > discussed abo > </str> > > But for newly indexed documents, the contents field is empty, > Actually coding.pdf is of 3mb size, but as shown in the output the contents > of this pdf are not extracted, indexing extracts the metadata,but not the > contents of the file, > the contents field is empty, <str name="contents"></str> > > what is the reason for this? Is is because of some jar missing? > > > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/using-extract-handler-data-not-extracted-tp4110850p4110873.html > Sent from the Solr - User mailing list archive at Nabble.com. >