Re: delta-import of rich documents like word and pdf files!

2011-12-22 Thread Alessandro Benedetti
Hi guys, I probably found a way to mimic the delta import for the fileEntityProcessor (I have used it for XML files ...). Adding this configuration in the xml-data-config: And using the command: *command=full-import&clean=false* Solr adds to the index only the files that were changed from the
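
The configuration from this message did not survive in the archive. What follows is only a sketch of the kind of setup being described, assuming the `newerThan` attribute of FileListEntityProcessor is pointed at `${dataimporter.last_index_time}`, so that a `full-import&clean=false` run picks up only files modified since the previous import. The `fileName` pattern, the Tika sub-entity, and the `id` and `text` field names are illustrative assumptions, not the poster's actual config; `baseDir` and the `bin` data source are borrowed from the config quoted elsewhere in this thread.

```xml
<dataConfig>
  <!-- Binary source used by Tika to read the file contents -->
  <dataSource type="BinFileDataSource" name="bin"/>
  <document>
    <!-- Outer entity only lists files; rootEntity="false" means it creates no documents itself.
         newerThan limits the listing to files modified after the last recorded import
         (assumption: this is the mechanism the message refers to). -->
    <entity name="f" processor="FileListEntityProcessor"
            baseDir="/var/data/solr" fileName=".*\.(pdf|doc)"
            recursive="true" rootEntity="false" dataSource="null"
            newerThan="${dataimporter.last_index_time}">
      <!-- Assumption: the schema's uniqueKey is "id", so a re-imported file
           overwrites its earlier document instead of duplicating it -->
      <field column="fileAbsolutePath" name="id"/>
      <!-- Inner entity extracts text from each listed file -->
      <entity name="tika" processor="TikaEntityProcessor"
              url="${f.fileAbsolutePath}" dataSource="bin" format="text">
        <field column="text" name="text"/>
      </entity>
    </entity>
  </document>
</dataConfig>
```

With `clean=false` the existing index is left in place, so files that did not change keep their documents. Files deleted from disk are not removed from the index by this approach.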

Re: delta-import of rich documents like word and pdf files!

2011-11-20 Thread kumar8anuj
I am using Solr 3.4 and configured my DataImportHandler to get some data from MySQL as well as index some rich documents from the disk. This is the part of the db-data-config file where I am indexing rich text documents. http://localhost/resumes-new/resumes${re

Re: delta-import of rich documents like word and pdf files!

2011-11-18 Thread neuron005
When I set my fileSize to type string, it shows the error I posted above. Then I changed it to slong and the results were severe. Here is the log: 18 Nov, 2011 3:00:54 PM org.apache.solr.response.BinaryResponseWriter$Resolver getDoc WARNING: Error reading a field from document : SolrDocument[{}]

Re: delta-import of rich documents like word and pdf files!

2011-11-17 Thread neuron005
Sorry for disturbing you all... actually I had to add plong instead of type string. My problem is solved. Be ready for a new thread. CHEERS -- View this message in context: http://lucene.472066.n3.nabble.com/delta-import-of-rich-documents-like-word-and-pdf-files-tp3502039p3515711.html Sent from t

Re: delta-import of rich documents like word and pdf files!

2011-11-17 Thread neuron005
I ran this command and can see the size of my files: http://localhost:8080/solr/select?q=user&f.fileSize.facet.range.start=100 Great, thanks... string worked... I don't know why that did not work last time. But when I do that in the browse section, I saw the following output in my logs: SEVERE: Exception during

Re: delta-import of rich documents like word and pdf files!

2011-11-17 Thread neuron005
And also I set my fileSize to type long. "String" will not work, I think! Size cannot be a string... it shows an error when using string as the type.

Re: delta-import of rich documents like word and pdf files!

2011-11-17 Thread neuron005
Thanks for your reply. I performed these steps. In data-config.xml: In schema.xml: -- But still there is no response in the browse section. I edited facet_r

Re: delta-import of rich documents like word and pdf files!

2011-11-17 Thread Ahmet Arslan
> Now, I want to index my files according to their size and facet them according to their size ranges. I know that there is an option of "fileSize" in FileListEntityProcessor but I am not getting any way to perform this. Is fileSize a metadata?
You don't need a dynamic field for this.
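
The rest of this reply is cut off in the archive. As a sketch of one way to follow the advice: FileListEntityProcessor emits `fileSize` as an implicit value for every file it lists, so it can be mapped onto an ordinary, explicitly declared schema field. The `fileName` pattern below is an assumption, and the `plong` type is simply the one a later message in this thread reports settling on, not a recommendation.

```xml
<!-- data-config.xml (fragment): map the implicit fileSize value to a Solr field -->
<entity name="f" processor="FileListEntityProcessor"
        baseDir="/var/data/solr" fileName=".*\.(pdf|doc)"
        recursive="true" rootEntity="false" dataSource="null">
  <field column="fileSize" name="fileSize"/>
</entity>

<!-- schema.xml (fragment): a regular field, no dynamicField needed.
     "plong" is taken from a later message in the thread; use whichever
     numeric type your schema.xml actually defines. -->
<field name="fileSize" type="plong" indexed="true" stored="true"/>
```

Because the outer entity has `rootEntity="false"`, the `fileSize` value ends up on the documents produced by whatever sub-entity is nested inside it.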

Re: delta-import of rich documents like word and pdf files!

2011-11-17 Thread neuron005
Thank you for your replies, guys. That helped a lot. Thanks "iorixxx", that was the command that worked out. I also tried my Solr with MySQL and that worked too. Congo! :) Now, I want to index my files according to their size and facet them according to their size ranges. I know that t

Re: delta-import of rich documents like word and pdf files!

2011-11-14 Thread Erick Erickson
And you cannot update-in-place. That is, you can't update just selected fields in a document, you have to re-index the whole document. Best Erick On Mon, Nov 14, 2011 at 6:11 AM, Ahmet Arslan wrote: > >> Thanks for your reply...my >> data-config.xml is >> >>         > type="BinF

Re: delta-import of rich documents like word and pdf files!

2011-11-14 Thread Ahmet Arslan
> Thanks for your reply...my data-config.xml is
> <dataSource type="BinFileDataSource" name="bin"/>
> <entity name="f" pk="id" processor="FileListEntityProcessor" recursive="true" rootEntity="false" dataSource="null" baseDir="/var/data/solr" fileName=

Re: delta-import of rich documents like word and pdf files!

2011-11-14 Thread neuron005
Thanks for your reply...my data-config.xml is

Re: delta-import of rich documents like word and pdf files!

2011-11-14 Thread Ahmet Arslan
> Thanks for your reply Mr. Erick. All I want to do is that I have indexed some of my pdf files and doc files. Now, any changes I make to them, I want a delta-import (incremental) so that I do not have to re-index the whole document by full import. Only changes made to these documents shou

Re: delta-import of rich documents like word and pdf files!

2011-11-13 Thread neuron005
And the changes are: file content; maybe I change its author and headers.

Re: delta-import of rich documents like word and pdf files!

2011-11-13 Thread neuron005
Thanks for your reply, Mr. Erick. All I want to do is this: I have indexed some of my pdf files and doc files. Now, for any changes I make to them, I want a delta-import (incremental) so that I do not have to re-index the whole document by full import. Only changes made to these documents should get updated.

Re: delta-import of rich documents like word and pdf files!

2011-11-12 Thread Erick Erickson
Can you give more details about what you're trying to do? It looks like you're using DataImportHandler? What defines a document needing to be re-indexed? How do you expect to be able to identify them??? Perhaps you can review: http://wiki.apache.org/solr/UsingMailingLists Best Erick On Sat, Nov