Re: Solr indexing based on last_modified

2015-08-17 Thread Erick Erickson
Well, you'll need some kind of timestamp that you can reference, and only re-send files that have a newer timestamp. Or keep a DB around with file path/last indexed timestamp or

Best,
Erick

On Mon, Aug 17, 2015 at 12:36 PM, coolmals wrote:
> I have a file system. I have a scheduler wh
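
The "file path/last indexed timestamp" idea above could be sketched roughly as follows. This is only a minimal illustration, not something from the thread: it assumes a local SQLite file serves as the DB, and the table name `indexed` and the helpers `changed_files` and `mark_indexed` are hypothetical names chosen for this example. Actually sending a file to Solr is left out.

```python
import os
import sqlite3


def changed_files(root, conn):
    """Return (path, mtime) for files under root that are new or were
    modified after the timestamp recorded when they were last indexed."""
    # Hypothetical bookkeeping table: one row per file already sent to Solr.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS indexed (path TEXT PRIMARY KEY, mtime REAL)"
    )
    result = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            mtime = os.path.getmtime(path)
            row = conn.execute(
                "SELECT mtime FROM indexed WHERE path = ?", (path,)
            ).fetchone()
            # Never indexed, or modified since the recorded timestamp.
            if row is None or mtime > row[0]:
                result.append((path, mtime))
    return result


def mark_indexed(conn, path, mtime):
    """Record that path was indexed at modification time mtime."""
    conn.execute(
        "INSERT OR REPLACE INTO indexed (path, mtime) VALUES (?, ?)",
        (path, mtime),
    )
    conn.commit()
```

A scheduled job would call `changed_files`, send each returned file to Solr by whatever means it uses, and call `mark_indexed` only after the send succeeds, so that a failed file is retried on the next run. Deleted files are not handled in this sketch.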

Re: Solr indexing based on last_modified

2015-08-17 Thread coolmals
I have a file system. I have a scheduler which calls Solr at a scheduled interval. Any updates to the file system must be indexed by Solr. Only the changes must be re-indexed, as the file system is huge and cannot be re-indexed every time.

Re: Solr indexing based on last_modified

2015-08-17 Thread Erick Erickson
There's no way that I know of with post.jar. Post.jar was never really intended as a production tool, and sending all the files to Solr for parsing (PDF, Word and the like) puts quite a load on the Solr server. What is your use case? You might consider a SolrJ program; it would be simple eno