Re: Solr indexing based on last_modified

2015-08-17 Thread Erick Erickson
Well, you'll have to have some kind of timestamp that you can reference and only re-send files that have a newer timestamp. Or keep a DB around with file path/last indexed timestamp or Best, Erick On Mon, Aug 17, 2015 at 12:36 PM, coolmals wrote: > I have a file system. I have a scheduler wh
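
A minimal sketch of the "DB with file path/last indexed timestamp" idea mentioned above, assuming a local SQLite file serves as that DB. The names `find_changed_files`, `mark_indexed`, and the `indexed` table are hypothetical, chosen for illustration only; nothing here comes from Solr or post.jar. Actually sending the files to Solr is deliberately left out.

```python
import os
import sqlite3

# Hypothetical schema: one row per file that was already sent to Solr.
SCHEMA = "CREATE TABLE IF NOT EXISTS indexed (path TEXT PRIMARY KEY, mtime REAL)"


def find_changed_files(root, db_path):
    """Return files under root that are new or whose mtime differs from the DB."""
    conn = sqlite3.connect(db_path)
    try:
        conn.execute(SCHEMA)
        known = dict(conn.execute("SELECT path, mtime FROM indexed"))
    finally:
        conn.close()

    changed = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            # A path missing from the DB yields None, which never equals an mtime.
            if known.get(path) != os.path.getmtime(path):
                changed.append(path)
    return sorted(changed)


def mark_indexed(db_path, paths):
    """Record the current mtime of each path once it has been indexed."""
    conn = sqlite3.connect(db_path)
    try:
        conn.execute(SCHEMA)
        conn.executemany(
            "INSERT OR REPLACE INTO indexed (path, mtime) VALUES (?, ?)",
            [(p, os.path.getmtime(p)) for p in paths],
        )
        conn.commit()
    finally:
        conn.close()
```

Calling `mark_indexed` only for files that were sent successfully means a failed upload is retried on the next pass. The DB file should live outside the scanned directory, otherwise it would be reported as a changed file itself. Deleted files are not handled in this sketch.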

Re: Solr indexing based on last_modified

2015-08-17 Thread coolmals
I have a file system. I have a scheduler which will call Solr at a scheduled interval. Any updates to the file system must be indexed by Solr. Only the changes must be re-indexed, because the file system is huge and cannot be re-indexed every time. -- View this message in context: http://lucene.472066.n3.
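
For a scheduled job like the one described here, one possible shape is a single "last run" timestamp kept in a marker file, with each pass sending only files modified after it. This is a rough sketch under stated assumptions: `run_once`, `files_modified_since`, the marker file, and the `send` callback are all hypothetical names, and `send` stands in for whatever actually posts a file to Solr.

```python
import os
import time


def files_modified_since(root, since):
    """Return files under root whose mtime is strictly newer than `since`."""
    changed = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.getmtime(path) > since:
                changed.append(path)
    return sorted(changed)


def run_once(root, marker_path, send):
    """One scheduled pass: send every file changed since the previous pass."""
    # Take the timestamp before scanning, so a file modified while this
    # pass is running is picked up by the next pass instead of being missed.
    started = time.time()
    try:
        with open(marker_path) as f:
            since = float(f.read().strip())
    except (FileNotFoundError, ValueError):
        since = 0.0  # first run (or unreadable marker): treat everything as new

    changed = files_modified_since(root, since)
    for path in changed:
        send(path)  # hypothetical callback that indexes one file

    # Only advance the marker after every send has returned without raising.
    with open(marker_path, "w") as f:
        f.write(repr(started))
    return changed
```

This variant needs no per-file bookkeeping, but it depends on file modification times and the scheduler's clock agreeing, and a file may occasionally be sent twice if it changes during a pass.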

Re: Solr indexing based on last_modified

2015-08-17 Thread Erick Erickson
There's no way that I know of with post.jar. post.jar was never really intended as a production tool, and sending all the files to Solr for parsing (PDF, Word and the like) puts quite a load on the Solr server. What is your use case? You might consider a SolrJ program; it would be simple eno

Solr indexing based on last_modified

2015-08-17 Thread coolmals
I want to update the index of a file only if its last_modified has changed. I am running post.jar with fileTypes="*", and I want to re-index only the files that have changed since the index was last updated. Can you let me know how to achieve this? -- View this me