Re: Not all EML files are indexing during indexing

Walter Underwood Tue, 02 Jun 2020 08:42:12 -0700

> On Jun 2, 2020, at 7:40 AM, Charlie Hull <[email protected]> wrote:
> 
> If it was me I'd probably build a standalone indexer script in Python that 
> did the file handling, called out to a separate Tika service for extraction, 
> posted to Solr.


I would do the same thing, and I would base that script on Scrapy 
(https://scrapy.org). I worked on a Python-based web spider for about ten 
years.
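
A minimal sketch of the pipeline described above (walk the files, send each
one to a separate Tika server for extraction, post the text to Solr). It
uses only the Python standard library, not Scrapy. The URLs, the core name
"mail", and the field name "content_txt" are assumptions for illustration,
so adjust them to your setup. Failures are collected instead of silently
skipped, which makes it easier to see which EML files did not get indexed.

```python
import json
import sys
import urllib.request
from pathlib import Path

# Assumed endpoints: a Tika server on its default port and a Solr core
# named "mail" (hypothetical name).
TIKA_URL = "http://localhost:9998/tika"
SOLR_URL = "http://localhost:8983/solr/mail/update?commit=true"


def tika_request(data: bytes) -> urllib.request.Request:
    """Build a PUT request asking Tika to return plain text for one file."""
    return urllib.request.Request(
        TIKA_URL,
        data=data,
        method="PUT",
        headers={"Accept": "text/plain", "Content-Type": "message/rfc822"},
    )


def solr_request(docs: list) -> urllib.request.Request:
    """Build a POST request that sends a JSON array of documents to Solr."""
    return urllib.request.Request(
        SOLR_URL,
        data=json.dumps(docs).encode("utf-8"),
        method="POST",
        headers={"Content-Type": "application/json"},
    )


def index_directory(root: str):
    """Index every .eml file under root; return (count indexed, failures)."""
    indexed, failed = 0, []
    for path in sorted(Path(root).rglob("*.eml")):
        try:
            with urllib.request.urlopen(tika_request(path.read_bytes()), timeout=60) as resp:
                text = resp.read().decode("utf-8", errors="replace")
            # "content_txt" is a hypothetical field name; use your schema's.
            doc = {"id": str(path), "content_txt": text}
            with urllib.request.urlopen(solr_request([doc]), timeout=60):
                pass
            indexed += 1
        except OSError as exc:  # covers file errors and urllib's URLError
            failed.append((str(path), str(exc)))
    return indexed, failed


if __name__ == "__main__":
    count, failures = index_directory(sys.argv[1])
    print(f"indexed {count} files, {len(failures)} failed")
    for name, reason in failures:
        print(f"FAILED {name}: {reason}")
```

Committing on every document is slow for a large mailbox. Batching several
documents per request and committing once at the end would be the usual
next step.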

wunder
Walter Underwood
[email protected]
http://observer.wunderwood.org/  (my blog)
