Re: Exclude a folder/directory from indexing

2020-08-28 Thread Walter Underwood
For building a crawler, I’d start with Scrapy (https://scrapy.org ). It is a solid design and should be easy to use for crawling web pages, files, or an API. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Aug 28, 2020, at 4:16

Re: Exclude a folder/directory from indexing

2020-08-28 Thread Joe Doupnik
    Some time ago I faced a roughly similar challenge. After many trials and tests I ended up creating my own programs to accomplish the tasks of fetching files, selecting which are allowed to be indexed, and feeding them into Solr (POST style). This work is open source, found on https://netlab

Re: Exclude a folder/directory from indexing

2020-08-27 Thread Alexandre Rafalovitch
If you are indexing from Drupal into Solr, that's the question for Drupal's solr module. If you are doing it some other way, which way are you doing it? bin/post command? Most likely this is not the Solr question, but whatever you have feeding data into Solr. Regards, Alex. On Thu, 27 Aug 2020

Exclude a folder/directory from indexing

2020-08-27 Thread Staley, Phil R - DCF
Can you or how do you exclude a specific folder/directory from indexing in SOLR version 7.x or 8.x? Also our CMS is Drupal 8 Thanks, Phil Staley DCF Webmaster 608 422-6569 phil.sta...@wisconsin.gov