Re: apply document filter to solr index

2016-01-04 Thread Alexandre Rafalovitch
Well, you have a crawling and extraction pipeline. You can probably inject a classification algorithm somewhere in there, possibly NLP trained on manual seed. Or just a list of typical words as a start. This is kind of pre-Solr stage though. Regards, Alex On 4 Jan 2016 7:37 pm, wrote: > Hi

Re: apply document filter to solr index

2016-01-04 Thread Binoy Dalal
There is no way that you can do that in solr. You'll have to write something at the app level, where you're crawling your docs or write a custom update handler that will preprocess the crawled docs and throw out the irrelevant ones. One way you can do that is look at the doc title and the url fo

apply document filter to solr index

2016-01-04 Thread liviuchristian
Hi everyone, I'm working on a search engine based on solr which indexes documents from a large variety of websites.  The engine is focused on cook recipes. However, one problem is that these websites provide not only content related to cooking recipes but also content related to: fashion, travel