Re: Searching for an efficient and scalable way to filter query results using non-indexed and dynamic range values

2018-02-01 Thread Luigi Caiazza
e.apache.org/core/7_2_1/core/org/apache/ > lucene/document/IntPoint.html > > 2018-01-31 13:45 GMT+01:00 Rick Leir : > > > Luigi > > Is there a reason for not indexing all of your on-disk pages? That seems > > to be the first step. But I do not understand what your goal

Re: Searching for an efficient and scalable way to filter query results using non-indexed and dynamic range values

2018-01-31 Thread Luigi Caiazza
t.html 2018-01-31 13:45 GMT+01:00 Rick Leir : > Luigi > Is there a reason for not indexing all of your on-disk pages? That seems > to be the first step. But I do not understand what your goal is. > Cheers -- Rick > > On January 30, 2018 1:33:27 PM EST, Luigi Caiazza > wrote

Searching for an efficient and scalable way to filter query results using non-indexed and dynamic range values

2018-01-30 Thread Luigi Caiazza
Hello, I am working on a project that simulates a selective, large-scale crawling. The system adapts its behaviour according with some external user queries received at crawling time. Briefly, it analyzes the already crawled pages in the top-k results for each query, and prioritizes the visit of t