Thanks, David, Shawn, Jagdish

Help and suggestions are really appreciated.

Regards,
Lucky Sharma

On Thu, Jun 27, 2019 at 12:50 AM Shawn Heisey <apa...@elyograg.org> wrote:
>
> On 6/26/2019 12:56 PM, Lucky Sharma wrote:
> > @Shawn: Sorry I forgot to mention the corpus size: the corpus size is
> > around 3 million docs, where we need to query for 1500 docs and run
> > aggregations, sorting, search on them.
>
> Assuming the documents aren't HUGE, that sounds like something Solr
> should be able to handle pretty easily on a typical modern 64-bit
> system.  I handled multiple indexes much larger than that with an 8GB
> heap on Linux servers with 64GB total memory.  Most likely you won't
> need anything that large.
>
> Depending on exactly what you're going to do with it, that's probably
> also something easily handled by a relational database or a more modern
> NoSQL solution ... especially if "traditional search" is not part of
> your goals.  Solr can do things beyond search, but search is where
> everything is optimized, so if search is not part of your goal, you
> might want to look elsewhere.
>
> > @David: But will that not be a performance hit (resource incentive)?
> > since it will have that many terms to search upon, the query parse
> > tree will be big, isn't it?
>
> The terms query parser is far more efficient than a simple boolean "OR"
> search with the same number of terms.  It is highly recommended for use
> cases like you have described.
>
> The default maxBooleanClauses limit that Lucene enforces on boolean
> queries is 1024 ... but this is an arbitrary value.  The limit was
> designed as a way to prevent massive queries from running when it wasn't
> truly intended for such queries to have been created in the first place.
>   It is common for users to increase the default limit.
>
> You're probably going to want to send your queries as POST requests,
> because those have a 2MB default body-size restriction, which can be
> increased.  GET requests are limited by the HTTP header size
> restriction, which defaults 8192 bytes on all web server implementations
> I have checked, including the one that's included with Solr.  Increasing
> that is possible, but not recommended ... especially to the sizes you
> would need for the queries you have described.
>
> Thanks,
> Shawn



-- 
Warm Regards,

Lucky Sharma
Contact No :+91 9821559918

Reply via email to