One way to solve the issue may be to create another field to group the value in
a range, so you have fewer facet values to query.
Sent from my iPhone
On Nov 5, 2013, at 4:31 AM, Erick Erickson wrote:
> You're just going to have to accept it being slow. Think of it this way:
> you have
> 4M (sa
You're just going to have to accept it being slow. Think of it this way:
you have
4M (say) buckets that have to be counted into. Then the top 500 have to be
collected to return. That's just going to take some time unless you have
very beefy machines.
I'd _really_ back up and consider whether this
Erick,
It could have more than 4M distinct values. The purpose of this facet is
to display the most frequent, say top 500, urls to users.
Sascha,
Thanks for the info. I will look into this thread thing.
Mingfeng
On Mon, Nov 4, 2013 at 4:47 AM, Erick Erickson wrote:
> How many unique URLs do
How many unique URLs do you have in your 9M
docs? If your 9M hits have 4M distinct URLs, then
this is not very valuable to the user.
Sascha:
Was that speedup on a single field or were you faceting over
multiple fields? Because as I remember that code spins off
threads on a per-field basis, and if
Hi Ming,
which Solr version are you using? In case you use one of the latest
versions (4.5 or above) try the new parameter facet.threads with a
reasonable value (4 to 8 gave me a massive performance speedup when
working with large facets, i.e. nTerms >> 10^7).
-Sascha
Mingfeng Yang wrote:
> I h
I have an index with 170M documents, and two of the fields for each doc is
"source" and "url". And I want to know the top 500 most frequent urls from
Video source.
So I did a facet with
"fq=source:Video&facet=true&facet.field=url&facet.limit=500", and the
matching documents are about 9 millions.