Hi,

OK, if SOLR-2403 being related to the bug I described, has been fixed in
SOLR 3.4 than we are safe, since we are in the process of migration. Is it
possible to verify this somehow? Is FacetComponent class is the one I should
start checking this from? Can you give any other pointers?

OK, for the use case. Two things:

1. Regarding your confusion on docid and facets being sometimes 2: without
revealing the business details, I can tell you that we split our incoming
documents in parts and store them as separate documents in solr. But we use
common docid for all of them in order to restore the original document when
needed. However we would like to facet on them in normal solr facet meaning.
2. We have shards, because of the volume of data. As previously said, we use
logical sharding. Since each shard belong to a certain period of time (non
intersecting with other shards), we know in advance that a user query most
likely hits only specific shards. So we didn't want other shards to search
in vain and solr merger waiting for them. We implemented a router by
extending solr 1.4 source code. The router operates with 15 shards
currently.

Let me know, if I can help with more details on our use case.

Dmitry

On Wed, Sep 21, 2011 at 8:56 AM, Chris Hostetter
<hossman_luc...@fucit.org>wrote:

>
> : with the setup you describe, there's no why i can imagine executing a
> : search that results in constraints being returned that come from multiple
> : shards with some constraints being "missing" from the middle of hte list,
> : near the border of values for that field that signify a change in shard.
>
> I take thta back ... after replying to your email i noticed the "1.4" in
> the subject, and it occured to me there might have been a bug in 1.4 that
> was since fixed.  doing a quick search i realized there *is* an open bug
> that relates to your problem that i just wasn't aware of...
>
> https://issues.apache.org/jira/browse/SOLR-2403
>
> there's some discussion in there that explains how the problem is
> happening -- the crux being that becuase the mincount can't be checked
> until the per-shard requests are merged, it's possible to miss some values
> when doing index (aka: "lex") ordering.
>
> the bad news is, a general fix sounds kind of hard.  the good news is,
> that for mincount=1, the solution seems pretty straight forward -- we just
> need someone to try working up a patch and some test cases.
>
> FWIW: I'd still like to here more about your usecase, because i still
> think there might be a better alternative...
>
> : Furthermore: what you are doing is a *really* wacky use of faceting ... i
> : have honestly never seen anything like it, hence my question about the
> : significance of the "docid1" and "docid2" in your example field values --
> : can you elaborate on what these values mean, and how you are ultimately
> : using the facet results you get back?  because i am seriously curious as
> : to your use case, and more then a little suspicios that there might be a
> : simpler and more efficient way to solve whatever use case you have...
> :
> : https://people.apache.org/~hossman/#xyproblem
> : XY Problem
> :
> : Your question appears to be an "XY Problem" ... that is: you are dealing
> : with "X", you are assuming "Y" will help you, and you are asking about
> "Y"
> : without giving more details about the "X" so that we can understand the
> : full issue.  Perhaps the best solution doesn't involve "Y" at all?
> : See Also: http://www.perlmonks.org/index.pl?node_id=542341
>
>
> -Hoss
>



-- 
Regards,

Dmitry Kan

Reply via email to