Yeah, the first thing I thought of was that perhaps there was something wrong with the uniqueKey and the keys were clashing between the indexes. However, upon visual inspection of the data, the field we are using as the unique key in each index is grossly different between the two databases, so there is no chance of them clashing.
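Since a visual inspection could miss a stray duplicate in indexes of this size, the check could also be done programmatically. Below is a minimal sketch, assuming the uniqueKey values of each index have already been exported as plain strings (how they are exported is left open, and the function name `overlapping_keys` is just an illustrative choice, not anything from Solr):

```python
from typing import Iterable, Set


def overlapping_keys(keys_a: Iterable[str], keys_b: Iterable[str]) -> Set[str]:
    """Return the uniqueKey values that appear in both indexes."""
    # Strip whitespace so trailing newlines from a file export do not
    # hide a genuine match; skip empty lines entirely.
    set_a = {k.strip() for k in keys_a if k.strip()}
    set_b = {k.strip() for k in keys_b if k.strip()}
    # Any key in the intersection exists in both shards.
    return set_a & set_b
```

An empty result would back up the visual inspection; a non-empty one would list exactly which keys are shared between the two indexes.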
Unfortunately, I cannot provide the data needed to reproduce this, but I will try to produce a set of sample data that reproduces the problem. I must add that when we were testing the shards feature on smaller sets of data (< 100,000 docs per index), we did not notice this issue, but once we fully filled each index (> 1,000,000 docs per index), it became more apparent. This is not to say that the issue wasn't there before; we just never noticed it.

On Monday, I will provide some configuration information and see if that helps.

Yonik Seeley-2 wrote:
>
> Certainly does seem strange.
> Do you have the same uniqueKeyField in both indexes?
> Any way you can provide some configuration and some data to reproduce
> this?
>
> -Yonik
>
> On Fri, May 15, 2009 at 10:40 AM, CB-PO <charles.bush...@gmail.com> wrote:
>>
>> Hello,
>> What we have done is create multiple Solr instances on the same server,
>> where each instance is populated with the DataImportHandler from a
>> different DB. The information in each DB is similar, so the schemas for
>> each instance are pretty much the same. Our goal is to use the shards
>> feature to combine the results into a single table.
>>
>> The problem is that when we use shards, the "numFound" is acting very
>> strangely. Here are some examples:
>>
>> 2 Solr instances:
>> localhost:8080/alpha/
>> localhost:8080/bravo/
>>
>> Let's say I'm searching for the term "fred". If I do:
>>
>> localhost:8080/alpha/select?q=fred&rows=10&start=0
>> I get numFound="0". That's fine.
>>
>> localhost:8080/bravo/select?q=fred&rows=10&start=0
>> I get: <result name="response" numFound="27" start="0"> followed by 10
>> <doc></doc>'s. This is also fine.
>>
>> When I do these [same result for both]:
>> localhost:8080/alpha/select?q=fred&rows=10&start=0&shards=localhost:8080/alpha,localhost:8080/bravo
>> localhost:8080/bravo/select?q=fred&rows=10&start=0&shards=localhost:8080/alpha,localhost:8080/bravo
>>
>> I get: <result name="response" numFound="18" start="0"> followed by 1
>> <doc></doc>
>>
>> So... something weird happened... There should be 27 results, but even
>> if it thought there were only 18 results, it should have displayed 10 of
>> them.
>>
>> Alright, so I tried:
>>
>> localhost:8080/alpha/select?q=fred&rows=1&start=0&shards=localhost:8080/alpha,localhost:8080/bravo
>> localhost:8080/bravo/select?q=fred&rows=1&start=0&shards=localhost:8080/alpha,localhost:8080/bravo
>>
>> I got: <result name="response" numFound="27" start="0"> followed by 1
>> <doc></doc>
>> Seems to be working alright with this... But let's try...
>>
>> localhost:8080/alpha/select?q=fred&rows=1&start=1&shards=localhost:8080/alpha,localhost:8080/bravo
>> localhost:8080/bravo/select?q=fred&rows=1&start=1&shards=localhost:8080/alpha,localhost:8080/bravo
>>
>> I got: <result name="response" numFound="26" start="1"> with no
>> <doc></doc>'s... wtf?
>>
>> I continued this up to start=10, and numFound decreased by 1 every time,
>> with no more <doc></doc>'s.
>> So I changed it to rows=100&start=0 and I got: <result name="response"
>> numFound="2" start="0"> followed by 2 <doc></doc>'s.
>>
>> This issue is happening with multiple search queries; however, with some
>> other search queries it works fine and returns the proper number for
>> numFound, and however many <doc>'s there are supposed to be.
>>
>> Has anyone seen this issue before?
>
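For the sample-data reproduction, the comparison described in the quoted message could be scripted rather than done by hand. This is only a rough sketch: the helper names (`build_select_url`, `num_found`, `counts_consistent`) are made up for illustration, fetching the URLs is left out, and the consistency check assumes that no document lives in more than one shard, so the distributed numFound should equal the sum of the per-shard counts.

```python
import xml.etree.ElementTree as ET
from typing import List, Optional
from urllib.parse import urlencode


def build_select_url(core: str, q: str, rows: int, start: int,
                     shards: Optional[List[str]] = None) -> str:
    """Build a select URL like the ones in the quoted message."""
    params = {"q": q, "rows": rows, "start": start}
    if shards:
        # The shards parameter is a comma-separated list of host:port/core.
        params["shards"] = ",".join(shards)
    # Keep ':', '/' and ',' unescaped so the URL matches the hand-typed form.
    return "http://" + core + "/select?" + urlencode(params, safe=":/,")


def num_found(xml_text: str) -> int:
    """Extract numFound from an XML response body."""
    root = ET.fromstring(xml_text)
    result = root.find(".//result[@name='response']")
    if result is None:
        raise ValueError('no <result name="response"> element in response')
    return int(result.attrib["numFound"])


def counts_consistent(per_shard: List[int], distributed: int) -> bool:
    """Assumption: shards share no documents, so the counts should add up."""
    return sum(per_shard) == distributed
```

With the numbers from the quoted message, `counts_consistent([0, 27], 27)` holds for the rows=1&start=0 query, while the rows=10 query (numFound="18") fails the check.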