We have a fairly large corpus (~94 million documents) and would like to 
be able to sort on one of the string fields. However, this takes an 
incredibly long time: a warming query for that field takes about 20 minutes.

Most of the time, though, the result sets are small because we use filters 
heavily: a typical result set is between 2 and 100 documents.

Yet sorting on the string field is still very, very slow.

Now, as I understand it, sorting on a field requires populating the 
FieldCache with that field's value for every document in the index, no 
matter how many documents actually match the query.

Is there any way round that? Is there any way to say "just sort the 
matched documents"?

We can probably work round this by sorting in application space, but I 
wanted to double-check that I'm not missing anything before I implement 
that.
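
For reference, the application-space sort I have in mind would look roughly 
like the sketch below. It assumes the sort field is stored, so its value can 
be read back for each of the few matched documents. Hit and sortHits are 
hypothetical names of my own, not part of the Lucene API, and how the hits 
are collected from the search is left out.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

public class AppSideSort {

    // Hypothetical holder for one matched document: its id plus the stored
    // value of the string field we want to sort on (may be null if missing).
    public static final class Hit {
        public final int docId;
        public final String sortValue;

        public Hit(int docId, String sortValue) {
            this.docId = docId;
            this.sortValue = sortValue;
        }
    }

    // Sorts only the matched documents, so the cost depends on the size of
    // the result set (2 to 100 here), not on the size of the corpus.
    public static List<Hit> sortHits(List<Hit> hits) {
        // Order by field value, putting documents without a value last.
        Comparator<Hit> byValue = Comparator.comparing(
                (Hit h) -> h.sortValue,
                Comparator.nullsLast(Comparator.<String>naturalOrder()));
        // Break ties by document id so the order is deterministic.
        Comparator<Hit> order = byValue.thenComparingInt(h -> h.docId);

        List<Hit> sorted = new ArrayList<>(hits); // leave the input untouched
        sorted.sort(order);
        return sorted;
    }

    public static void main(String[] args) {
        List<Hit> hits = Arrays.asList(
                new Hit(7, "pear"),
                new Hit(3, null),
                new Hit(12, "apple"),
                new Hit(5, "pear"));
        for (Hit h : sortHits(hits)) {
            System.out.println(h.docId + " " + h.sortValue);
        }
    }
}
```

The obvious limitation is that this only works if all matching documents are 
fetched before sorting, which is fine for result sets this small but would 
not scale to large ones.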

thanks,

Simon

