Trie has a custom parser that can load the FieldCache for sorting. Its
basically a built in type now, that supports fieldcache, sorting, stored
fields, etc.

On Sat, Jul 4, 2009 at 3:27 PM, Chris Hostetter <hossman_luc...@fucit.org>wrote:

>
> : My data are library call numbers, normalized to be comparable, resulting
> in
> : (maximum) 21-character strings of the form "RK 052180H359~999~999"
> :
> : Now, these are fine -- they work for sorting and ranges and the whole
> thing,
> : but right now I can't use them because I've got two or three for each of
> my
> : 6M documents and on a 32-bit machine I run out of heap.
> :
> : Another option would be to turn them into longs (using roughly 56 bits of
> : the 64 bit space) and use a trie type. Is there any sort of a win
> involved
> : there?
>
> I don't think Trie fields can be used for sorting (because they result in
> multiple terms per doc) but i could be wrong about that, smarter people
> then me may have done something cool with the TreiField that i'm not aware
> of.
>
> As a general rule: if you have character data that fits a rigid enough set
> of constraints that you can encode any legal value into a single
> numberic value (either int, or long) such that they still sort properly,
> sorting on those encoded values is going to be more memory efficient (and
> probably just as fast) as sorting on the string values.
>
>
> -Hoss
>
>


-- 
-- 
- Mark

http://www.lucidimagination.com

Reply via email to