On Jan 24, 2011, at 2:42 PM, Salman Akram wrote:

> Hi,
> 
> Does anyone have any benchmarks how much highlighting speeds up with Term
> Vectors (compared to without it)? e.g. if highlighting on 20 documents take
> 1 sec with Term Vectors any idea how long it will take without them?
> 
> I need to know since the index used for highlighting has a TVF file of
> around 450GB (approx 65% of total index size) so I am trying to see whether
> the decreasing the index size by dropping TVF would be more helpful for
> performance (less RAM, should be good for I/O too I guess) or keeping it is
> still better?
> 
> I know the best way is try it out but indexing takes a very long time so
> trying to see whether its even worthy or not.


Try testing on a smaller set.  In general, you are saving the process of 
re-analyzing the content, so, to some extent it is going to be dependent on how 
fast your analyzer chain is.  At the size you are at, I don't know if storing 
TVs is worth it.

Reply via email to