On Jan 24, 2011, at 2:42 PM, Salman Akram wrote: > Hi, > > Does anyone have any benchmarks how much highlighting speeds up with Term > Vectors (compared to without it)? e.g. if highlighting on 20 documents take > 1 sec with Term Vectors any idea how long it will take without them? > > I need to know since the index used for highlighting has a TVF file of > around 450GB (approx 65% of total index size) so I am trying to see whether > the decreasing the index size by dropping TVF would be more helpful for > performance (less RAM, should be good for I/O too I guess) or keeping it is > still better? > > I know the best way is try it out but indexing takes a very long time so > trying to see whether its even worthy or not.
Try testing on a smaller set. In general, you are saving the process of re-analyzing the content, so, to some extent it is going to be dependent on how fast your analyzer chain is. At the size you are at, I don't know if storing TVs is worth it.