Fer-Bj schrieb:
for all the documents we have a field called "small_body" , which is a
60 chars max text field that were we store the "abstract" for each
article.

we need to display this small_body we want to compress every time.

If this works like compressing individual files, the overhead for just
60 characters (which may be no more than 60 bytes) may mean that any
attempt at compression results in inflation.

On the other hand, if lower-level units (pages) are compressed (as
opposed to individual fields), then I don't know what sense a
configurable compression threshold might make.

Maybe one of the pros can clarify.

Last question: what's the best way to determine the compress
threshold ?

One fairly obvious way would be to index the same set of documents
twice, with compression and then without, and then to compare the index
size on disk. If you don't save, say, five or ten percent (YMMV), it
might not be worth the effort.

Michael Ludwig

Reply via email to