Fer-Bj schrieb:
for all the documents we have a field called "small_body" , which is a 60 chars max text field that were we store the "abstract" for each article.
we need to display this small_body we want to compress every time.
If this works like compressing individual files, the overhead for just 60 characters (which may be no more than 60 bytes) may mean that any attempt at compression results in inflation. On the other hand, if lower-level units (pages) are compressed (as opposed to individual fields), then I don't know what sense a configurable compression threshold might make. Maybe one of the pros can clarify.
Last question: what's the best way to determine the compress threshold ?
One fairly obvious way would be to index the same set of documents twice, with compression and then without, and then to compare the index size on disk. If you don't save, say, five or ten percent (YMMV), it might not be worth the effort. Michael Ludwig