On 12/20/2017 6:09 PM, S G wrote:
One of our Solr users is trying to set docValues="true" for multivalued
string fields and boolean-type fields.

I am not sure what the performance impact of that would be.
Can docValues negatively affect performance in any way?

Adding to what Emir said:

The docValues data will be the same as stored data, but it will be uncompressed, and written in such a way that Lucene can read all values for one field simply by reading data off the disk, no computations or seeks within the file are required.

If the field is indexed and stored, then docValues will not be accessed during normal queries unless there is a sort parameter or a facet parameter that mentions a field with docValues. If present, docValues data will be used for sorting and facets, otherwise indexed values will be used. Usually, sorting or facets with docValues uses less memory and performs faster than the same operation without docValues. If the machine has insufficient system RAM to effectively cache index data, the performance may not improve.

When docValues is added to a field, a complete reindex is required, or Solr will not work properly.

If a field that already contains docValues has a change in the setting for multiValued, then that will require a reindex, but you must also take another step -- completely wiping the index directory before reloading or restarting. If the wipe doesn't happen in this situation, then the core is going to completely break and throw exceptions.

Thanks,
Shawn

Reply via email to