Keith Turner created ACCUMULO-4314:
--------------------------------------

             Summary: Use statistics to choose better keys for RFile index
                 Key: ACCUMULO-4314
                 URL: https://issues.apache.org/jira/browse/ACCUMULO-4314
             Project: Accumulo
          Issue Type: Improvement
            Reporter: Keith Turner
            Assignee: Keith Turner
             Fix For: 1.6.6, 1.7.2


The commit for ACCUMULO-1124 makes two changes :
  * Generates shorter keys that may not exist in data to place in RFile index
  * Use statistics to make better choices about what keys to place in index.  
These changes look for keys that are average or below and excludes large keys 
(keys that are > 3 std dev).

The change to generate shorter keys can not be made in 1.7.X and 1.6.X because 
it would generate RFiles that may not work properly with older 1.6 and 1.7 
versions.   However the changes to use statistics to pick better keys could be 
made in 1.6 and 1.7. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to