kkrugler opened a new issue #6473:
URL: https://github.com/apache/incubator-pinot/issues/6473


   Currently a text column can be created without any forward index, which is 
useful when using the column only for filtering. In this situation, the raw 
(original) text data is not needed, only the text index (see 
https://github.com/apache/incubator-pinot/pull/6284/).
   
   There are other situations for non-text columns where this same 
functionality is useful to reduce the size of the column. In our particular use 
case, we're generating unique terms for a (large) string field, which we save 
as a multi-value STRING column. We need an inverted index for fast filtering, 
but we do not need the forward index, which (leaving aside the inverted index, 
which is built at load time) accounts for more than 80% of the total segment 
size.
   
   @kishoreg suggested "having a empty forward Index reader impl" as a way of 
implementing this.
   
   We could possible handle the configuration of this via a new 
`noFwdIndexColumns` table config field, similar to the `noDictionaryColumns` 
config setting.
   
   There would be situations where specifying no forward index for a column 
would trigger a table config error, for example doing this for a metrics column 
(or so I assume).
   
   I'm also not sure whether it would be valid to have a column that has no 
index/dictionary/forward index; does this mean ignore the field in the input 
data?


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org
For additional commands, e-mail: commits-h...@pinot.apache.org

Reply via email to