lnbest0707-uber opened a new pull request, #12779:
URL: https://github.com/apache/pinot/pull/12779

   tags: `feature`, `refactor`, `release-notes`
    
   This adds an evolved version of ShemaConformingTransformerV2, it evolves 
from the existing one with following new features:
   
   1. Refactored code with better readability and extensibility
   2. Support over-lapping schema fields, in which case it could support schema 
column "a" and "a.b" at the same time. And it only allows primitive type fields 
to be the value.
   3. Extract flattened key-value pairs as mergedTextIndex for better text 
searching.
   4. Add shingle index tokenization functionality for extremely large text 
fields.
   5. Add flexibility to map json extracted field name to meaningful user 
specified column name
   6. Improve serialization logics to handle nested json fields
   7. Enforce graceful handling on extracted String type column. Will convert 
collection or array to String if column type is singleField.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org
For additional commands, e-mail: commits-h...@pinot.apache.org

Reply via email to