lnbest0707-uber opened a new pull request, #12779: URL: https://github.com/apache/pinot/pull/12779
tags: `feature`, `refactor`, `release-notes` This adds an evolved version of ShemaConformingTransformerV2, it evolves from the existing one with following new features: 1. Refactored code with better readability and extensibility 2. Support over-lapping schema fields, in which case it could support schema column "a" and "a.b" at the same time. And it only allows primitive type fields to be the value. 3. Extract flattened key-value pairs as mergedTextIndex for better text searching. 4. Add shingle index tokenization functionality for extremely large text fields. 5. Add flexibility to map json extracted field name to meaningful user specified column name 6. Improve serialization logics to handle nested json fields 7. Enforce graceful handling on extracted String type column. Will convert collection or array to String if column type is singleField. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org For additional commands, e-mail: commits-h...@pinot.apache.org