jamesyfshao opened a new issue #4187: Issue with ingestion data with empty array URL: https://github.com/apache/incubator-pinot/issues/4187 In a recent [change](https://github.com/apache/incubator-pinot/commit/5c296de1c4b91d4cc3b99ee6ccf15c077f9b4dcc) to pinot ingestion workflow, we refactor the PlainFieldExtractor class to the DataTypeTransformer class. However, we noticed that in this change, the decoding of an empty array (zero-length array) column caused an exception in indexing and pinot ingestor will throw away such row (causing data loss). After some investigation, the root cause seems to be with[ an array operation](https://github.com/apache/incubator-pinot/commit/5c296de1c4b91d4cc3b99ee6ccf15c077f9b4dcc#diff-6e075be1e432676f216423d6b086ed22R94) without checking the proper length (compare to [origin design](https://github.com/apache/incubator-pinot/commit/5c296de1c4b91d4cc3b99ee6ccf15c077f9b4dcc#diff-b363445cd0a59e1a271f755d944af31fL200) where we perform such check). I wonder if we have already working on fixing this issue. If not, we can look into fixing this issue
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org With regards, Apache Git Services --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org For additional commands, e-mail: commits-h...@pinot.apache.org