elvestar opened a new pull request, #30809: URL: https://github.com/apache/doris/pull/30809
## Proposed changes Issue Number: close #xxx We have a large amount of data stored in HDFS in GZIP-compressed JSON format, and currently Doris does not support loading compressed JSON format data. Upgrade points: 1. Upgraded the NewJsonReader (in new_json_reader.cpp) to construct the _decompressor during its initialization based on the compression type specified during load, and then use the _decompressor to initialize the LineReader. 2. Refactored the _create_decompressor() method of CsvReader by extracting the code that constructs the _decompressor based on _file_compress_type or _file_format_type. This code was moved to the Decompressor class as a factory method create_decompressor(). This allows the code to be reused by NewJsonReader. ## Further comments If this is a relatively large or complex change, kick off the discussion at [d...@doris.apache.org](mailto:d...@doris.apache.org) by explaining why you chose the solution you did and what alternatives you considered, etc... -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org For additional commands, e-mail: commits-h...@doris.apache.org