elvestar opened a new pull request, #30809:
URL: https://github.com/apache/doris/pull/30809

   ## Proposed changes
   
   Issue Number: close #xxx
   
   We have a large amount of data stored in HDFS in GZIP-compressed JSON 
format, and currently Doris does not support loading compressed JSON format 
data.
   
   Upgrade points:
   1. Upgraded the NewJsonReader (in new_json_reader.cpp) to construct the 
_decompressor during its initialization based on the compression type specified 
during load, and then use the _decompressor to initialize the LineReader.
   2. Refactored the _create_decompressor() method of CsvReader by extracting 
the code that constructs the _decompressor based on _file_compress_type or 
_file_format_type. This code was moved to the Decompressor class as a factory 
method create_decompressor(). This allows the code to be reused by 
NewJsonReader.
   
   ## Further comments
   
   If this is a relatively large or complex change, kick off the discussion at 
[d...@doris.apache.org](mailto:d...@doris.apache.org) by explaining why you 
chose the solution you did and what alternatives you considered, etc...
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org

Reply via email to