xy720 opened a new pull request #4163:
URL: https://github.com/apache/incubator-doris/pull/4163


   ## Proposed changes
   
   Please see the main description in issue #4101 
   
   ### Resume
   When users use spark load, they have to upload the dependent jars to hdfs 
every time.
   This cl will add a self-generated repository under working_dir folder in 
hdfs for saving dependecies of spark dpp programe and spark platform.
   Note that, the dependcies we upload to repository include:
   1、`spark-dpp.jar`
   2、`spark2x.zip`
   1 is the dpp library which built with spark-dpp submodule. See details about 
spark-dpp submodule in pr #4146 .
   2 is the spark2.x.x platform library which contains all jars in 
$SPARK_HOME/jars
   
   **The repository structure** will be like this:
   
   ```
   __spark_repository__/
       |-__archive_1_0_0/
       |        |-__lib_990325d2c0d1d5e45bf675e54e44fb16_spark-dpp.jar
       |        |-__lib_7670c29daf535efe3c9b923f778f61fc_spark-2x.zip
       |-__archive_2_2_0/
       |        |-__lib_64d5696f99c379af2bee28c1c84271d5_spark-dpp.jar
       |        |-__lib_1bbb74bb6b264a270bc7fca3e964160f_spark-2x.zip
       |-__archive_3_2_0/
       |        |-...
   ```
   
   The followinng conditions will force fe to upload dependencies:
   1、When fe find its dppVersion is absent in repository.
   2、The MD5 value of remote file does not match the local file.
   Before Fe uploads the dependencies, it will create an archive directory with 
name `__archive_{dppVersion}` under the repository.
   
   ## Types of changes
   
   - [ ] Bugfix (non-breaking change which fixes an issue)
   - [x] New feature (non-breaking change which adds functionality)
   - [ ] Breaking change (fix or feature that would cause existing 
functionality to not work as expected)
   - [ ] Documentation Update (if none of the other choices apply)
   
   ## Checklist
   
   - [x] I have create an issue on [Doris's 
issues](https://github.com/apache/incubator-doris/issues), and have described 
the bug/feature there in detail
   - [x] Commit messages in my PR start with the related issues ID, like "#4071 
Add pull request template to doris project"
   - [x] Compiling and unit tests pass locally with my changes
   - [ ] I have added tests that prove my fix is effective or that my feature 
works
   - [ ] If this change need a document change, I have updated the document
   - [ ] Any dependent changes have been merged


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org

Reply via email to