amarnathkarthik commented on issue #6349:
URL: 
https://github.com/apache/incubator-pinot/issues/6349#issuecomment-749209495


   Here is my analysis. I looked at `LaunchDataIngestionJobCommand.java` and 
all of its job runners (segment generation and push jobs). I did not see any 
implementation that cleans up `outputDirURI` after the push, but I do see the 
tempDir being cleaned up after the generated segments are copied to 
`outputDirURI`.
   
   @kishoreg @plaisted We have two options; let me know which one would be 
appropriate:
   
   1. **Backward compatible** - Introduce a new property, 
`_cleanupOutputAfterPush`, in `SegmentGenerationJobSpec` to remove segments 
from `outputDirURI` once the push call finishes successfully. 
`cleanupOutputAfterPush` would be configurable through the YAML job spec and 
default to `false`.
   2. **Cleanup by default** - Enhance the push job to remove the segments from 
`outputDirURI` once the push call finishes successfully.
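   
   For option 1, the job spec could look roughly like the sketch below. 
`cleanupOutputAfterPush` is the proposed property and does not exist today; the 
paths are placeholders, and the other keys are the ones I believe the ingestion 
job spec already accepts.

```yaml
# Sketch only: cleanupOutputAfterPush is the PROPOSED property, not an existing one.
jobType: SegmentCreationAndTarPush
inputDirURI: '/path/to/input'      # placeholder path
outputDirURI: '/path/to/output'    # placeholder path
overwriteOutput: true
# Proposed: when true, delete the generated segments under outputDirURI
# after the push succeeds. Defaults to false to stay backward compatible.
cleanupOutputAfterPush: false
```

   Leaving the property out would keep today's behavior, so existing job specs 
would not need to change.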
   
   Note: 
   Neither the Spark nor the Hadoop push job supports cleanup today.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


