Jackie-Jiang commented on a change in pull request #7222:
URL: https://github.com/apache/pinot/pull/7222#discussion_r681344491



##########
File path: 
pinot-plugins/pinot-batch-ingestion/pinot-batch-ingestion-common/src/main/java/org/apache/pinot/plugin/ingestion/batch/common/SegmentGenerationTaskRunner.java
##########
@@ -149,6 +155,9 @@ private SegmentNameGenerator getSegmentNameGenerator() {
             
Boolean.parseBoolean(segmentNameGeneratorConfigs.get(EXCLUDE_SEQUENCE_ID)),
             IngestionConfigUtils.getBatchSegmentIngestionType(tableConfig),
             
IngestionConfigUtils.getBatchSegmentIngestionFrequency(tableConfig), 
dateTimeFormatSpec);
+      case INPUT_FILE_SEGMENT_NAME_GENERATOR:
+        return new 
InputFileSegmentNameGenerator(segmentNameGeneratorConfigs.get(FILE_PATH_PATTERN),

Review comment:
       My concern here is that in several use cases (realtime to offline, 
segment merge etc.), the segments are not generated from input files, but from 
other segments. Modeling input file path as a parameter in 
`SegmentNameGenerator.generateSegmentName()` can cause issues for these cases. 
Seems generating segment name from input path only apply to 
`SegmentGenerationTaskRunner` so I would suggest resolving the segment name 
there.
   But this cannot solve the local copy change path problem. Another approach 
would be directly setting the table name within the `SegmentNameGeneratorSpec` 
before copying the remote file to local




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org
For additional commands, e-mail: commits-h...@pinot.apache.org

Reply via email to