sylph-eu opened a new pull request, #11226:
URL: https://github.com/apache/pinot/pull/11226

   **Context**
   We're running Pinot on Kubernetes in one of the public clouds, with 
`pinot.server.instance.segment.directory.loader=tierBased` and multiple 
tiers, volumes, and data directories.
   
   We've noticed that Pinot's working directory (on ephemeral storage) 
accumulates many real-time segments that, per configuration, should reside in 
the data directory of their tier (on persistent storage). Further 
investigation revealed that `RealtimeTableDataManager` doesn't initialize 
`IndexLoadingConfig` with the table data directory during segment loading, so 
the segment is moved to `<empty>/<segment_dir>`, which ends up as a folder in 
the working directory. This contributes to Pinot instability and longer 
restart times.
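
   The path behaviour described above can be illustrated with a minimal, 
standalone Java sketch. It is not Pinot code: the `resolveSegmentDir` helper, 
the segment name, and the `/var/pinot/tier1/...` directory are hypothetical 
and exist only for illustration. It shows that joining an empty base 
directory with a segment directory yields a relative path, which the JVM then 
resolves against the process working directory.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

public class SegmentDirSketch {
    // Hypothetical helper (not a Pinot API): joins a table data directory
    // and a segment name. A null data directory is treated as empty.
    static Path resolveSegmentDir(String tableDataDir, String segmentName) {
        String base = tableDataDir == null ? "" : tableDataDir;
        // Paths.get ignores empty elements, so an empty base leaves only
        // the segment name, i.e. a relative path.
        return Paths.get(base, segmentName);
    }

    public static void main(String[] args) {
        String segment = "mytable__0__0__20230101T0000Z"; // made-up segment name

        // Data directory not initialized: the result is relative.
        Path unset = resolveSegmentDir(null, segment);
        System.out.println("relative?  " + !unset.isAbsolute());
        // A relative path resolves against the JVM working directory.
        System.out.println("resolved:  " + unset.toAbsolutePath());

        // Data directory initialized: the segment stays under the tier volume.
        Path set = resolveSegmentDir("/var/pinot/tier1/mytable_REALTIME", segment);
        System.out.println("with dir:  " + set);
    }
}
```

   With the data directory set, the resulting path stays under the configured 
tier directory regardless of where the server process was started.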
   
   **How to reproduce**:
   1. Set `pinot.server.instance.segment.directory.loader=tierBased` 
   2. Create a completed real-time segment.
   3. Restart pinot-server and observe that the segment is relocated to the 
working directory.
   
   **Changes**:
   - Initialize `indexLoadingConfig` with the table data directory, the same 
way `BaseTableDataManager.addSegment` does.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

