9aman commented on code in PR #14920: URL: https://github.com/apache/pinot/pull/14920#discussion_r1938922613
########## pinot-controller/src/main/java/org/apache/pinot/controller/helix/core/realtime/PinotLLCRealtimeSegmentManager.java: ########## @@ -2096,6 +2104,137 @@ URI createSegmentPath(String rawTableName, String segmentName) { return URIUtils.getUri(_controllerConf.getDataDir(), rawTableName, URIUtils.encode(segmentName)); } + /** + * Re-ingests segments that are in ERROR state in EV but ONLINE in IS with no peer copy on any server. This method + * will call the server reIngestSegment API + * on one of the alive servers that are supposed to host that segment according to IdealState. + * + * API signature: + * POST http://[serverURL]/reIngestSegment + * Request body (JSON): + * { + * "tableNameWithType": [tableName], + * "segmentName": [segmentName] + * } + * + * @param tableNameWithType The table name with type, e.g. "myTable_REALTIME" + */ + public void reIngestSegmentsWithErrorState(String tableNameWithType) { + // Step 1: Fetch the ExternalView and all segments + ExternalView externalView = getExternalView(tableNameWithType); + IdealState idealState = getIdealState(tableNameWithType); + Map<String, Map<String, String>> segmentToInstanceCurrentStateMap = externalView.getRecord().getMapFields(); + Map<String, Map<String, String>> segmentToInstanceIdealStateMap = idealState.getRecord().getMapFields(); + + // find segments in ERROR state in externalView + List<String> segmentsInErrorState = new ArrayList<>(); + for (Map.Entry<String, Map<String, String>> entry : segmentToInstanceCurrentStateMap.entrySet()) { + String segmentName = entry.getKey(); + Map<String, String> instanceStateMap = entry.getValue(); + boolean allReplicasInError = true; + for (String state : instanceStateMap.values()) { + if (!SegmentStateModel.ERROR.equals(state)) { + allReplicasInError = false; + break; + } + } + if (allReplicasInError) { + segmentsInErrorState.add(segmentName); + } + } + + if (segmentsInErrorState.isEmpty()) { + LOGGER.info("No segments found in ERROR state for table {}", tableNameWithType); + return; + } + + // filter out segments that are not ONLINE in IdealState + for (String segmentName : segmentsInErrorState) { Review Comment: Want to understand the scenario when a segment is marked ONLINE in the IS for one of the servers while not for others. What happens to the IS when a server is disabled ? Is it removed from the IS or marked OFFLINE ? What would happen to the IS in this case ? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org For additional commands, e-mail: commits-h...@pinot.apache.org