Heyw opened a new issue, #17285: URL: https://github.com/apache/dolphinscheduler/issues/17285
### Search before asking

- [x] I had searched in the [issues](https://github.com/apache/dolphinscheduler/issues?q=is%3Aissue) and found no similar issues.

### What happened

A workflow instance (id=415) stayed in the running state for more than ten hours after fault-tolerance recovery. Here is its state_histories:

[{"time":"2025-06-18 22:40:23","state":"RUNNING_EXECUTION","desc":"init running"},
{"time":"2025-06-18 22:40:23","state":"RUNNING_EXECUTION","desc":"start a new process"},
{"time":"2025-06-18 22:40:23","state":"SERIAL_WAIT","desc":"wait by serial_wait strategy"},
{"time":"2025-06-18 22:40:23","state":"RUNNING_EXECUTION","desc":"submit from serial_discard strategy"},
{"time":"2025-06-18 22:45:32","state":"RUNNING_EXECUTION","desc":"recover tolerance fault process"},
{"time":"2025-06-18 22:45:32","state":"SERIAL_WAIT","desc":"wait by serial_wait strategy"},
{"time":"2025-06-18 22:45:32","state":"RUNNING_EXECUTION","desc":"submit from serial_discard strategy"},
{"time":"2025-06-19 13:46:32","state":"RUNNING_EXECUTION","desc":"recover tolerance fault process"},
{"time":"2025-06-19 13:46:32","state":"SERIAL_WAIT","desc":"wait by serial_wait strategy"},
{"time":"2025-06-19 13:46:32","state":"RUNNING_EXECUTION","desc":"submit from serial_discard strategy"},
{"time":"2025-06-19 13:46:32","state":"SUCCESS","desc":"update by workflow executor"}]

### What you expected to happen

WorkflowExecuteRunnable may mishandle a retryable failed task instance: it puts the task instance into both readyToSubmitTaskQueue and waitToRetryTaskInstanceMap. As a result, when the task instance completes, it stays in waitToRetryTaskInstanceMap indefinitely and is never retried.

### How to reproduce

none

### Anything else

_No response_

### Version

3.1.x

### Are you willing to submit PR?

- [x] Yes I am willing to submit a PR!
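
The failure mode suspected under "What you expected to happen" can be sketched with a toy model. This is not the actual WorkflowExecuteRunnable code: only the field names readyToSubmitTaskQueue and waitToRetryTaskInstanceMap come from the report, while the class `RetryBookkeepingSketch`, its methods, and the completion check in `canFinish` are hypothetical simplifications used to show how a leftover map entry might keep a workflow from finishing.

```java
import java.util.ArrayDeque;
import java.util.HashMap;
import java.util.Map;
import java.util.Queue;

// Toy model only; class and method names are hypothetical, not DolphinScheduler APIs.
public class RetryBookkeepingSketch {
    // Field names mirror those mentioned in the report.
    final Queue<Long> readyToSubmitTaskQueue = new ArrayDeque<>();
    final Map<Long, String> waitToRetryTaskInstanceMap = new HashMap<>();

    /** Records a retryable failure in BOTH structures, as the report suspects. */
    public void onRetryableFailure(long taskCode) {
        readyToSubmitTaskQueue.add(taskCode);
        waitToRetryTaskInstanceMap.put(taskCode, "waiting-for-retry");
    }

    /**
     * Takes the next queued task and treats it as completed.
     * The retry-map entry is removed only when clearRetryEntry is true.
     */
    public void submitAndComplete(boolean clearRetryEntry) {
        Long taskCode = readyToSubmitTaskQueue.poll();
        if (taskCode != null && clearRetryEntry) {
            waitToRetryTaskInstanceMap.remove(taskCode);
        }
    }

    /** Assumed completion check: nothing queued and nothing waiting for retry. */
    public boolean canFinish() {
        return readyToSubmitTaskQueue.isEmpty() && waitToRetryTaskInstanceMap.isEmpty();
    }

    public static void main(String[] args) {
        // Suspected behaviour: the entry is never cleared after completion.
        RetryBookkeepingSketch stale = new RetryBookkeepingSketch();
        stale.onRetryableFailure(1L);
        stale.submitAndComplete(false);
        System.out.println("stale entry, canFinish = " + stale.canFinish());

        // Possible remedy: clear the entry when the task completes.
        RetryBookkeepingSketch cleared = new RetryBookkeepingSketch();
        cleared.onRetryableFailure(1L);
        cleared.submitAndComplete(true);
        System.out.println("cleared entry, canFinish = " + cleared.canFinish());
    }
}
```

In this model the first scenario leaves task 1 in waitToRetryTaskInstanceMap after the queue has drained, so `canFinish` stays false. Whether the real scheduler gates completion on this map in the same way is an assumption that would need to be checked against the 3.1.x source.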
### Code of Conduct

- [x] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct)

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at: [email protected]
