pnowojski opened a new pull request, #27589:
URL: https://github.com/apache/flink/pull/27589

   This change depends on #27440
   
   ## What is the purpose of the change
   
       Befor this change, when watermark alignment is enabled, it can prevent 
backlogged jobs
       from using all available resources. Inadvertently watermark alignment 
configured with
       maxAllowedWatermarkDrift and updateInterval was de facto capping the 
backlog processing
       speed to maxAllowedWatermarkDrift (event time) / updateInterval (wall 
clock). For example
       when maxAllowedWatermarkDrift=30s and updateInterval=1s, backlog could 
not be processed
       faster than 30s (event time) / 1s (wall clock). In that case, if job had 
1 day of records
       to process in the backlog (for example after 24h downtime), this backlog 
could not be
       processed more quickly than in 48 minutes, regardless of available 
resources and number
       of actual records.
       
       This change adds SamplingWatermarksRingBuffer that will hide the latency 
between
       SourceOperators and SourceCoordinator. For more information please look 
into the ticket.
   
   ## Brief change log
   
   please check individual commits
   
   ## Verifying this change
   
   PR adds new unit tests
   
   ## Does this pull request potentially affect one of the following parts:
   
     - Dependencies (does it add or upgrade a dependency): (yes / **no**)
     - The public API, i.e., is any changed class annotated with 
`@Public(Evolving)`: (yes / **no**)
     - The serializers: (yes / **no** / don't know)
     - The runtime per-record code paths (performance sensitive): (yes / **no** 
/ don't know)
     - Anything that affects deployment or recovery: JobManager (and its 
components), Checkpointing, Kubernetes/Yarn, ZooKeeper: (yes / no ****/ don't 
know)
     - The S3 file system connector: (yes / **no** / don't know)
   
   ## Documentation
   
     - Does this pull request introduce a new feature? (yes / no)
     - If yes, how is the feature documented? (not applicable / docs / JavaDocs 
/ not documented)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to