Re: Don't synchronously wait for already-in-progress IO in read stream

Melanie Plageman Wed, 25 Mar 2026 08:33:38 -0700

On Wed, Mar 25, 2026 at 11:15 AM Andres Freund <[email protected]> wrote:
>
> > > In the attached prototype I replaced your patch introducing
> > > PrepareHeadBufferReadIO()/ PrepareAdditionalBufferReadIO() with one that
> > > instead revises StartBufferIO() to have the enum return value you 
> > > introduced
> > > and a PgAioWaitRef * argument that callers that would like to 
> > > asynchronously
> > > wait for the IO to complete (others pass NULL). There are some other 
> > > cleanups
> > > in it too, see the commit message for more details.
> >
> > I've come around to this. The aspect I like least is that io_wref is
> > used both as an output parameter _and_ a decision input parameter.
>
> Do you have an alternative suggestion? We could add a dedicated parameter for
> that, but then it just opens up different ways of calling the function with
> the wrong arguments.


Yea, I don't know. The cleanest way to handle callers with different
intentions and tolerances is to have different functions that allow
different behavior. But that's the opposite of what we're currently
trying to do :) I tried to come up with some intention-related enum
input argument, but that seems like a bit of over-engineering.

> > Some callers never want to wait on in-progress IO (and don't have to),
> > while others must eventually wait but can defer that waiting as long
> > as they have a wait reference. If they can't get a wait reference,
> > they have no way to wait later, so they must wait now. The presence of
> > io_wref indicates this difference.
> >
> > I think it's important to express that less mechanically than in your
> > current header comment and comment in the else block of
> > StartSharedBufferIO() where we do the waiting. Explaining first—before
> > detailing argument combinations—why a caller would want to pass
> > io_wref might help.
>
> I'm not entirely sure what you'd like here.  Would the following comment
> do the trick?
>
>  * In several scenarios the buffer may already be undergoing I/O in this or
>  * another backend. How to best handle that depends on the caller's
>  * situation. It might be appropriate to wait synchronously (e.g., because the
>  * buffer is about to be invalidated); wait asynchronously, using the buffer's
>  * IO wait reference (e.g., because the caller is doing readahead and doesn't
>  * need the buffer to be ready immediately); or to not wait at all (e.g.,
>  * because the caller is trying to combine IO for this buffer with another
>  * buffer).
>  *
>  * How and whether to wait is controlled by the wait in io_wref parameters. In
>  * detail:
>  *
>  * <existing comment>

Sounds good to me.

> > However, I do think you need to enumerate the different combinations
> > of wait and io_wref (as you've done) to make it clear what they are.
> >
> > I, for example, find it very confusing what wait == false and io_wref
> > not NULL would mean. If IO is in progress on the buffer and the
> > io_wref is not valid yet, the caller would get the expected
> > BUFFER_IO_IN_PROGRESS return value but io_wref won't be set. I could
> > see callers easily misinterpreting the API and passing this
> > combination when what they want is wait == true and io_wref not NULL
> > -- because they don't want to synchronously wait.
>
> Hm. I started out proposing that we should just add an assert documenting this
> is a nonsensical combination. But when writing the comment for that I realized
> that it theoretically could make sense to pass in wait == false and io_wref !=
> NULL, if you wanted to get a wait reference, but would not want to do a
> WaitIO() if there's no wait reference set.
>
> I don't think that's something we need right now, but ...

We should somehow express this in the comment.

- Melanie

Re: Don't synchronously wait for already-in-progress IO in read stream

Reply via email to