On 2026-06-10 23:17, Iago Toral wrote:
> Hi Maíra,
> 
> I have a couple of thoughts here:
> 
> 1. The original code seems to very specifica about targeting this only
> for the last job, which makes it look like a very intentional decision,
> so I wonder if there is something we are missing here. I am adding Emma
> to the CC in case she has any thoughts about it.
> 
> 2. I am not aware of any issues despite not having implicit sync for
> all the jobs in the chanin. Like you mention, this is because user-
> space is already trying to handle job depedencies, so I wonder if we
> should instead work on the opposite direction and try to drop implicit
> sync in the kernel entirely. Would that make sense? What are other
> drivers doing in this regard?

Yeah, you should be working toward dropping implicit sync entirely. 
tu's VM_BIND is really nice and where you want to be eventually, but
even if not, then you at least want something like
MSM_SUBMIT_NO_IMPLICIT.

I probably just didn't think about this implicit sync case in the
initial implementation, because a bin job dependency on results from
another context is just not a thing that's going to happen in practice. 
The implicit sync was mostly about render targets / texturing to be
ordered correctly with X11, and about lifetime management of the BO
(which only needed the last use).

> As for the patch itself, I have a comment below:
> 
> El mié, 10-06-2026 a las 19:50 -0300, Maíra Canal escribió:
>> A submission can expand into a chain of jobs (e.g. bin + render +
>> cache
>> clean), but v3d_lookup_bos() only looked up the user's BO list onto
>> the
>> *last* job of the submission. Every earlier job was left with
>> bo_count == 0 and an empty bo[] array.
>> 
>> As a consequence, when implicit synchronization happens in
>> v3d_submit_lock_reservations(), earlier jobs get no implicit
>> dependencies at all, as the loop is gated on job->bo_count and
>> earlier
>> jobs don't have any BO attached to them. With that, the BIN job reads
>> the
>> same buffers as the RENDER job, yet nothing made it wait for a prior
>> writer to finish. The BIN job could therefore be dispatched to the
>> hardware and read a BO while another context was still writing it,
>> leading to data corruption that was only avoided as the userspace
>> adds
>> explicit syncobjs.
>> 
>> Fix this by calling v3d_lookup_bos() for each job that references the
>> submission's BOs, so every job carries its own bo[]/bo_count and
>> picks
>> up the correct implicit dependencies during reservation locking.
>> 
>> Fixes: dffa9b7a78c4 ("drm/v3d: Add missing implicit
>> synchronization.")
>> Signed-off-by: Maíra Canal <[email protected]>
>> ---
>>  drivers/gpu/drm/v3d/v3d_submit.c | 41 +++++++++++++++++++++---------
>> ----------
>>  1 file changed, 22 insertions(+), 19 deletions(-)
>> 
>> diff --git a/drivers/gpu/drm/v3d/v3d_submit.c
>> b/drivers/gpu/drm/v3d/v3d_submit.c
>> index ee2ac2540ed5..3d6582dfb1bf 100644
>> --- a/drivers/gpu/drm/v3d/v3d_submit.c
>> +++ b/drivers/gpu/drm/v3d/v3d_submit.c
>> @@ -68,7 +68,6 @@ v3d_submit_unlock_reservations(struct v3d_submit
>> *submit)
>>  /**
>>   * v3d_lookup_bos() - Sets up job->bo[] with the GEM objects
>>   * referenced by the job.
>> - * @dev: DRM device
>>   * @file_priv: DRM file for this fd
>>   * @job: V3D job being set up
>>   * @bo_handles: GEM handles
>> @@ -82,23 +81,19 @@ v3d_submit_unlock_reservations(struct v3d_submit
>> *submit)
>>   * failure, because that will happen at `v3d_job_free()`.
>>   */
>>  static int
>> -v3d_lookup_bos(struct v3d_submit *submit, u64 bo_handles, u32
>> bo_count)
>> +v3d_lookup_bos(struct drm_file *file_priv, struct v3d_job *job,
>> +           u64 bo_handles, u32 bo_count)
>>  {
> 
> Wouldn't it make more sense to have this function take the submit like
> it did originally and loop through all the jobs in a chain instead of
> calling this multiple times for each job in a chain?
> 
>> -    struct v3d_job *last_job = submit->jobs[submit->job_count -
>> 1];
>> -
>> -    last_job->bo_count = bo_count;
>> -
>> -    if (!last_job->bo_count) {
>> -            /* See comment on bo_index for why we have to check
>> -             * this.
>> -             */
>> -            drm_warn(&submit->v3d->drm, "Rendering requires
>> BOs\n");
>> +    if (!bo_count) {
>> +            drm_warn(&job->v3d->drm, "Rendering requires
>> BOs\n");
>>              return -EINVAL;
>>      }
>>  
>> -    return drm_gem_objects_lookup(submit->file_priv,
>> +    job->bo_count = bo_count;
>> +
>> +    return drm_gem_objects_lookup(file_priv,
>>                                    (void __user
>> *)(uintptr_t)bo_handles,
>> -                                  last_job->bo_count, &last_job-
>> >bo);
>> +                                  job->bo_count, &job->bo);
>>  }
>>  
>>  static void
>> @@ -446,7 +441,8 @@ v3d_setup_csd_jobs_and_bos(struct v3d_submit
>> *submit,
>>      if (IS_ERR(clean_job))
>>              return PTR_ERR(clean_job);
>>  
>> -    return v3d_lookup_bos(submit, args->bo_handles, args-
>> >bo_handle_count);
>> +    return v3d_lookup_bos(submit->file_priv, &job->base,
>> +                          args->bo_handles, args-
>> >bo_handle_count);
>>  }
>>  
>>  static void
>> @@ -1066,6 +1062,11 @@ v3d_submit_cl_ioctl(struct drm_device *dev,
>> void *data,
>>                                         &se);
>>              if (ret)
>>                      goto fail;
>> +
>> +            ret = v3d_lookup_bos(submit.file_priv, &bin->base,
>> +                                 args->bo_handles, args-
>> >bo_handle_count);
>> +            if (ret)
>> +                    goto fail;
>>      }
>>  
>>      render = (struct v3d_render_job
>> *)v3d_submit_add_job(&submit, V3D_RENDER);
>> @@ -1085,6 +1086,11 @@ v3d_submit_cl_ioctl(struct drm_device *dev,
>> void *data,
>>      if (ret)
>>              goto fail;
>>  
>> +    ret = v3d_lookup_bos(submit.file_priv, &render->base,
>> +                         args->bo_handles, args-
>> >bo_handle_count);
>> +    if (ret)
>> +            goto fail;
>> +
>>      if (args->flags & DRM_V3D_SUBMIT_CL_FLUSH_CACHE) {
>>              clean_job = v3d_submit_add_job(&submit,
>> V3D_CACHE_CLEAN);
>>              if (IS_ERR(clean_job)) {
>> @@ -1097,10 +1103,6 @@ v3d_submit_cl_ioctl(struct drm_device *dev,
>> void *data,
>>      if (ret)
>>              goto fail;
>>  
>> -    ret = v3d_lookup_bos(&submit, args->bo_handles, args-
>> >bo_handle_count);
>> -    if (ret)
>> -            goto fail;
>> -
>>      ret = v3d_submit_lock_reservations(&submit);
>>      if (ret)
>>              goto fail;
>> @@ -1359,7 +1361,8 @@ v3d_submit_cpu_ioctl(struct drm_device *dev,
>> void *data,
>>       * the CSD and clean jobs in the case of indirect CSD job.
>>       */
>>      if (args->bo_handle_count) {
>> -            ret = v3d_lookup_bos(&submit, args->bo_handles,
>> args->bo_handle_count);
>> +            ret = v3d_lookup_bos(submit.file_priv, &cpu_job-
>> >base,
>> +                                 args->bo_handles, args-
>> >bo_handle_count);
>>              if (ret)
>>                      goto fail;
>>      }
>>

Reply via email to