[Kernel-packages] [Bug 2072755] Re: i915: Fixup regressions introduced with enabling single CCS engine

Ubuntu Kernel Bot Fri, 25 Oct 2024 00:22:26 -0700

This bug is awaiting verification that the linux-nvidia-
tegra/6.8.0-1001.1 kernel in -proposed solves the problem. Please test
the kernel and update this bug with the results. If the problem is
solved, change the tag 'verification-needed-noble-linux-nvidia-tegra' to
'verification-done-noble-linux-nvidia-tegra'. If the problem still
exists, change the tag 'verification-needed-noble-linux-nvidia-tegra' to
'verification-failed-noble-linux-nvidia-tegra'.



If verification is not done by 5 working days from today, this fix will
be dropped from the source code, and this bug will be closed.


See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how
to enable and use -proposed. Thank you!


** Tags added: kernel-spammed-noble-linux-nvidia-tegra-v2 
verification-needed-noble-linux-nvidia-tegra

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/2072755

Title:
  i915: Fixup regressions introduced with enabling single CCS engine

Status in linux package in Ubuntu:
  Fix Released
Status in linux source package in Noble:
  Fix Released

Bug description:
  BugLink: https://bugs.launchpad.net/bugs/2072755

  [Impact]

  Recently, the Intel i915 susbsystem underwent a change that limited
  the number of CCS engines that were initialised by default, and
  exposed to the user. Different chipsets have differing amounts of CCS
  engines, but most available in the market have 4 CCS engines. The new
  change just starts a single engine only, and allocates all CCS slices
  to this single engine. This single engine is then exposed to
  userspace. This effort is to workaround a hardware bug.

  This all happened in:

  commit 6db31251bb265813994bfb104eb4b4d0f44d64fb
  Author: Andi Shyti <andi.sh...@linux.intel.com>
  Date:   Thu Mar 28 08:34:05 2024 +0100
  Subject: drm/i915/gt: Enable only one CCS for compute workload
  Link: 
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=6db31251bb265813994bfb104eb4b4d0f44d64fb

  which landed in:

  $ git describe --contains 67f164e8510b16bda18642464863dba87a33d8cb
  Ubuntu-6.8.0-38.38~525

  There have been some side effects as a result of these changes,
  leading to failure of userspace applications, namely in video
  transcoding with ffmepg, resulting in fence expiration errors in dmesg
  like:

  [ 81.026591] Fence expiration time out
  i915-0000:01:00.0:ffmpeg[521]:2!

  There has also been a performance impact introduced by this change,
  which dropped performance of the GPU to 1/4 of what it was previously.
  This is likely due to most ARC GPUs usually having 4 CCS engines, and
  going down to 1 only without actually allocating the other three.

  There are no workarounds. Users are suggested to downgrade to
  6.8.0-36-generic while the fix is coming.

  [Fix]

  The regression was fixed by these two commits:

  commit aee54e282002a127612b71255bbe879ec0103afd
  Author: Andi Shyti <andi.sh...@linux.intel.com>
  Date: Fri Apr 26 02:07:23 2024 +0200
  Subject: drm/i915/gt: Automate CCS Mode setting during engine resets
  Link: 
https://git.launchpad.net/~ubuntu-kernel/ubuntu/+source/linux/+git/noble/commit/?id=aee54e282002a127612b71255bbe879ec0103afd

  commit ee01b6a386eaf9984b58a2476e8f531149679da9
  Author: Andi Shyti <andi.sh...@linux.intel.com>
  Date: Fri May 17 11:06:16 2024 +0200
  Subject: drm/i915/gt: Fix CCS id's calculation for CCS mode setting
  Link: 
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=ee01b6a386eaf9984b58a2476e8f531149679da9

  "drm/i915/gt: Automate CCS Mode setting during engine resets" is
  already applied to noble/master-next through upstream stable v6.8.10.

  We just need "drm/i915/gt: Fix CCS id's calculation for CCS mode
  setting". It is queued up for v6.9.4, but that could still be another
  SRU cycle or two away. So send it now.

  "drm/i915/gt: Fix CCS id's calculation for CCS mode setting" restores
  another 1/4 performance, but some performance issues still remain, and
  will hopefully be addressed in a future patch.

  [Testcase]

  This affects video transcoding with ffmpeg, on machines equipped with
  Intel ARC GPUs.

  An example ffmpeg command might be:

  /usr/lib/jellyfin-ffmpeg/ffmpeg -analyzeduration 200M -probesize 1G
  -ss 00:00:03.000 -noaccurate_seek -init_hw_device
  vaapi=va:,kernel_driver=i915,driver=iHD -init_hw_device qsv=qs@va
  -filter_hw_device qs -hwaccel vaapi -hwaccel_output_format vaapi
  -noautorotate -i file:"/path/to/1080_video.mkv" -noautoscale
  -map_metadata -1 -map_chapters -1 -threads 0 -map 0:0 -map 0:1 -map
  -0:s -codec:v:0 av1_qsv -preset veryfast -b:v 3616000 -maxrate 3616000
  -bufsize 7232000 -g:v:0 72 -keyint_min:v:0 72 -vf
  
"setparams=color_primaries=bt709:color_trc=bt709:colorspace=bt709,scale_vaapi=w=1280:h=720:format=nv12:extra_hw_frames=24,hwmap=derive_device=qsv,format=qsv"
  -codec:a:0 libfdk_aac -ac 2 -vbr:a 5 -copyts -avoid_negative_ts
  disabled -max_muxing_queue_size 2048 -f hls -max_delay 5000000
  -hls_time 3 -hls_segment_type fmp4 -hls_fmp4_init_filename
  "c30716eb121448346fcc00a2440071a3-1.mp4" -start_number 1
  -hls_segment_filename
  "/var/lib/jellyfin/transcodes/c30716eb121448346fcc00a2440071a3%d.mp4"
  -hls_playlist_type vod -hls_list_size 0 -y
  "/var/lib/jellyfin/transcodes/c30716eb121448346fcc00a2440071a3.m3u8

  Another user on bug 2072933 came up with this minimalist reproducer:

  #include <cstdio>
  #include <sycl/sycl.hpp>

  int main() {
    // auto selector = sycl::cpu_selector_v; // Works fine
    auto selector = sycl::gpu_selector_v;

    auto queue = sycl::queue(selector);

    printf("Hello\n");
    queue.submit([&](sycl::handler &cgh) {
      cgh.parallel_for(sycl::range(1), [=](sycl::item<1> item) {});
    });
    queue.wait();
    printf("Bye\n");

    return 0;
  }

  $ icpx -fsycl sycltest.cpp -o sycltest
  $ ./sycltest

  These commands should run successfully to completion. On failure, they will
  emit in dmesg:

  [ 81.026591] Fence expiration time out
  i915-0000:01:00.0:ffmpeg[521]:2!

  A test kernel is available in the following ppa:

  https://launchpad.net/~mruffell/+archive/ubuntu/lp2072755-test

  If you install the test kernel, things should work correctly.

  [Where problems could occur]

  This issue affects users of i915, which is a pretty universal
  integrated GPU present on Intel processors. While these patches are
  unlikely to cause outages that stop the primary display from
  functioning, any further regressions may add additional performance
  impact or prevent workloads from executing correctly.

  These patches are all accepted into upstream -stable, and we would
  consume them in due course anyway.

  If a regression were to occur, there are no workarounds, and users
  would need to select an older kernel until a fix is available.

  [Other info]

  Upstream Bug:
  https://gitlab.freedesktop.org/drm/i915/kernel/-/issues/10895

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2072755/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

[Kernel-packages] [Bug 2072755] Re: i915: Fixup regressions introduced with enabling single CCS engine

Reply via email to