We see unknown chipset errors with the 4.15 GA kernel in Bionic, but the
missing firmware issue is now fixed in Bionic. This Nvidia GPU is
supported by the Bionic HWE (4.18) kernel, so I am marking this as
verification-done.

** Tags removed: verification-needed verification-needed-bionic
** Tags added: verification-done verification-done-bionic

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux-firmware in Ubuntu.
https://bugs.launchpad.net/bugs/1794055

Title:
  [Witherspoon-DD2.2][Ubu 18.10] [4.18.0-7-generic ] OS booting thrown
  with nouveau errors; OS booted successfully

Status in The Ubuntu-power-systems project:
  Fix Committed
Status in linux package in Ubuntu:
  Won't Fix
Status in linux-firmware package in Ubuntu:
  Fix Released
Status in linux source package in Bionic:
  Won't Fix
Status in linux-firmware source package in Bionic:
  Fix Committed
Status in linux source package in Cosmic:
  Won't Fix
Status in linux-firmware source package in Cosmic:
  Fix Released

Bug description:
  SRU Justification

  Impact: Missing firmware for nouveau is causing errors to appear in
  dmesg.

  Fix: Add missing firmware files from upstream linux-firmware.

  Test Case: Confirm that errors in dmesg are gone once new firmware
  files are present.
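
  The check in the test case could be scripted roughly as follows. This
  is a minimal sketch, not part of the SRU: the function name is
  hypothetical, and the patterns are taken only from the log lines quoted
  in this bug, so other kernel versions may word the messages differently.
  Follow-on errors such as "failed to create kernel channel" are
  deliberately not matched.

```python
import re

# Patterns based on the nouveau messages quoted in this bug report (assumption:
# the kernel under test words its firmware load failures the same way).
NOUVEAU_FW_ERROR = re.compile(
    r"nouveau \S+: "
    r"(?:.*failed to load \S+|Direct firmware load for \S+ failed with error -?\d+)"
)


def nouveau_firmware_errors(dmesg_text):
    """Return the dmesg lines that report a nouveau firmware load failure."""
    return [
        line.strip()
        for line in dmesg_text.splitlines()
        if NOUVEAU_FW_ERROR.search(line)
    ]
```

  Feeding it the captured dmesg output before and after installing the
  updated linux-firmware package should show the list becoming empty if
  the fix behaves as described.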

  Regression Potential: New and updated firmware always has the potential
  to cause regressions; however, this firmware has been in Disco for
  several months with no reported issues.

  ---

  == Comment: #0 - Kalpana Shetty <kalsh...@in.ibm.com> - 2018-09-15 23:55:13 ==
  ---Problem Description---
  [Witherspoon-DD2.2][Ubu 18.10] [4.18.0-7-generic ] OS booting thrown with nouveau errors

  Contact Information = kalsh...@in.ibm.com, preeti.tha...@in.ibm.com

  ---uname output---
  root@ltc-wcwsp3:~# uname -a
  Linux ltc-wcwsp3 4.18.0-7-generic #8-Ubuntu SMP Tue Aug 28 18:20:56 UTC 2018 ppc64le ppc64le ppc64le GNU/Linux

  Machine Type = Witherspoon DD2.2 LC

  Steps:
  1. Netinstall Ubu 18.10 on Witherspoon-LC-DD2.2 6GPU system ------> PASS
  2. Boot the OS ---> PASS, but errors related to the open source NVIDIA driver are thrown on the console.

    [Disk: sdb2 / c0302064-c5a3-49a7-8bd4-402283e6fcbe]
      Ubuntu, with Linux 4.18.0-7-generic (recovery mode)
      Ubuntu, with Linux 4.18.0-7-generic
      Ubuntu
    [Disk: nvme0n1p2 / c5d042f1-812e-49e0-94b2-ade477084061]
      Ubuntu, with Linux 4.18.0-7-generic (recovery mode)
   *  Ubuntu, with Linux 4.18.0-7-generic
      Ubuntu

    System information
    System configuration
    System status log
    Language
    Rescan devices
    Retrieve config from URL
    Plugins (0)
    Exit to shell
   
   Enter=accept, e=edit, n=new, x=exit, l=language, g=log, h=help
  The system is going down NOW!
  Sent SIGTERM to all processes
  Sent SIGKILL to all processes
  [   57.513329] kexec_core: Starting new kernel
  [  149.358703978,5] OPAL: Switch to big-endian OS
  [  153.355498935,5] OPAL: Switch to little-endian OS
  [    2.943735] integrity: Unable to open file: /etc/keys/x509_ima.der (-2)
  [    2.943738] integrity: Unable to open file: /etc/keys/x509_evm.der (-2)
  [    3.132733] vio vio: uevent: failed to send synthetic uevent
  [    4.058698] nouveau 0004:04:00.0: gr: failed to load gr/sw_nonctx
  [    4.129215] nouveau 0004:04:00.0: DRM: failed to create kernel channel, -22
  [   19.126509] nouveau 0004:04:00.0: DRM: failed to idle channel 0 [DRM]
  [   19.281450] nouveau 0004:05:00.0: gr: failed to load gr/sw_nonctx
  [   19.351322] nouveau 0004:05:00.0: DRM: failed to create kernel channel, -22
  [   34.350509] nouveau 0004:05:00.0: DRM: failed to idle channel 0 [DRM]
  [   34.502063] nouveau 0004:06:00.0: gr: failed to load gr/sw_nonctx
  [   34.572144] nouveau 0004:06:00.0: DRM: failed to create kernel channel, -22
  [   49.570509] nouveau 0004:06:00.0: DRM: failed to idle channel 0 [DRM]
  [   49.734754] nouveau 0035:03:00.0: gr: failed to load gr/sw_nonctx
  [   49.805057] nouveau 0035:03:00.0: DRM: failed to create kernel channel, -22
  [   64.802510] nouveau 0035:03:00.0: DRM: failed to idle channel 0 [DRM]
  [   64.955442] nouveau 0035:04:00.0: gr: failed to load gr/sw_nonctx
  [   65.025537] nouveau 0035:04:00.0: DRM: failed to create kernel channel, -22

  [   80.022509] nouveau 0035:04:00.0: DRM: failed to idle channel 0 [DRM]
  [   80.181169] nouveau 0035:05:00.0: gr: failed to load gr/sw_nonctx
  [   80.251481] nouveau 0035:05:00.0: DRM: failed to create kernel channel, -22
  [   95.250509] nouveau 0035:05:00.0: DRM: failed to idle channel 0 [DRM]
  /dev/nvme0n1p2: recovering journal
  /dev/nvme0n1p2: clean, 72569/97681408 files, 7384418/390701312 blocks
  -.mount
  kmod-static-nodes.service
  dev-hugepages.mount
  dev-mqueue.mount
  sys-kernel-debug.mount
  ufw.service
  lvm2-lvmetad.service
  systemd-remount-fs.service
  systemd-random-seed.service
  systemd-sysusers.service
  keyboard-setup.service
  systemd-tmpfiles-setup-dev.service
  lvm2-monitor.service
  finalrd.service
  console-setup.service
  swapfile.swap
  ebtables.service
  systemd-udevd.service
  systemd-journald.service
  systemd-journal-flush.service
  systemd-tmpfiles-setup.service
  systemd-update-utmp.service
  [  100.997765] vio vio: uevent: failed to send synthetic uevent
  systemd-udev-trigger.service
  systemd-timesyncd.service
  apparmor.service
  lvm2-pvscan@8:3.service
  systemd-modules-load.service
  sys-kernel-config.mount
  sys-fs-fuse-connections.mount
  systemd-sysctl.service
  ondemand.service
  dbus.service
  irqbalance.service
  opal-prd.service
  lxcfs.service
  atd.service
  cron.service
  iprdump.service
  iprinit.service
  systemd-logind.service
  iprupdate.service
  systemd-networkd.service
  rsyslog.service
  polkit.service
  accounts-daemon.service
  lxd-containers.service
  networkd-dispatcher.service
  var-lib-lxcfs.mount
  tmp-selftest\x2dmountpoint\x2d039055037.mount
  snapd.service
  snapd.seeded.service
  systemd-resolved.service
  systemd-networkd-wait-online.service
  blk-availability.service
  systemd-user-sessions.service
  apport.service

  Ubuntu Cosmic Cuttlefish (development branch) ltc-wcwsp3 hvc0

  ltc-wcwsp3 login:

  == Comment: #2 - Kalpana Shetty <kalsh...@in.ibm.com> - 2018-09-16 00:07:26 ==
  sosreport -> http://9.114.13.132/repo/bugs/ubu/sosreport-BZ171506.171506-20180915235600.tar.xz

  == Comment: #3 - Kalpana Shetty <kalsh...@in.ibm.com> - 2018-09-16
  00:33:02 ==

  == Comment: #4 - Praveen K. Pandey <praveen.pan...@in.ibm.com> - 2018-09-19 05:52:23 ==
  Facing nouveau-related errors on a POWER8 system as well

  [    4.764818] nouveau 0002:01:00.0: fifo: fault 00 [READ] at 0000000000020000 engine 0c [HOST6] client 06 [GPC0/L1_2] reason 02 [PTE] on channel 0 [03ffb18000 DRM]
  [    4.942169] nouveau 000a:01:00.0: fifo: fault 00 [READ] at 0000000000020000 engine 0c [HOST6] client 06 [GPC0/L1_2] reason 02 [PTE] on channel 0 [03ffb18000 DRM]
  /dev/sdb2: clean, 132397/61054976 files, 5995714/244188416 blocks
  [   11.206278] vio vio: uevent: failed to send synthetic uevent
  [  OK  ] Started Show Plymouth Boot Screen.
  [  OK  ] Reached target Local Encrypted Volumes.
  [  OK  ] Started Forward Password Requests to Plymouth Directory Watch.
  plymouth-start.service
  [  OK  ] Started ebtables ruleset management.

  == Comment: #5 - Chandni Verma <chand...@in.ibm.com> - 2018-09-20 16:41:49 ==
  --- screening ---

  From provided dmesg, I notice:

  1294 [   19.281478] nouveau 0004:05:00.0: bios: version 88.00.13.00.02
  1295 [   19.282753] nouveau 0004:05:00.0: Direct firmware load for nvidia/gv100/gr/sw_nonctx.bin failed with error -2
  1296 [   19.282755] nouveau 0004:05:00.0: gr: failed to load gr/sw_nonctx
  1297 [   19.282813] nouveau 0004:05:00.0: Using 32-bit DMA via iommu

  ..

  1322 [   34.367713] nouveau 0004:06:00.0: NVIDIA GV100 (140000a1)
  1323 [   34.497152] nouveau 0004:06:00.0: bios: version 88.00.13.00.02
  1324 [   34.502736] nouveau 0004:06:00.0: Direct firmware load for nvidia/gv100/gr/sw_nonctx.bin failed with error -2
  1325 [   34.502738] nouveau 0004:06:00.0: gr: failed to load gr/sw_nonctx
  1326 [   34.502797] nouveau 0004:06:00.0: Using 32-bit DMA via iommu

  ..

  up to 6 instances of the above...

  Looks like an NVIDIA firmware issue.

  == Comment: #6 - Luciano Chavez <cha...@us.ibm.com> - 2018-09-20 17:03:31 ==
  (In reply to comment #5)
  > --- screening ---
  >
  > From provided dmesg, I notice:
  >
  >
  > 1294 [   19.281478] nouveau 0004:05:00.0: bios: version 88.00.13.00.02
  > 1295 [   19.282753] nouveau 0004:05:00.0: Direct firmware load for
  > nvidia/gv100/gr/sw_nonctx.bin failed with error -2
  > 1296 [   19.282755] nouveau 0004:05:00.0: gr: failed to load gr/sw_nonctx
  > 1297 [   19.282813] nouveau 0004:05:00.0: Using 32-bit DMA via iommu
  >
  > ..
  >
  > 1322 [   34.367713] nouveau 0004:06:00.0: NVIDIA GV100 (140000a1)
  > 1323 [   34.497152] nouveau 0004:06:00.0: bios: version 88.00.13.00.02
  > 1324 [   34.502736] nouveau 0004:06:00.0: Direct firmware load for
  > nvidia/gv100/gr/sw_nonctx.bin failed with error -2
  > 1325 [   34.502738] nouveau 0004:06:00.0: gr: failed to load gr/sw_nonctx
  > 1326 [   34.502797] nouveau 0004:06:00.0: Using 32-bit DMA via iommu
  >
  > ..
  >
  > upto 6 instances of the above...
  >
  >
  > Looks like an NVIDIA firmware issue.

  Well, I think those messages mean that the nouveau module can't find
  the firmware file, as opposed to this being a firmware issue. It might
  be a packaging issue, if it is not actually causing any real problems.
  Probably best to mirror this to Canonical for their comment.
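
  One way to test that reading: error -2 is ENOENT ("No such file or
  directory"), so each failed path can be checked against the firmware
  directory. The sketch below is illustrative only. The function name is
  hypothetical, and it assumes firmware is installed under /lib/firmware
  (the usual location, passed as a parameter so it can be overridden) and
  that each message sits on one unwrapped dmesg line or is wrapped only by
  whitespace.

```python
import errno
import os
import re

# Matches the "Direct firmware load" message format quoted in this bug.
FW_LOAD_FAILED = re.compile(
    r"Direct firmware load for\s+(\S+)\s+failed with error -(\d+)"
)


def missing_firmware(dmesg_text, firmware_root="/lib/firmware"):
    """Map each firmware path that failed to load to (errno name, file present?)."""
    report = {}
    for rel_path, code in FW_LOAD_FAILED.findall(dmesg_text):
        # Translate the numeric kernel error into its symbolic errno name.
        name = errno.errorcode.get(int(code), "unknown")
        present = os.path.exists(os.path.join(firmware_root, rel_path))
        report[rel_path] = (name, present)
    return report
```

  If a path is reported with ENOENT and the file is absent from the
  firmware directory, that would point to a missing file in the package
  rather than to a defect in the firmware itself.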

  == Comment: #10 - Chandni Verma <chand...@in.ibm.com> - 2018-09-24
  03:25:35 ==

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-power-systems/+bug/1794055/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp
