On Sun, Sep 19, 2021 at 01:22:30PM +0200, Roger Price wrote:
> My Nvidia NVS 310 card with the nvidia 390.144 driver starts off perfectly,
> but after two days freezes: no reaction to keyboard or mouse action.
>
> After reboot, command journalctl -b -1 --no-pager | grep -i -E
> "(nvidia|NVRM)" reports
>
> Sep 17 10:43:25 titan kernel: nvidia: loading out-of-tree module taints
> kernel.
> Sep 17 10:43:25 titan kernel: nvidia: module license 'NVIDIA' taints kernel.
> Sep 17 10:43:25 titan kernel: nvidia: module verification failed: signature
> and/or required key missing - tainting kernel
> Sep 17 10:43:25 titan kernel: nvidia-nvlink: Nvlink Core is being
> initialized, major device number 246
> Sep 17 10:43:25 titan kernel: nvidia 0000:03:00.0: vgaarb: changed VGA
> decodes: olddecodes=io+mem,decodes=none:owns=io+mem
> Sep 17 10:43:25 titan kernel: NVRM: loading NVIDIA UNIX x86_64 Kernel Module
> 390.144 Wed Jun 2 23:02:41 UTC 2021 (using threaded interrupts)
> Sep 17 10:43:25 titan kernel: nvidia-modeset: Loading NVIDIA Kernel Mode
> Setting Driver for UNIX platforms 390.144 Wed Jun 2 22:58:46 UTC 2021
> Sep 17 10:43:25 titan kernel: [drm] [nvidia-drm] [GPU ID 0x00000300] Loading
> driver
> Sep 17 10:43:25 titan kernel: [drm] Initialized nvidia-drm 0.0.0 20160202
> for 0000:03:00.0 on minor 0
> Sep 17 10:43:25 titan systemd-modules-load[308]: Inserted module 'nvidia_drm'
> Sep 17 10:43:25 titan kernel: input: HDA NVidia HDMI/DP,pcm=3 as
> /devices/pci0000:00/0000:00:03.0/0000:03:00.1/sound/card1/input4
> Sep 17 10:43:25 titan kernel: input: HDA NVidia HDMI/DP,pcm=7 as
> /devices/pci0000:00/0000:00:03.0/0000:03:00.1/sound/card1/input5
> Sep 17 10:44:13 titan kernel: caller _nv027867rm+0x58/0x90 [nvidia] mapping
> multiple BARs
> Sep 17 10:44:13 titan kernel: caller _nv001015rm+0x1bf/0x1f0 [nvidia]
> mapping multiple BARs
> Sep 17 10:44:13 titan kernel: caller _nv001015rm+0x1bf/0x1f0 [nvidia]
> mapping multiple BARs
> Sep 19 12:04:04 titan kernel: NVRM: GPU at PCI:0000:03:00:
> GPU-b2e8a5f5-e5f3-195f-0aa3-869e3fc027ac
> Sep 19 12:04:04 titan kernel: NVRM: Xid (PCI:0000:03:00): 8, Channel 00000006
> Sep 19 12:14:19 titan kernel: nvidia-modeset: ERROR: GPU:0: Idling display
> engine timed out: 0x0000907c:1:0
> Sep 19 12:14:21 titan kernel: nvidia-modeset: ERROR: GPU:0: Idling display
> engine timed out: 0x0000907c:1:0
> Sep 19 12:14:23 titan kernel: nvidia-modeset: ERROR: GPU:0: Idling display
> engine timed out: 0x0000907c:1:0
> Sep 19 12:14:25 titan kernel: nvidia-modeset: ERROR: GPU:0: Idling display
> engine timed out: 0x0000907c:1:0
> Sep 19 12:14:27 titan kernel: nvidia-modeset: ERROR: GPU:0: Idling display
> engine timed out: 0x0000907d:0:0
> Sep 19 12:14:29 titan kernel: nvidia-modeset: ERROR: GPU:0: Idling display
> engine timed out: 0x0000907c:0:0
> Sep 19 12:14:31 titan kernel: nvidia-modeset: ERROR: GPU:0: Idling display
> engine timed out: 0x0000907c:1:0
> Sep 19 12:14:39 titan kernel: nvidia-modeset: ERROR: GPU:0: Idling display
> engine timed out: 0x0000907d:0:0
>
> inxi -SxGay reports:
>
> System:
> Host: titan Kernel: 5.10.0-8-amd64 x86_64 bits: 64 compiler: gcc v: 10.2.1
> parameters: BOOT_IMAGE=/boot/vmlinuz-5.10.0-8-amd64
> root=UUID=faab0405-0274-4392-bca4-0f6fb6635a87 ro log_buf_len=1M
> ipv6.disable=1 net.ifnames=0 3 quiet
> Desktop: Xfce 4.16.0 tk: Gtk 3.24.24 info: xfce4-panel wm: xfwm4
> dm: LightDM 1.26.0 Distro: Debian GNU/Linux 11 (bullseye)
> Graphics:
> Device-1: NVIDIA GF119 [NVS 310] vendor: Hewlett-Packard driver: nvidia
> v: 390.144 bus ID: 03:00.0 chip ID: 10de:107d class ID: 0300
> Display: server: X.Org 1.20.11 driver: loaded: nvidia
> unloaded: fbdev,modesetting,nouveau,vesa alternate: nv
> display ID: localhost:10.0 screens: 1
> Screen-1: 0 s-res: 2880x2560 s-dpi: 96 s-size: 762x677mm (30.0x26.7")
> s-diag: 1019mm (40.1")
> Monitor-1: DP-2 res: 1440x2560 hz: 60
> Monitor-2: DP-3 res: 1440x2560 hz: 60
> OpenGL: renderer: NVS 310/PCIe/SSE2 v: 4.6.0 NVIDIA 390.144
> direct render: Yes
>
> I still have nouveau present. dpkg-query -l | grep nouveau reports:
> ii libdrm-nouveau2:amd64 2.4.104-1 amd64 Userspace interface to
> nouveau-specific kernel DRM services -- runtime
> ii xserver-xorg-video-nouveau 1:1.0.17-1 amd64 X.Org X server -- Nouveau
> display driver
>
> Any suggestion or hint for avoiding this freeze would be much appreciated.
>
> Roger
>

This comes down, perhaps, to having both the nouveau and nvidia drivers on the same system.

I've previously had success only by installing a minimal (command line) Debian, installing the prerequisites needed to build the proprietary Nvidia driver, building it, and only then installing X Windows / Wayland and a desktop environment via tasksel. That more or less ensures that you never load nouveau at all: blacklisting it or unloading it once installed is tedious.

The merit of the proprietary driver is that it mostly works; the downside is that you can't fix it when something is wrong. Cue hundreds of hours of flamewars as to why you should / shouldn't buy/use Nvidia, which drivers you should install ...

All the very best, as ever,

Andy Cater
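
P.S. Before rebuilding anything, it may be worth confirming whether the nouveau kernel module is actually loaded alongside nvidia. The two packages listed by dpkg-query are userspace pieces, so on their own they don't show what the kernel has loaded. What follows is only a rough sketch, not a tested diagnostic: it assumes a Linux system where /proc/modules lists the loaded modules (the same data lsmod shows), and the function names loaded_gpu_modules and conflicting_drivers are made up for illustration.

```python
from pathlib import Path

# Module name prefixes of interest. "nvidia" also matches nvidia_drm,
# nvidia_modeset and similar helper modules.
GPU_PREFIXES = ("nouveau", "nvidia")


def loaded_gpu_modules(proc_modules_text):
    """Return the names of loaded nouveau/nvidia modules, in file order.

    Each line of /proc/modules starts with the module name, followed by
    whitespace-separated fields that are ignored here.
    """
    names = []
    for line in proc_modules_text.splitlines():
        fields = line.split()
        if fields and fields[0].startswith(GPU_PREFIXES):
            names.append(fields[0])
    return names


def conflicting_drivers(names):
    """True if nouveau and at least one nvidia module are both loaded."""
    has_nouveau = "nouveau" in names
    has_nvidia = any(name.startswith("nvidia") for name in names)
    return has_nouveau and has_nvidia


if __name__ == "__main__":
    path = Path("/proc/modules")  # Linux only; treated as empty elsewhere
    text = path.read_text() if path.exists() else ""
    names = loaded_gpu_modules(text)
    print("loaded GPU modules:", names or "none")
    print("both drivers loaded:", conflicting_drivers(names))
```

If this reports only nvidia modules, then nouveau is installed but not loaded, and the freeze would need another explanation.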