NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
2026-5-10 09:46:48 Author: dyrnq.com

NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.

dmesg | grep -i nvidia
[    2.500890] input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:01.0/0000:01:00.1/sound/card1/input5
[    2.500919] input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:01.0/0000:01:00.1/sound/card1/input6
[    2.500946] input: HDA NVidia HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:01.0/0000:01:00.1/sound/card1/input7
[    2.500977] input: HDA NVidia HDMI/DP,pcm=9 as /devices/pci0000:00/0000:00:01.0/0000:01:00.1/sound/card1/input8
[ 4768.060904] eglinfo[6073]: segfault at 10 ip 00007f5cf064188c sp 00007ffefb476d40 error 4 in libnvidia-egl-gbm.so.1.1.3[188c,7f5cf0641000+3000] likely on CPU 4 (core 8, socket 0)

The kernel was upgraded, so the driver has to be reinstalled!
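One way to confirm that a kernel upgrade is the cause is to check whether a driver module file exists for the running kernel at all. The following is a minimal sketch, not part of the original fix: the function name `module_built_for` is hypothetical, and it assumes modules live under `/lib/modules/<release>/` and that the module is named `nvidia`.

```shell
#!/usr/bin/env bash
# Sketch: report whether a kernel module file exists for a given kernel release.
# Usage: module_built_for <release> [module_name] [modules_root]
# (module_built_for is a hypothetical helper name, not an existing tool.)
module_built_for() {
    local rel="$1" name="${2:-nvidia}" root="${3:-/lib/modules}"
    # Module files may be compressed (.ko, .ko.xz, .ko.zst), hence the glob.
    # A missing directory makes find print nothing, so the test fails cleanly.
    [ -n "$(find "$root/$rel" -type f -name "$name.ko*" 2>/dev/null | head -n 1)" ]
}

if module_built_for "$(uname -r)"; then
    echo "nvidia module found for $(uname -r)"
else
    echo "no nvidia module for $(uname -r); rebuild or reinstall the driver"
fi
```

If the check reports no module for the running kernel while an older kernel directory still contains one, the driver was most likely built only for the previous kernel.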

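# Install headers for the running kernel; they are needed to build the driver module.
CUR_REL=$(uname -r)
apt install -y "linux-headers-$CUR_REL"
echo "--- Holding packages for the running kernel to prevent automatic upgrades ---"
# -F matches the release string literally (its dots are not regex wildcards).
PACKAGES=$(dpkg -l | grep -F "$CUR_REL" | awk '{print $2}')
if [ -z "$PACKAGES" ]; then
    echo "Error: no packages found for the running kernel; check the dpkg state."
    exit 1
fi
# Word splitting on $PACKAGES is intended: one package name per iteration.
for PKG in $PACKAGES; do
    echo "Holding: $PKG"
    apt-mark hold "$PKG"
done
apt-mark showhold
## The kernel is now held, so a routine apt upgrade skips it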
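
A held kernel stays held until the hold is released, so an intentional kernel upgrade later needs the reverse step. The sketch below is an assumption about how that could look, not something from the original post: `kernel_pkgs_for` and `unhold_running_kernel` are hypothetical helper names. It filters on the package name column of `dpkg -l` and keeps only installed packages (status `ii`, or `hi` once held).

```shell
#!/usr/bin/env bash
# Sketch: release the hold placed on the running kernel's packages.

# Read `dpkg -l` output on stdin and print the names of installed packages
# (second status character "i", e.g. "ii" or "hi") whose name contains
# the given kernel release string.
kernel_pkgs_for() {
    awk -v rel="$1" '$1 ~ /^[a-z]i/ && index($2, rel) { print $2 }'
}

# Unhold every installed package that belongs to the running kernel.
# Defined only; call it explicitly (as root) when an upgrade is intended.
unhold_running_kernel() {
    local rel pkgs
    rel="$(uname -r)"
    pkgs="$(dpkg -l | kernel_pkgs_for "$rel")"
    if [ -z "$pkgs" ]; then
        echo "no packages found for $rel" >&2
        return 1
    fi
    # Word splitting on $pkgs is intended: one argument per package.
    # shellcheck disable=SC2086
    apt-mark unhold $pkgs
}
```

Running `unhold_running_kernel` as root before a planned kernel upgrade releases the packages; the NVIDIA driver then has to be reinstalled for the new kernel, as described above.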

Source: https://dyrnq.com/nvidia-smi-has-failed-because-it-couldnt-communicate-with-the-nvidia-driver-make-sure-that-the-latest-nvidia-driver-is-installed-and-running/