NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.

查看显卡驱动: nvidia-smi

报错: 

NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.

 

查看cuda是否还在: nvcc -V

nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2020 NVIDIA Corporation
Built on Thu_Jun_11_22:26:38_PDT_2020
Cuda compilation tools, release 11.0, V11.0.194
Build cuda_11.0_bu.TC445_37.28540450_0

 

查看nvidia版本:  ls /usr/src | grep nvidia

nvidia-450.80.02

 

执行命令

sudo apt install dkms
sudo dkms install -m nvidia -v 450.80.02

 

再执行: nvidia-smi

Mon Jan 25 18:28:31 2021       
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 450.80.02    Driver Version: 450.80.02    CUDA Version: 11.0     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  GeForce RTX 206...  Off  | 00000000:07:00.0 Off |                  N/A |
| 46%   21C    P0    23W / 175W |      0MiB /  7974MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
                                                                               
+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+

 

参考链接 

 

如果在执行下面后出现系统内核错误,那就是系统内核升级了。由于驱动是在之前的内核编译的,内核升级后就无法识别使用。除非是重新安装驱动?

sudo apt install dkms
sudo dkms install -m nvidia -v 450.80.02

在终端执行,查看当前系统内核

uname -r

目前免重装驱动的方法就是在电脑启动引导系统选项时,选择ubuntu高级选项,选择与当前内核最接近的那个低版本内核。进入系统显卡驱动就回来了。

 

ubuntu-driver devices

 

 

posted @ 2021-01-25 18:30  小小灰迪  阅读(1902)  评论(0编辑  收藏  举报