Ubuntu Nvidia driver驱动安装(新)

前言

英伟达更新了安装驱动的方式,更新一下文档

旧文:Ubuntu Nvidia driver驱动安装及卸载

下载官方驱动安装

1.安装驱动前一定要更新软件列表和安装必要软件、依赖(必须)

sudo apt-get update   #更新软件列表
 
sudo apt-get install -y g++ gcc make

2.查看GPU型号(你自己知道显卡型号就不用查了,直接下一步;通用查看显卡型号命令:lspci | grep -i vga)

lspci | grep -i nvidia

得到如下输出,其中 Tesla T4 就是型号

00:06.0 3D controller: NVIDIA Corporation TU104GL [Tesla T4] (rev a1)
00:07.0 3D controller: NVIDIA Corporation TU104GL [Tesla T4] (rev a1)

3.官网下载对应驱动,注意要把 nvidia 驱动放在英文名文件夹下

https://www.nvidia.cn/drivers/lookup/

4.卸载原有驱动,请看下方「正确卸载驱动以及查看是否卸载干净命令」

5.禁用nouveau(nouveau是通用的驱动程序)(必须)

Ubuntu 系统集成的显卡驱动程序是 nouveau ,它是第三方为 NVIDIA 开发的开源驱动,我们需要先将其屏蔽再安装 NVIDIA 官方驱动,否则直接安全驱动的话往往会报以下错误:ERROR: The Nouveau kernel driver is currently in use by your system. This driver is incompatible with the NVIDIA driver

nouveau 驱动加入黑名单,将如下内容加入 /etc/modprobe.d/blacklist.conf 文件中:

sudo vim /etc/modprobe.d/blacklist.conf

# 添加如下内容
blacklist vga16fb
blacklist nouveau
blacklist rivafb
blacklist rivatv
blacklist nvidiafb

然后执行命令

sudo update-initramfs -u

重启机器后(必须),执行如下命令,如果没有输出则证明禁用成功

sudo lsmod | grep nouveau

6.开始安装 NVIDIA 官方驱动

执行 dpkg -i 将本地仓库包安装到系统中

sudo dpkg -i nvidia-driver-local-repo-ubuntu2004-560.35.03_1.0-1_amd64.deb

输出

sudo dpkg -i nvidia-driver-local-repo-ubuntu2004-560.35.03_1.0-1_amd64.deb
Selecting previously unselected package nvidia-driver-local-repo-ubuntu2004-560.35.03.
(Reading database ... 114186 files and directories currently installed.)
Preparing to unpack nvidia-driver-local-repo-ubuntu2004-560.35.03_1.0-1_amd64.deb ...
Unpacking nvidia-driver-local-repo-ubuntu2004-560.35.03 (1.0-1) ...
Setting up nvidia-driver-local-repo-ubuntu2004-560.35.03 (1.0-1) ...

The public nvidia-driver-local-repo-ubuntu2004-560.35.03 GPG key does not appear to be installed.
To install the key, run this command:
sudo cp /var/nvidia-driver-local-repo-ubuntu2004-560.35.03/nvidia-driver-local-63C60622-keyring.gpg /usr/share/keyrings/

此时复制授权密钥

sudo cp /var/nvidia-driver-local-repo-ubuntu2004-560.35.03/nvidia-driver-local-63C60622-keyring.gpg /usr/share/keyrings/

再次执行

sudo dpkg -i nvidia-driver-local-repo-ubuntu2004-560.35.03_1.0-1_amd64.deb

修复依赖关系(如果有问题):

sudo apt-get install -f

更新包列表

sudo apt-get update

搜索驱动 nvidia-driver-560,是否存在

sudo apt-cache search nvidia-driver-560

输出

nvidia-driver-530-open - Transitional package for nvidia-driver-560-open
nvidia-driver-530 - Transitional package for nvidia-driver-560
nvidia-driver-560-open - NVIDIA driver (open kernel) metapackage
nvidia-driver-560 - NVIDIA driver metapackage
nvidia-headless-560-open - NVIDIA headless metapackage (open kernel module)
nvidia-headless-560 - NVIDIA headless metapackage
nvidia-headless-no-dkms-560-open - NVIDIA headless metapackage - no DKMS (open kernel module)
nvidia-headless-no-dkms-560 - NVIDIA headless metapackage - no DKMS
xserver-xorg-video-nvidia-560 - NVIDIA binary Xorg driver

开始安装驱动

sudo apt-get install nvidia-driver-560

安装完成,最后执行 nvidia-smi,有输出,表示安装成功!!

nvidia-smi
Wed Sep  4 10:06:18 2024
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 560.35.03              Driver Version: 560.35.03      CUDA Version: 12.6     |
|-----------------------------------------+------------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |
|                                         |                        |               MIG M. |
|=========================================+========================+======================|
|   0  Tesla T4                       Off |   00000000:00:06.0 Off |                    0 |
| N/A   62C    P0             29W /   70W |       1MiB /  15360MiB |      5%      Default |
|                                         |                        |                  N/A |
+-----------------------------------------+------------------------+----------------------+
|   1  Tesla T4                       Off |   00000000:00:07.0 Off |                    0 |
| N/A   67C    P0             31W /   70W |       1MiB /  15360MiB |      5%      Default |
|                                         |                        |                  N/A |
+-----------------------------------------+------------------------+----------------------+

+-----------------------------------------------------------------------------------------+
| Processes:                                                                              |
|  GPU   GI   CI        PID   Type   Process name                              GPU Memory |
|        ID   ID                                                               Usage      |
|=========================================================================================|
|  No running processes found                                                             |
+-----------------------------------------------------------------------------------------+

正确卸载驱动以及查看是否卸载干净命令

如果安装完显卡驱动之后,发现有问题,需要卸载,用下面方法,一步步来。
方法一:
 
#先查看驱动以及版本安装情况,命令如下:
ls /usr/src | grep nvidia
 
#进入安装目录,用驱动自带卸载命令卸载,一般情况下能卸载干净
cd /usr/bin
ls nvidia-*
sudo nvidia-uninstall
 
#然后用上面的查看命令查看是否卸载干净,完成卸载
ls /usr/src | grep nvidia
 
#如果需要卸载干净所有英伟达驱动命令,如下
sudo apt-get remove --purge nvidia-*(/nvidia*)
sudo apt autoremove   # 此命令在万般无奈之下在使用,可能会卸载无辜软件,影响稳定性。。。
 
 
方法二:(推荐)
 
#直接卸载驱动
sudo ./显卡驱动包名称 --uninstall
 
#卸载两件套
sudo apt-get purge nvidia*
sudo apt-get autoremove
sudo reboot
posted @ 2024-09-04 18:21  牛奔  阅读(3015)  评论(0编辑  收藏  举报