NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver

Aidan Lew 20 Reputation points
2023-07-12T08:58:34.96+00:00

Hi,I meet some problems installing NVIDIA driver.

My server is Azure with Standard_NC48ads_A100_v4, with no NVIDIA driver.

So I followed the steps in https://learn.microsoft.com/zh-cn/azure/virtual-machines/linux/n-series-driver-setup.

But after download and install,I entered "nvidia-smi" and received

"NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver.Make sure that the latest NVIDIA driver is installed and running"

Can you help us with the problem?Or can you help us with the NVIDIA driver install?

Some info of my server are as follows:

lspci | grep -i NVIDIA

0001:00:00.0 3D controller: NVIDIA Corporation Device 20b5 (rev al)

0002:00:00.0 3D controller: NVIDIA Corporation Device 20b5 (rev al)

Lsmod | grep

(empty)

dkms status

nvidia,535.54.03,5.15.0-1040-azure, x86 64: installed

nvidia,535.54.03,5.15.0-1041-azure, x86_64: installed

nokutil --sb-state

ScureBoot enabled

nvcc --version

nvcc: NVIDIA (R)Cuda compiler driver

Copyright (c)2005-2019 NVIDIA Corporation

Built on Sun Jul_28_19:07:16_PDT_2019

Cuda compilation tools,release 10.1,V10.1.243

uname -a

Linux ai 5.15.0-1041-azure #48~20.04.1-Ubuntu SMP Wed Jun 21 15:03:04 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux

Azure Virtual Machines
Azure Virtual Machines
An Azure service that is used to provision Windows and Linux virtual machines.
7,251 questions
0 comments No comments
{count} votes

3 answers

Sort by: Most helpful
  1. CodeKidz 35 Reputation points
    2024-03-19T02:25:05.5433333+00:00

    Not work for me, I've almost got madness of this

    1 person found this answer helpful.
    0 comments No comments

  2. Karthikeyan Kannan 5 Reputation points
    2024-05-10T09:37:56.92+00:00

    Hi,

    I'm also facing the similar issue. I have deployed Standard_NC48ads_A100_v4 vm. Have installed the drivers and been using in development for a month.

    All of a sudden got stuck into this issue. Please anyone could help me with this. How did you guys resolve it?

    1 person found this answer helpful.

  3. deherman-MSFT 34,021 Reputation points Microsoft Employee
    2023-07-12T22:01:03.2066667+00:00

    @Aidan Lew

    Update:
    Sorry, I misread before and thought you were using 22.04. Since you are using 20.04 I highly recommend using the NVIDIA GPU Driver Extension for Linux. I launched an Ubuntu 20.04 and was able to install the driver and run the nvidia-smi command without issue.

    Please try this and let me know if it doesn't work for you.


    If you still have questions, please let us know in the "comments" and we would be happy to help you. Comment is the fastest way of notifying the experts.

    If the answer has been helpful, we appreciate hearing from you and would love to help others who may have the same question. Accepting answers helps increase visibility of this question for other members of the Microsoft Q&A community.

    Thank you for helping to improve Microsoft Q&A! User's image