Skip to content

Unable to use nvidia container on RTX5090 in ubuntu 22.04 #1121

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
RENyunfan opened this issue May 30, 2025 · 1 comment
Open

Unable to use nvidia container on RTX5090 in ubuntu 22.04 #1121

RENyunfan opened this issue May 30, 2025 · 1 comment

Comments

@RENyunfan
Copy link

Issue with GPU Access in Docker

After successfully installing Docker and the NVIDIA driver, I ran the following commands to check the installation:

NVIDIA Driver Status

➜  ~ nvidia-smi
Sat May 31 00:29:13 2025       
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 570.133.07             Driver Version: 570.133.07     CUDA Version: 12.8     |
|-----------------------------------------+------------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |
|                                         |                        |               MIG M. |
|=========================================+========================+======================|
|   0  NVIDIA GeForce RTX 5090 ...    Off |   00000000:C4:00.0  On |                  N/A |
| N/A   43C    P8             21W /   95W |     416MiB /  24463MiB |      7%      Default |
|                                         |                        |                  N/A |
+-----------------------------------------+------------------------+----------------------+
                                                                                         
+-----------------------------------------------------------------------------------------+
| Processes:                                                                              |
|  GPU   GI   CI              PID   Type   Process name                        GPU Memory |
|        ID   ID                                                               Usage      |
|=========================================================================================|
|    0   N/A  N/A            2133      G   /usr/lib/xorg/Xorg                      128MiB |
|    0   N/A  N/A            2306      G   /usr/bin/gnome-shell                     50MiB |
|    0   N/A  N/A            4360      G   ...ess --variations-seed-version         17MiB |
|    0   N/A  N/A            5089      G   /opt/microsoft/msedge/msedge              4MiB |
|    0   N/A  N/A            5142      G   ...144 --variations-seed-version        114MiB |
+-----------------------------------------------------------------------------------------+

NVIDIA Container Information

➜  ~ nvidia-container-cli --version
cli-version: 1.17.8
lib-version: 1.17.8
build date: 2025-05-30T13:47+00:00
build revision: 6eda4d76c8c5f8fc174e4abca83e513fb4dd63b0
build compiler: x86_64-linux-gnu-gcc-7 7.5.0
build platform: x86_64
build flags: -D_GNU_SOURCE -D_FORTIFY_SOURCE=2 -DNDEBUG -std=gnu11 -O2 -g -fdata-sections -ffunction-sections -fplan9-extensions -fstack-protector -fno-strict-aliasing -fvisibility=hidden -Wall -Wextra -Wcast-align -Wpointer-arith -Wmissing-prototypes -Wnonnull -Wwrite-strings -Wlogical-op -Wformat=2 -Wmissing-format-attribute -Winit-self -Wshadow -Wstrict-prototypes -Wunreachable-code -Wconversion -Wsign-conversion -Wno-unknown-warning-option -Wno-format-extra-args -Wno-gnu-alignof-expression -Wl,-zrelro -Wl,-znow -Wl,-zdefs -Wl,--gc-sections

  ~ nvidia-container-runtime --version
NVIDIA Container Runtime version 1.17.8
commit: f202b80a9b9d0db00d9b1d73c0128c8962c55f4d
spec: 1.2.1

runc version 1.2.5
commit: v1.2.5-0-g59923ef
spec: 1.2.0
go: go1.23.7
libseccomp: 2.5.3
➜  ~ 

I attempted to run a Docker container with GPU support:

➜  ~ docker run --rm --gpus all nvidia/cuda:12.8.0-base-ubuntu22.04 nvidia-smi
docker: Error response from daemon: could not select device driver "" with capabilities: [[gpu]]

Run 'docker run --help' for more information
➜  ~ 

I still cannot utilize the GPU in Docker. Does anyone have any ideas or suggestions to resolve this issue?

@jianli-NV
Copy link

@RENyunfan It seems the GPU0 is being used by some other applications, did you have a retry after terminating the following processes:

Image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants