nvidia-smi shows "No running processes found", and lsof /dev/nvidia* returns nothing. However, when I try to run nvidia-smi -r, it says:
# nvidia-smi -r -i 0
GPU 00000000:xxxxxx is currently in use by another process.
1 device is currently being used by one or more other processes (e.g., Fabric Manager, CUDA application, graphics application such as an X server, or a monitoring application such as another instance of nvidia-smi). Please first kill all processes using this device and all compute applications running in the system.
How can I find out which process is actually using it?
2 Answers
brccelvz1#
Use the following command to get a list of the processes using the GPU:
cyej8jka2#
It turns out the message is a bit misleading. Just run
modprobe -r nvidia_drm
and the reset should then work as expected.
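
To see what might be holding the device when no userspace process shows up, it can help to look at the kernel modules themselves. The sketch below assumes that a module such as nvidia_drm, rather than a process, is what keeps the GPU busy. The helper name nvidia_module_users is hypothetical and not part of any NVIDIA tool; it only filters standard lsmod output.

```shell
#!/bin/sh
# Hypothetical helper (not an NVIDIA tool): summarise nvidia* kernel modules
# from lsmod-format text on stdin. lsmod columns are:
#   Module  Size  Used-by-count  [comma-separated holder modules]
nvidia_module_users() {
    awk '$1 ~ /^nvidia/ {
        # Print module name, reference count, and holders ("-" if none listed)
        printf "%s refs=%s holders=%s\n", $1, $3, (NF >= 4 ? $4 : "-")
    }'
}

# Typical use, as root (commented out so that sourcing this file has no side effects):
#   lsmod | nvidia_module_users
#   modprobe -r nvidia_drm && nvidia-smi -r -i 0
```

If nvidia_drm is listed with a non-zero reference count, modprobe -r will probably refuse to unload it until whatever is using it (often a display server) has been stopped.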