Docker image with GPU support gives error: kernel version 535.129.3 does not match DSO version 545.23.6

I thought a Docker image with GPU support would solve my problems installing CUDA. I ran:

 docker pull tensorflow/tensorflow:latest-gpu-jupyter
 docker run --gpus all -it --rm -p 8889:8888 tensorflow/tensorflow:latest-gpu-jupyter

But when I check whether the Jupyter server has GPU support, it gives me this error:

I tensorflow/core/util/port.cc:113] oneDNN custom operations are on. You may see slightly different numerical results due to floating-point round-off errors from different computation orders. To turn them off, set the environment variable `TF_ENABLE_ONEDNN_OPTS=0`.
E external/local_xla/xla/stream_executor/cuda/cuda_dnn.cc:9261] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered
E external/local_xla/xla/stream_executor/cuda/cuda_fft.cc:607] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered
E external/local_xla/xla/stream_executor/cuda/cuda_blas.cc:1515] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered
I tensorflow/core/platform/cpu_feature_guard.cc:182] This TensorFlow binary is optimized to use available CPU instructions in performance-critical operations.
To enable the following instructions: AVX2 AVX_VNNI FMA, in other operations, rebuild TensorFlow with the appropriate compiler flags.
E external/local_xla/xla/stream_executor/cuda/cuda_driver.cc:274] failed call to cuInit: CUDA_ERROR_COMPAT_NOT_SUPPORTED_ON_DEVICE: forward compatibility was attempted on non supported HW
I external/local_xla/xla/stream_executor/cuda/cuda_diagnostics.cc:129] retrieving CUDA diagnostic information for host: 0e940b862ceb
I external/local_xla/xla/stream_executor/cuda/cuda_diagnostics.cc:136] hostname: 0e940b862ceb
I external/local_xla/xla/stream_executor/cuda/cuda_diagnostics.cc:159] libcuda reported version is: 545.23.6
I external/local_xla/xla/stream_executor/cuda/cuda_diagnostics.cc:163] kernel reported version is: 535.129.3
E external/local_xla/xla/stream_executor/cuda/cuda_diagnostics.cc:244] kernel version 535.129.3 does not match DSO version 545.23.6 -- cannot find working devices in this configuration
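
The last lines of the log report two driver versions: the one from the `libcuda` library that the process loaded (545.23.6) and the one from the kernel module (535.129.3). As a minimal sketch, assuming the log format shown above, the two values can be extracted and compared like this. The helper names `driver_versions` and `versions_match` are hypothetical and exist only for this illustration.

```python
import re

# Matches dotted driver versions such as "535.129.3".
_VERSION = r"(\d+(?:\.\d+)+)"


def driver_versions(log_text):
    """Return (libcuda_version, kernel_version); a missing entry is None."""
    dso = re.search(r"libcuda reported version is: " + _VERSION, log_text)
    kernel = re.search(r"kernel reported version is: " + _VERSION, log_text)
    return (dso.group(1) if dso else None,
            kernel.group(1) if kernel else None)


def versions_match(log_text):
    """True only if both versions were found and they are identical."""
    dso, kernel = driver_versions(log_text)
    return dso is not None and dso == kernel


# Two lines copied from the log in the question.
sample = (
    "I external/local_xla/xla/stream_executor/cuda/cuda_diagnostics.cc:159] "
    "libcuda reported version is: 545.23.6\n"
    "I external/local_xla/xla/stream_executor/cuda/cuda_diagnostics.cc:163] "
    "kernel reported version is: 535.129.3\n"
)

print(driver_versions(sample))  # ('545.23.6', '535.129.3')
print(versions_match(sample))   # False
```

A `False` result here corresponds to the "does not match DSO version" line at the end of the log.
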
docker tensorflow cuda
1 Answer

I get similar output and have the same problem. Are you running Docker on an RTX 40xx card? (I am trying to run it on an RTX 4060 Ti 16 GB.)

After a while, I tried running the previous version instead of the latest one:

 docker run --gpus all -it tensorflow/tensorflow:2.14.0-gpu python

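It prints some errors while loading, but it seems to train models on the GPU just fine. So far I have only run a quick test; I am about to run a longer one.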
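
To confirm that TensorFlow actually sees the GPU inside the container, a quick check along these lines can be run in the Python prompt or a notebook cell. This is only a sketch: `visible_gpus` is a hypothetical helper name, and the `ImportError` fallback is there just so the snippet also runs where TensorFlow is not installed.

```python
def visible_gpus():
    """Return the names of GPUs TensorFlow can see, or None without TensorFlow."""
    try:
        import tensorflow as tf
    except ImportError:
        return None
    # list_physical_devices("GPU") returns an empty list when no GPU is usable.
    return [device.name for device in tf.config.list_physical_devices("GPU")]


if __name__ == "__main__":
    gpus = visible_gpus()
    if gpus is None:
        print("TensorFlow is not installed in this environment")
    elif gpus:
        print("GPUs visible to TensorFlow:", gpus)
    else:
        print("TensorFlow is installed but sees no GPU")
```

An empty list would mean TensorFlow fell back to the CPU, as in the failing `latest-gpu-jupyter` run from the question.
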
