| 
 | 
 
 
 楼主 |
发表于 2024-2-27 22:45:25
|
显示全部楼层
 
 
 
 
请问有方法验证驱动的问题吗?因为用tensorflow跑一个简单的算法的话,输出的结果貌似显示cuda和cudnn都是通的。比如随便跑个简单的卷积模型 
import tensorflow as tf 
 
# 构建一个简单的卷积神经网络 
model = tf.keras.models.Sequential([ 
    tf.keras.layers.Conv2D(32, (3, 3), activation='relu', input_shape=(28, 28, 1)), 
    tf.keras.layers.MaxPooling2D((2, 2)), 
    tf.keras.layers.Flatten(), 
    tf.keras.layers.Dense(64, activation='relu'), 
    tf.keras.layers.Dense(10, activation='softmax') 
]) 
 
# 编译模型 
model.compile(optimizer='adam', 
              loss='sparse_categorical_crossentropy', 
              metrics=['accuracy']) 
 
# 打印模型摘要 
model.summary() 
 
# 检查是否使用了 GPU 
if tf.test.is_gpu_available(cuda_only=True): 
    print("CUDA is available!") 
else: 
    print("CUDA is not available.") 
 
我的会输出 
$python test_cudnn.py  
2024-02-27 15:57:41.403509: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.10.1 
2024-02-27 15:57:42.018270: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set 
2024-02-27 15:57:42.018812: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcuda.so.1 
2024-02-27 15:57:42.027762: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:941] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero 
2024-02-27 15:57:42.027864: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties:  
pciBusID: 0000:01:00.0 name: NVIDIA GeForce RTX 3090 computeCapability: 8.6 
coreClock: 1.695GHz coreCount: 82 deviceMemorySize: 23.68GiB deviceMemoryBandwidth: 871.81GiB/s 
2024-02-27 15:57:42.027875: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.10.1 
2024-02-27 15:57:42.029176: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.10 
2024-02-27 15:57:42.029199: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.10 
2024-02-27 15:57:42.030278: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10 
2024-02-27 15:57:42.030463: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10 
2024-02-27 15:57:42.031427: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10 
2024-02-27 15:57:42.031931: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.10 
2024-02-27 15:57:42.033867: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.7 
2024-02-27 15:57:42.033946: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:941] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero 
2024-02-27 15:57:42.034056: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:941] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero 
2024-02-27 15:57:42.034108: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0 
2024-02-27 15:57:42.034290: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  SSE4.1 SSE4.2 AVX AVX2 AVX512F FMA 
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags. 
2024-02-27 15:57:42.034751: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set 
2024-02-27 15:57:42.034802: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:941] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero 
2024-02-27 15:57:42.034867: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1720] Found device 0 with properties:  
pciBusID: 0000:01:00.0 name: NVIDIA GeForce RTX 3090 computeCapability: 8.6 
coreClock: 1.695GHz coreCount: 82 deviceMemorySize: 23.68GiB deviceMemoryBandwidth: 871.81GiB/s 
2024-02-27 15:57:42.034878: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.10.1 
2024-02-27 15:57:42.034887: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.10 
2024-02-27 15:57:42.034893: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.10 
2024-02-27 15:57:42.034898: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcufft.so.10 
2024-02-27 15:57:42.034903: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcurand.so.10 
2024-02-27 15:57:42.034909: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusolver.so.10 
2024-02-27 15:57:42.034914: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcusparse.so.10 
2024-02-27 15:57:42.034919: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudnn.so.7 
2024-02-27 15:57:42.034942: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:941] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero 
2024-02-27 15:57:42.035029: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:941] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero 
2024-02-27 15:57:42.035078: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0 
2024-02-27 15:57:42.035093: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcudart.so.10.1 
 
看起来cuda和cudnn都很正常。。。。显卡用的是3090,驱动版本是545.29.06 cuda是11.8 cudnn是8.9.7。好像nvidia不提供查询是否用F16半精度的方法?是只能通过加一个conda环境激活脚本来强制float32吗? |   
 
 
 
 |