CUDA_Toolkit_Release_Notes
CUDA_Toolkit_Release_Notes
---------------------------------
The release notes for the NVIDIA® CUDA® Toolkit can be found
online at
https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html.
Note: The release notes have been reorganized into two major
sections: the general CUDA release notes, and the CUDA
libraries release notes including historical information for
11.x releases.
CUDA Components
Component Name
Version Information
Supported Architectures
11.8.68
11.8.68
cuobjdump
11.8.68
CUPTI
11.8.68
11.8.68
11.8.68
x86_64
CUDA GDB
11.8.68
CUDA Memcheck
11.8.68
x86_64, POWER
CUDA Nsight
11.8.68
x86_64, POWER
CUDA NVCC
11.8.68
CUDA nvdisasm
11.8.68
11.8.68
CUDA nvprof
11.8.68
x86_64, POWER
CUDA nvprune
11.8.68
11.8.68
CUDA NVTX
11.8.68
CUDA NVVP
11.8.68
x86_64, POWER
11.8.68
CUDA cuBLAS
11.11.0.32
CUDA cuDLA
11.8.68
AArch64
CUDA cuFFT
10.9.0.40
CUDA cuFile
1.4.0.31
x86_64
CUDA cuRAND
10.3.0.68
CUDA cuSOLVER
11.4.1.30
x86_64, POWER, AArch64
CUDA cuSPARSE
11.7.5.68
CUDA NPP
11.8.0.68
CUDA nvJPEG
11.9.0.68
Nsight Compute
2022.3.0.14
Nsight NVTX
1.21018621
x86_64 (Windows)
Nsight Systems
2022.3.1.32
2022.3.0.22185
x86_64 (Windows)
nvidia_fs1
2.13.5
x86_64, AArch64
11.8.68
x86_64 (Windows)
520.43
x86_64, POWER, AArch64
521.14
x86_64 (Windows)
CUDA Driver
CUDA Toolkit
CUDA 11.8.x
>=450.80.02
>=452.39
CUDA 11.7.x
CUDA 11.6.x
CUDA 11.5.x
CUDA 11.4.x
CUDA 11.3.x
CUDA 11.2.x
>=450.36.06**
>=450.28.01**
>=451.22**
CUDA Toolkit
CUDA 11.8
>=520.43
>=521.14
>=515.48.07
>=516.31
CUDA 11.7 GA
>=515.43.04
>=516.01
>=510.47.03
>=511.65
>=510.47.03
>=511.65
CUDA 11.6 GA
>=510.39.01
>=511.23
>=495.29.05
>=496.13
>=495.29.05
>=496.13
CUDA 11.5 GA
>=495.29.05
>=496.04
>=470.82.01
>=472.50
>=470.82.01
>=472.50
>=470.57.02
>=471.41
>=470.57.02
>=471.41
CUDA 11.4.0 GA
>=470.42.01
>=471.11
>=465.19.01
>=465.89
CUDA 11.3.0 GA
>=465.19.01
>=465.89
>=460.32.03
>=461.33
>=460.32.03
>=461.09
CUDA 11.2.0 GA
>=460.27.03
>=460.82
>=455.32
>=456.81
CUDA 11.1 GA
>=455.23
>=456.38
>= 450.51.06
>= 451.82
CUDA 11.0.2 GA
>= 450.51.05
>= 451.48
CUDA 11.0.1 RC
>= 450.36.06
>= 451.22
CUDA 10.2.89
>= 440.33
>= 441.22
>= 418.39
>= 418.96
CUDA 10.0.130
>= 410.48
>= 411.31
>= 396.37
>= 398.26
>= 396.26
>= 397.44
>= 390.46
>= 391.29
>= 384.81
>= 385.54
>= 375.26
>= 376.51
>= 369.30
>= 352.31
>= 353.66
>= 346.46
>= 347.62
https://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html#package-
manager-metas
11.7. Update 1
11.7
NVIDIA Open GPU Kernel Modules: With CUDA 11.7 and R515
driver, NVIDIA is open sourcing the GPU kernel mode driver
under dual GPL/MIT license. Refer to
https://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html#open-gpu-
kernel-modules
for more information.
11.7
11.7
General CUDA
CUDA Compiler
2. CUDA Libraries
-----------------
* sm_35 (Kepler)
* sm_37 (Kepler)
* New Features
* Performance improvements for batched GEMV.
* Known Issues
* Resolved Issues
* New Features
* Resolved Issues
* Resolved Issues
* New Features
* Known Issues
* Resolved Issues
* Resolved Issues
* New Features
* Resolved Issues
* Known Issues
* Deprecated Features
* Known Issues
* Known Issues
* New Features
* Resolved Issues
* Resolved Issues
* New Features
* "0" - Off
* "1" - Error
* "2" - Trace
* "4" - Hints
* "8" - Heuristics
* typedef void(*cublasLtLoggerCallback_t)(int
logLevel, const char* functionName, const
char* message) -- A type of callback function
pointer.
* cublasStatus_t
cublasLtLoggerSetCallback(cublasLtLoggerCallback_t
callback) -- Allows to set a call back
functions that will be called for every
message that is logged by the library.
* cublasStatus_t cublasLtLoggerSetFile(FILE*
file) -- Allows to set the output file for the
logger. The file must be open and have write
permissions.
* cublasStatus_t cublasLtLoggerOpenFile(const
char* logFile) -- Allows to give a path in
which the logger should create the log file.
* cublasStatus_t cublasLtLoggerSetLevel(int
level) -- Allows to set the log level to one
of the above mentioned levels.
* cublasStatus_t cublasLtLoggerForceDisable() --
Allows to disable to logger for the entire
session. Once this API is being called, the
logger cannot be reactivated in the current
session.
* New Features
* cublasGemmEx, cublasGemmBatchedEx,
cublasGemmStridedBatchedEx and cublasLtMatmul
added new data type support for __nv_bfloat16
(CUDA_R_16BF).
* Deprecated Features
* Resolved Issues
* Known Issues
* Known Issues
* Known Issues
* Resolved Issues
* Known Issues
* Resolved Issues
* Known Issues
* New Features
* Performance improvements.
* Known Issues
* Deprecated Features
* New Features
* cuFFT shared libraries are now linked statically
against libstdc++ on Linux platforms.
* Known Issues
* Known Issues
* Resolved Issues
* Known Issues
* New Features
* Resolved Issues
* Known Issues
* New Features
* Resolved Issues
* Known Issues
* New Features
* Resolved Issues
* Known Issues
* New Features
* Supported PRNGs:
* CURAND_RNG_PSEUDO_XORWOW
* CURAND_RNG_PSEUDO_MRG32K3A
* CURAND_RNG_PSEUDO_MTGP32
* CURAND_RNG_PSEUDO_PHILOX4_32_10
* Resolved Issues
* Known Issues
* Resolved Issues
* Resolved Issues
* Resolved Issues
* Resolved Issues
* New Features
* Known Issues
* New Features
* Resolved Issues
* Resolved Issues
* Known Issues
* New Features
* cusolverDnXpotrf_bufferSize
* cusolverDnXpotrf
* cusolverDnXpotrs
* cusolverDnXgeqrf_bufferSize
* cusolverDnXgeqrf
* cusolverDnXgetrf_bufferSize
* cusolverDnXgetrf
* cusolverDnXgetrs
* cusolverDnXsyevd_bufferSize
* cusolverDnXsyevd
* cusolverDnXsyevdx_bufferSize
* cusolverDnXsyevdx
* cusolverDnXgesvd_bufferSize
* cusolverDnXgesvd
* cusolverDnPotrf_bufferSize
* cusolverDnPotrf
* cusolverDnPotrs
* cusolverDnGeqrf_bufferSize
* cusolverDnGeqrf
* cusolverDnGetrf_bufferSize
* cusolverDnGetrf
* cusolverDnGetrs
* cusolverDnSyevd_bufferSize
* cusolverDnSyevd
* cusolverDnSyevdx_bufferSize
* cusolverDnSyevdx
* cusolverDnGesvd_bufferSize
* cusolverDnGesvd
* New Features
* Resolved Issues
* New Features
* New Features
*
Better performance for cusparseSpMM COO Alg3 and
cusparseSpSM.
* Resolved Issues
Known Issues
* New Features
* Resolved Issues
Known Issues
* New Features
* Better performance
* Known Issues
* New Features
* Resolved Issues
* New Features
* Resolved Issues
* Resolved Issues
* Known Issues
* New Features
* Resolved Issues
* Deprecated Features
* cusparseXcsrsm2_zeroPivot, cusparseXcsrsm2_solve,
cusparseXcsrsm2_analysis, and
cusparseScsrsm2_bufferSizeExt have been deprecated in
favor of cusparseSpSM Generic APIs
* Deprecated Features
* cusparseScsrsv2_analysis, cusparseScsrsv2_solve,
cusparseXcsrsv2_zeroPivot, and
cusparseScsrsv2_bufferSize have been deprecated in
favor of cusparseSpSV.
* Resolved Issues
* Known Issues
* cusparseDestroySpVec, cusparseDestroyDnVec,
cusparseDestroySpMat, cusparseDestroyDnMat,
cusparseDestroy with NULL argument could cause
segmentation fault on Windows.
* New Features
* Resolved Issues
cusparseDestroySpVec, cusparseDestroyDnVec,
cusparseDestroySpMat, cusparseDestroyDnMat,
cusparseDestroy with NULL argument could cause
segmentation fault on Windows.
* Deprecated Features
* Known Issues
* New Features
* cusparseSparseToDense
* cusparseDenseToSparse
* Deprecated Features
* New Features
Ci=A⋅Bi
Ci=Ai⋅B
Ci=Ai⋅Bi
* New Features
Ci=A⋅Bi
Ci=Ai⋅B
Ci=Ai⋅Bi
* cusparse<t>gemmi()
* SpGEMM: cusparseXcsrgemm2_bufferSizeExt,
cusparseXcsrgemm2Nnz, cusparseXcsrgemm2
* New Features
* Deprecations
* Resolved Issues
* New Features
* Resolved Issues
* New Features
* Resolved Issues
* New Features
* Resolved Issues
* Resolved Issues
* New Features
* nppiSignedDistanceTransformAbsPBA_xxxxx_C1R_Ctx()
– Input and output combination supports (xxxxxx)
- 32f, 32f64f, 64f
* nppiDistanceTransformAbsPBA_xxxxx_C1R_Ctx() –
Input and output combination supports (xxxxxx) -
8u16u, 8s16u, 16u16u, 16s16u, 8u32f, 8s32f,
16u32f, 16s32f, 8u64f, 8s64f, 16u64f, 16s64f,
32f64f, 64f
* Resolved Issues
* New Features
* New Features
* New Features
* nppiDistanceTransformPBA_xxxxx_C1R_Ctx() – where
xxxxx specifies the input and output combination:
8u16u, 8s16u, 16u16u, 16s16u, 8u32f, 8s32f, 16u32f,
16s32f
* nppiSignedDistanceTransformPBA_32f_C1R_Ctx()
* Resolved Issues
* New Features
* New Features
*
Added nppiSegmentWatershed functions.
* Resolved Issues
* Known Issues
* Resolved Issues
* Resolved Issues
* Resolved Issues
* nvjpegDecodeBatchedEx()
* nvjpegDecodeBatchedSupportedEx()
* New Features
* Known Issues
* New Features
* New Features
* Known Issues
* Deprecated Features
Notices
-------
Notice
OpenCL
Trademarks
Copyright
-----------------------------------------