日本电子维修技术显卡cuDNN 7.0 RC也发布了。tensor core调用方式判明。

日期：2021-09-29 栏目：维修经验

老黄好忙。不过这些软件才是老黄的非对称优势。
https://developer.nvidia.com/rdp/cudnn-download

cuda 9.0 RC刚刚发布。
https://www.chiphell.com/thread-1761446-1-1.html

就一个头文件，看了下，多是paper算法实现层面的，ctc，rnn。和线代库（cublas）一样也是加了一个传统math和tensor op的枚举。

捕获.JPG (20.45 KB, 下载次数: 0)

2017-8-5 05:40 上传

要么和v100相关的内容还没添加进去（毕竟只是rc），要么tensor core的操作在用户层面是透明的。

======================更新=========================

果然是的，这个枚举和tensor core有关，文档中有描述

2.7.1. Tensor Core Operations
NotesSome notes on Tensor Core Operations use in cuDNN v7 on sm_70:
Tensor Core operations are supported on the Volta GPU family, those operationsperform parallel floating point accumulation of multiple floating point products.Setting the math mode to CUDNN_TENSOR_OP_MATH indicates that thelibrary will use Tensor Core operations as mention previously. The default isCUDNN_DEFAULT_MATH, this default indicates that the Tensor Core operationswill be avoided by the library.

设置为CUDNN_TENSOR_OP_MATH，将会启用tensor core进行运算，设置为CUDNN_DEFAULT_MATH，运算时会自动略过tensor core。

The default mode is a serialized operation, the TensorCore operations are parallelized operation, thus the two might result in slight differentnumerical results due to the different sequencing of operations. Note: The library fallsback to the default math mode when Tensor Core operations are not supported or notpermitted.The result of multiplying two matrices using Tensor Core Operations is very close, butnot always identical, to the product achieved using some sequence of legacy scalarfloating point operations.

So cuDNN requires explicit user opt-in before enabling theuse of Tensor Core Operations. However, experiments training common Deep Learningmodels show negligible difference between using Tensor Core Operations and legacyfloating point paths as measured by both final network accuracy and iteration count toconvergence. Consequently, the library treats both modes of operation as functionallyindistinguishable, and allows for the legacy paths to serve as legitimate fallbacks forcases in which the use of Tensor Core Operations is unsuitable.

用户层面需要显式的指定运算类型（是用tensor core进行运算还是使用传统的sp进行运算），在最终精度上两者没有什么区别（区别在于收敛性能）。当tensor core调用失败后，系统会自动使用传统sp来进行运算，无需用户再进行干涉。

新的方法，给tensorDescription设置运算类型。

捕获2.JPG (36.36 KB, 下载次数: 1)

2017-8-5 05:53 上传

评论
支持inner product/linear/matrix multiplication layer吗？

评论

都支持的，这些都算分子级的运算。

评论
能不能用OpenCL开发个类似的东西？

评论

当然能做，而且光做cudnn其实一点都不难，其实它就是每年各种新深度学习算法的cuda based实现。难的是做cuda库。电路电子维修求创维42c08RD电路图评论电视的图纸很少见评论电视的图纸很少见评论创维的图纸你要说版号，不然无能为力评论板号5800-p42ALM-0050 168P-P42CLM-01 电路电子维修我现在把定影部分拆出来了。想换下滚，因为卡纸。但是我发现灯管挡住了。拆不了。不会拆。论坛里的高手拆解过吗？评论认真看，认真瞧。果然有收
·中文新闻阿曼达·斯蒂尔（Amanda Platell）：哦，迈琳（Myleene），我对你的
·中文新闻迪拜的秘密性交易 - 和加油的英国男人 - 暴露了：在一次特别调

维修经验

日本电子维修技术显卡cuDNN 7.0 RC也发布了。tensor core调用方式判明。

CPUcpu-z 1.77版低调发布

CPU这几天经常开机黑屏，热重启后又正常

CPU超频求助！关于华擎H170和6700K

CPU液态金属会侵蚀cpu核心吗？

CPUAMD Zen处理器、AM4接口实物曝光：1331个针脚

CPUm6i究竟支不支持e3 1231v3

CPU华擎 HYPER 妖板正确玩法

CPUE5 2686 V3和i7 6800K如何选择

CPUHD530硬解4K能力还是有点弱呀！

CPU在组一个小机箱，关于i5 6600和i7 6700的选择

CPUwin10超频稳定，但是睡眠唤醒不了，pll电压di

CPU6900k 1.25V到4.2体质怎么样

CPUI3 6100 华擎B150M pro4超4.5g测试。

CPU系统稳定性测试，我发现prime95半个小时内问题

CPU7系u会兼容100系主板吗？

CPU请教各位：J3710和G1840，哪个性能稍好些？

CPU昨日遇到土豪朋友，又被吓到了，有朋友比这

CPU有心入5820k了，求教下温度问题

CPU6600&6600K才100的差价

CPU打算组双路E5 2670，大家有什么好的建议吗？

日本电子维修技术 显卡cuDNN 7.0 RC也发布了。tensor core调用方式判明。

相关推荐

日本电子维修技术显卡cuDNN 7.0 RC也发布了。tensor core调用方式判明。