BlueOceans - 博客园

2017年11月8日

为什么ResNet和DenseNet可以这么深？一文详解残差块为何有助于解决梯度弥散问题

摘要： https://zhuanlan.zhihu.com/p/28124810 阅读全文

posted @ 2017-11-08 20:27 BlueOceans 阅读(727) 评论(0) 推荐(0) 编辑

2017年11月6日

cuda编程-并行规约

摘要：利用shared memory计算，并避免bank conflict；通过每个block内部规约，然后再把所有block的计算结果在CPU端累加代码：阅读全文

posted @ 2017-11-06 22:48 BlueOceans 阅读(681) 评论(0) 推荐(0) 编辑

cuda编程-矩阵乘法（2）

摘要：采用shared memory加速代码合并访存：tile_A按行存储，tile_B按列存储，sum=row_tile_A * row_tile_B 阅读全文

posted @ 2017-11-06 21:28 BlueOceans 阅读(704) 评论(0) 推荐(0) 编辑

2017年11月5日

cuda编程-矩阵乘法（1）

摘要：本方法采用简单的单线程计算每组行和列乘加运算代码如下：阅读全文

posted @ 2017-11-05 21:54 BlueOceans 阅读(2239) 评论(0) 推荐(0) 编辑

linux利用CMakeLists编译cuda程序

摘要：文件目录： cudaTest |--utils.cu |--utils.h |--squaresum.cu |--squaresum.h |--test.cpp |--CMakeLists.txt 编译命令： $cd /root/cudaTest $mkdir build $cd build $cm 阅读全文

posted @ 2017-11-05 17:58 BlueOceans 阅读(7462) 评论(0) 推荐(0) 编辑

cuda编程视频资料

摘要：胡文美教授 http://www.gpuworld.cn/article/show/463.html 阅读全文

posted @ 2017-11-05 10:38 BlueOceans 阅读(493) 评论(0) 推荐(0) 编辑

2017年10月27日

Python在函数中使用*和**接收元组和列表

摘要： http://blog.csdn.net/delphiwcdj/article/details/5746560 阅读全文

posted @ 2017-10-27 16:02 BlueOceans 阅读(333) 评论(0) 推荐(0) 编辑

2017年10月24日

nvidia-smi实时刷新并高亮显示状态

摘要： watch -n 1 -d nvidia-smi 间隔1秒刷新阅读全文

posted @ 2017-10-24 20:04 BlueOceans 阅读(20749) 评论(0) 推荐(0) 编辑

摘要： http://www.cnblogs.com/simplelovecs/p/5145305.html 阅读全文

posted @ 2017-10-24 16:30 BlueOceans 阅读(155) 评论(0) 推荐(0) 编辑

离线安装Python包hickle，easydict

摘要：安装hickle source： https://github.com/telegraphic/hickle 1. cd to your downloaded hickle directory 2. python setup.py install 其他包类似安装，若已经安装了anaconda2则默认阅读全文

posted @ 2017-10-24 16:12 BlueOceans 阅读(5919) 评论(0) 推荐(0) 编辑