摘要: 矩阵转置:__global__ void TransDtD(float*des, float*src, int srcH, int srcW){ int idx = blockIdx.x*blockDim.x + threadIdx.x; //如果srcH*srcW>BLOCK_NUM*THREA... 阅读全文
posted @ 2014-08-24 22:08 默如诉 阅读(293) 评论(0) 推荐(0) 编辑