Summary: 1. cudaMemcpy() <--> cudaMalloc() // linear memory copy: cudaMalloc((void**)&dev_A, data_size); cudaMemcpy(dev_A, host_A, data_size, cudaMemcpyHostToDevice); 2. cudaMemcpy2D() <--> cudaMallocPitch() // linear memory copy: cudaError_t cudaMemcpy2D(void *dst, size_t dpitch, const void *src, si...
posted @ 2013-04-12 14:09 traceorigin Views (2184) Comments (0) Recommended (1)
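The two pairings named in the summary above, cudaMalloc() with cudaMemcpy() and cudaMallocPitch() with cudaMemcpy2D(), can be sketched roughly as follows. This is a minimal illustration, not the post's own code: the helper names `roundtrip_linear` and `roundtrip_pitched`, the `float` element type, and the host-to-device-and-back check are all assumptions made for the example.

```cuda
#include <cuda_runtime.h>
#include <vector>

// Hypothetical helper: copy n floats host -> device -> host with
// cudaMalloc()/cudaMemcpy(). Returns true if every call succeeds and
// the data comes back unchanged.
bool roundtrip_linear(size_t n) {
    std::vector<float> host_A(n), host_B(n, 0.0f);
    for (size_t i = 0; i < n; ++i) host_A[i] = static_cast<float>(i);

    const size_t data_size = n * sizeof(float);
    float *dev_A = nullptr;
    if (cudaMalloc((void**)&dev_A, data_size) != cudaSuccess) return false;

    bool ok =
        cudaMemcpy(dev_A, host_A.data(), data_size, cudaMemcpyHostToDevice) == cudaSuccess &&
        cudaMemcpy(host_B.data(), dev_A, data_size, cudaMemcpyDeviceToHost) == cudaSuccess;
    cudaFree(dev_A);
    return ok && host_A == host_B;
}

// Hypothetical helper: the same round trip for a width x height matrix,
// using cudaMallocPitch()/cudaMemcpy2D().
bool roundtrip_pitched(size_t width, size_t height) {
    std::vector<float> host_A(width * height), host_B(width * height, 0.0f);
    for (size_t i = 0; i < host_A.size(); ++i) host_A[i] = static_cast<float>(i);

    const size_t row_bytes = width * sizeof(float);  // host rows are tightly packed
    float *dev_A = nullptr;
    size_t dev_pitch = 0;  // padded device row size in bytes, chosen by the runtime
    if (cudaMallocPitch((void**)&dev_A, &dev_pitch, row_bytes, height) != cudaSuccess)
        return false;

    // Argument order: dst, dpitch, src, spitch, width in bytes, height in rows, kind.
    bool ok =
        cudaMemcpy2D(dev_A, dev_pitch, host_A.data(), row_bytes,
                     row_bytes, height, cudaMemcpyHostToDevice) == cudaSuccess &&
        cudaMemcpy2D(host_B.data(), row_bytes, dev_A, dev_pitch,
                     row_bytes, height, cudaMemcpyDeviceToHost) == cudaSuccess;
    cudaFree(dev_A);
    return ok && host_A == host_B;
}
```

Both helpers are meant to be called from a host program built with nvcc on a machine with a CUDA device. In the pitched version the source and destination pitches differ: the host side uses the packed row size, while the device side uses whatever pitch cudaMallocPitch() returned.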
Summary: CUDA memory allocation really is a mess, so this time I summarize array allocation. Overview: arrays can be allocated with cudaMallocArray() and cudaMalloc3DArray(). 1. cudaMallocArray(): cudaError_t cudaMallocArray(struct cudaArray **array, const struct cudaChannelFormatDesc *desc, size_t width, size_t height = 0, unsigned int flags = 0) Example: 2DA...
posted @ 2013-04-12 13:57 traceorigin Views (2637) Comments (0) Recommended (0)
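The post's own 2D example is cut off in the summary above, so the following is only a guess at what a cudaMallocArray() allocation might look like in practice. The helper name `roundtrip_array2d`, the `float` channel format, and the use of cudaMemcpy2DToArray()/cudaMemcpy2DFromArray() to move the data are assumptions made for illustration.

```cuda
#include <cuda_runtime.h>
#include <vector>

// Hypothetical helper: allocate a width x height 2D CUDA array of floats,
// copy host data into it and back out. Returns true if every call succeeds
// and the data comes back unchanged.
bool roundtrip_array2d(size_t width, size_t height) {
    std::vector<float> host_in(width * height), host_out(width * height, 0.0f);
    for (size_t i = 0; i < host_in.size(); ++i) host_in[i] = static_cast<float>(i);

    // One 32-bit float channel per element.
    cudaChannelFormatDesc desc = cudaCreateChannelDesc<float>();

    // Width and height are given in elements here, not bytes.
    cudaArray_t dev_array = nullptr;
    if (cudaMallocArray(&dev_array, &desc, width, height) != cudaSuccess)
        return false;

    // The copy routines take the row width and the host pitch in bytes.
    const size_t row_bytes = width * sizeof(float);
    bool ok =
        cudaMemcpy2DToArray(dev_array, 0, 0, host_in.data(), row_bytes,
                            row_bytes, height, cudaMemcpyHostToDevice) == cudaSuccess &&
        cudaMemcpy2DFromArray(host_out.data(), row_bytes, dev_array, 0, 0,
                              row_bytes, height, cudaMemcpyDeviceToHost) == cudaSuccess;

    cudaFreeArray(dev_array);  // CUDA arrays are released with cudaFreeArray(), not cudaFree()
    return ok && host_in == host_out;
}
```

Note the unit mismatch that tends to cause confusion: cudaMallocArray() sizes the array in elements, while the 2D copy functions expect widths and pitches in bytes.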