

CUDA memory optimization techniques for CUDA applications running on NVIDIA Fermi GPUs

  • Resource size: 391.75 kB
  • Uploaded: 2021-06-30
  • Downloads: 0
  • Views: 1
  • Resource points: 1 point
  • Tags: cuda Academic Performance GPU

Resource Description

This project demonstrates optimization techniques for NVIDIA's Tesla 10 and Fermi devices using the CUDA programming language. The techniques used were: (1) use of shared memory and the L1 and L2 caches; (2) avoiding divergent branches; (3) memory coalescing; (4) data prefetching; and (5) conflict-free shared memory access. They were demonstrated on two applications, BFS (breadth-first search) and matrix multiplication, and reduced execution time for both. The speedup comes from cutting the time spent accessing the GPU's global memory.
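As an illustration of the first and third techniques (shared memory and memory coalescing), here is a minimal sketch of a tiled matrix multiply. It is not the project's code, which is not reproduced on this page. The tile width `TILE`, the kernel name `matmulTiled`, and the host helper `maxErrorVsCpu` are all assumptions made for the example. CUDA error checking is left out for brevity.

```cuda
#include <cmath>
#include <cstddef>
#include <vector>
#include <cuda_runtime.h>

constexpr int TILE = 16;  // assumed tile width, not taken from the project

// C = A * B for square n x n row-major matrices.
// Each block stages one TILE x TILE tile of A and of B in shared memory,
// so each global-memory element is loaded once per tile, not once per thread.
__global__ void matmulTiled(const float* A, const float* B, float* C, int n) {
    __shared__ float As[TILE][TILE];
    __shared__ float Bs[TILE][TILE];

    int row = blockIdx.y * TILE + threadIdx.y;
    int col = blockIdx.x * TILE + threadIdx.x;
    float acc = 0.0f;

    for (int t = 0; t < (n + TILE - 1) / TILE; ++t) {
        int aCol = t * TILE + threadIdx.x;
        int bRow = t * TILE + threadIdx.y;
        // Threads with consecutive threadIdx.x read consecutive addresses,
        // so both global loads are coalesced. Out-of-range slots get 0.
        As[threadIdx.y][threadIdx.x] = (row < n && aCol < n) ? A[row * n + aCol] : 0.0f;
        Bs[threadIdx.y][threadIdx.x] = (bRow < n && col < n) ? B[bRow * n + col] : 0.0f;
        __syncthreads();  // tile fully loaded before anyone reads it

        for (int k = 0; k < TILE; ++k)
            acc += As[threadIdx.y][k] * Bs[k][threadIdx.x];
        __syncthreads();  // everyone done before the tile is overwritten
    }
    if (row < n && col < n) C[row * n + col] = acc;
}

// Hypothetical host helper: runs the kernel on small integer-valued inputs
// and returns the largest absolute difference from a CPU reference.
float maxErrorVsCpu(int n) {
    std::vector<float> hA(n * n), hB(n * n), hC(n * n);
    for (int i = 0; i < n * n; ++i) {
        hA[i] = static_cast<float>(i % 7);
        hB[i] = static_cast<float>(i % 5);
    }
    std::size_t bytes = static_cast<std::size_t>(n) * n * sizeof(float);
    float *dA = nullptr, *dB = nullptr, *dC = nullptr;
    cudaMalloc(&dA, bytes);
    cudaMalloc(&dB, bytes);
    cudaMalloc(&dC, bytes);
    cudaMemcpy(dA, hA.data(), bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(dB, hB.data(), bytes, cudaMemcpyHostToDevice);

    dim3 block(TILE, TILE);
    dim3 grid((n + TILE - 1) / TILE, (n + TILE - 1) / TILE);
    matmulTiled<<<grid, block>>>(dA, dB, dC, n);
    // This blocking copy also waits for the kernel to finish.
    cudaMemcpy(hC.data(), dC, bytes, cudaMemcpyDeviceToHost);
    cudaFree(dA);
    cudaFree(dB);
    cudaFree(dC);

    float maxErr = 0.0f;
    for (int i = 0; i < n; ++i)
        for (int j = 0; j < n; ++j) {
            float ref = 0.0f;
            for (int k = 0; k < n; ++k) ref += hA[i * n + k] * hB[k * n + j];
            maxErr = std::fmax(maxErr, std::fabs(ref - hC[i * n + j]));
        }
    return maxErr;
}
```

To try it, call `maxErrorVsCpu(50)` from a `main` function and compile with `nvcc`. A size that is not a multiple of `TILE` exercises the boundary checks. Whether 16 is a good tile width on a given device is something to measure, not assume.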
