
Automatic Vectorization of OpenCL Kernel Code

  • File size: 726.88 kB
  • Uploaded: 2021-06-30
  • Downloads: 0
  • Views: 1
  • Cost: 1 point
  • Tags: opencl, code, kernel, automatic vectorization

Resource Description

We now live in a SIMD world. The cores of NVIDIA GPUs are scalar cores, whereas each core of an AMD GPU, labeled a vector/streaming core, contains several processing elements. Intel's newly released multi-core CPUs feature AVX, which widens the SIMD registers from 128 bits to 256 bits. To distinguish these SIMD units from the large-scale SIMD of vector machines and from the SIMT of NVIDIA GPUs, we call them small-scale SIMD or short SIMD, i.e. SSIMD. When mapping OpenCL programs to many-core processors, programmers typically map one work-item onto one core (a scalar core or a stream core). On multi-core processors, work-items are mapped to a core one work-group at a time, but they are still scheduled onto the corresponding core one by one. If we code in the traditional CUDA style (without any vectorization), only a small proportion of the theoretical computational potential of these SSIMD machines is tapped. Therefore, if we do not want
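To make the contrast in the description concrete, here is a minimal sketch in plain C that emulates work-items on the host. It is not taken from the packaged vectorizer. The function names, the SAXPY workload, and `VEC_WIDTH` (set to 4, as if each work-item used a 4-wide float vector) are assumptions chosen for illustration. The scalar version gives each emulated work-item one element, the vectorized version gives each work-item `VEC_WIDTH` adjacent elements so that a short SIMD unit could process them together.

```c
#include <stddef.h>

/* Assumed lane count for illustration, e.g. four floats in a 128-bit register. */
#define VEC_WIDTH 4

/* Scalar style: one emulated work-item updates exactly one element. */
void saxpy_scalar_item(size_t gid, float a, const float *x, float *y) {
    y[gid] = a * x[gid] + y[gid];
}

/* Vectorized style: one emulated work-item updates VEC_WIDTH adjacent
 * elements. The fixed-trip inner loop stands in for a single vector
 * operation on a short SIMD unit. */
void saxpy_vec_item(size_t gid, float a, const float *x, float *y) {
    size_t base = gid * VEC_WIDTH;
    for (size_t lane = 0; lane < VEC_WIDTH; ++lane) {
        y[base + lane] = a * x[base + lane] + y[base + lane];
    }
}

/* Emulated launch: n work-items, one element each. */
void saxpy_scalar(size_t n, float a, const float *x, float *y) {
    for (size_t gid = 0; gid < n; ++gid) {
        saxpy_scalar_item(gid, a, x, y);
    }
}

/* Emulated launch: n / VEC_WIDTH work-items, VEC_WIDTH elements each,
 * followed by a scalar tail for the elements that do not fill a vector. */
void saxpy_vectorized(size_t n, float a, const float *x, float *y) {
    size_t items = n / VEC_WIDTH;
    for (size_t gid = 0; gid < items; ++gid) {
        saxpy_vec_item(gid, a, x, y);
    }
    for (size_t i = items * VEC_WIDTH; i < n; ++i) {
        saxpy_scalar_item(i, a, x, y);
    }
}
```

Both launches compute the same result. The vectorized one issues a quarter as many work-items, each doing four lanes of work, and that is the kind of restructuring an automatic vectorizer would have to perform on a scalar kernel. The scalar tail covers input lengths that are not a multiple of `VEC_WIDTH`.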

File List

vectorizer_04122k12
benchmarks
intra_v
inter_v
