本文介绍在Windows环境下,如何搭建Visual Studio + CUDA的开发环境
首先需要安装好Visual Studio 2015/2017/2019
CUDA版本确定
在选择cuda安装时,cuda的版本需要和显卡型号对应算力进行匹配,例如GeForce GT 730算力为3.5。如何查看自己显卡的算力呢?安装了CUDA后可通过CUDA内置的deviceQuery工具输出显卡相关信息,”CUDA Capability Major/Minor version number”这一行的数字就是显卡算力的版本
data:image/s3,"s3://crabby-images/4966f/4966f3284501cf10bd68ff419cd1e650859d7a44" alt=""
cudav8.0、cudav9.0到cudav10.0都可以兼容最高3.5算力,可以在以下链接中选择不同的cuda版本
https://developer.nvidia.com/cuda-toolkit-archive
阅读在线文档中的”xxx Compatibility Guide”文档,其中显示的编译选项中显示了算力能力
data:image/s3,"s3://crabby-images/1d7b4/1d7b458b07526b8cbc045546e48a0bb62a4caa3b" alt=""
compute_20表示算力2.0,compute_35表示算力3.5
下载安装对应cuda后,在cmd中输入nvcc -V可查看版本
data:image/s3,"s3://crabby-images/b490f/b490fea8d26890fd4351b24f87283fd1d9b20aa3" alt=""
hello world
打开vs,文件→新建→NVIDIA→CUDA 8.0,新建一个test工程
data:image/s3,"s3://crabby-images/19ac5/19ac5213a5ad1de7bd22568b691317cc5019f092" alt=""
创建项目后,工程会自动生成一个kernel.cu源码文件
点击项目→test属性
data:image/s3,"s3://crabby-images/c33c5/c33c5476e5c0914af0e884b64b3d550f3b4a293c" alt=""
选中VC++目录→包含目录,为其添加CUDA的头文件路径 “$(CUDA_PATH)\include”
data:image/s3,"s3://crabby-images/f02c4/f02c464a80c42cc08016380e0572ad88dc3461ec" alt=""
data:image/s3,"s3://crabby-images/157fa/157fab092fc508f380cf976c28c8a2a7e7d1b7db" alt=""
选中库目录,为工程添加CUDA库目录 “$(CUDA_PATH)\lib\x64”
data:image/s3,"s3://crabby-images/15be1/15be1e7be20e5fa5905370a5dc2e3a6d21f132f2" alt=""
data:image/s3,"s3://crabby-images/f6790/f6790baa261746f9f1be26a3fa22593f39135c58" alt=""
选择Debug模式为x64
data:image/s3,"s3://crabby-images/bba76/bba761151b8235a3e6137fd1d6d53ae51079c555" alt=""
在main函数最后添加如下函数,暂停程序退出
1 | getc(stdin); |
点击本地Windows调试器,运行程序
data:image/s3,"s3://crabby-images/96ced/96ced6ec04b618a8b3b163554ea0019c87595916" alt=""
Nsight可视化调试
手动运行Nsight Monitor,在电脑右下脚图标处,右键Nsight→options
data:image/s3,"s3://crabby-images/dd1f5/dd1f575095192d1b47e45e784b51e856c400738c" alt=""
设置以下选项为True
data:image/s3,"s3://crabby-images/f86a7/f86a7557a4eb78d6285e6b512ef6ce7d1dabc67e" alt=""
再次右击Nsight图标点击Exit
在vs中打开Nsight,也在option中选择以下内容为True
data:image/s3,"s3://crabby-images/61c1a/61c1a8034189127bbe79524ea13dd24e76108468" alt=""
点击vs中的Nsight→Start Performance Analysis
data:image/s3,"s3://crabby-images/31bea/31bea0cecfaade2855d05cc8c6a2d1031299b450" alt=""
出现如下界面
data:image/s3,"s3://crabby-images/83882/838824e83e971ab3b077a36e424cca64fe55a1b0" alt=""
选中Trace Settings→System下面的几个选项和CUDA
data:image/s3,"s3://crabby-images/01029/01029d8787f3aec1d7713a11ab5a9056fb31745a" alt=""
点击下方中间状态灯的Launch
data:image/s3,"s3://crabby-images/7e5cd/7e5cde82d26799c5a11d50bc9e17a1db08cb100d" alt=""
运行完毕后关闭程序,Nsight会自动显示Session Overview
data:image/s3,"s3://crabby-images/d1077/d1077381d79023f0b558283ad8d7f64140b78255" alt=""
显示了整个GPU运行的概要信息,左上角下拉框选择Timeline可显示时间线
data:image/s3,"s3://crabby-images/bd5e7/bd5e7744f4b8f16e0385680083e210e939ec5f29" alt=""