Calling a CUDA function from a .c file

Published: 2025/3/15

2014-04-19 17:17 Category: CUDA programming

Copyright notice: this is the blogger's original article and may not be reproduced without the blogger's permission.

Problem: suppose a user directory on Ubuntu contains two files, main.c and VectorAdd.cu. VectorAdd.cu defines the function VectAdd, and main.c provides the program's entry point, main. To add two vectors in main.c, it has to call the VectAdd function defined in VectorAdd.cu.

First, here are the contents of the two files.

// VectorAdd.cu
#include <cuda_runtime.h>

// Give VectAdd C linkage so that the C code in main.c can link against it.
extern "C" void VectAdd(int *a, int *b, int *c, int length);

// Kernel: each thread adds one pair of elements.
__global__ void Add(int *d_a, int *d_b, int *d_c, int length)
{
    int id = blockIdx.x * blockDim.x + threadIdx.x;
    if (id < length)
        d_c[id] = d_a[id] + d_b[id];
}

// Host wrapper: copy the inputs to the device, run the kernel, copy the result back.
void VectAdd(int *a, int *b, int *c, int length)
{
    size_t size = sizeof(int) * length;
    int *d_a, *d_b, *d_c;

    cudaMalloc((void**)&d_a, size);
    cudaMalloc((void**)&d_b, size);
    cudaMalloc((void**)&d_c, size);

    cudaMemcpy(d_a, a, size, cudaMemcpyHostToDevice);
    cudaMemcpy(d_b, b, size, cudaMemcpyHostToDevice);

    // Round the block count up so that every element gets a thread;
    // a single block of `length` threads only works for small vectors.
    int threads = 256;
    int blocks = (length + threads - 1) / threads;
    Add<<<blocks, threads>>>(d_a, d_b, d_c, length);

    // This copy runs after the kernel has finished on the default stream.
    cudaMemcpy(c, d_c, size, cudaMemcpyDeviceToHost);

    cudaFree(d_a);
    cudaFree(d_b);
    cudaFree(d_c);
}

// main.c
#include <stdio.h>
#include <stdlib.h>

/* Implemented in VectorAdd.cu and exported with C linkage. */
void VectAdd(int *a, int *b, int *c, int length);

int main(void)
{
    int *a, *b, *c;
    int length = 32;
    int i;

    a = (int*)malloc(sizeof(int) * length);
    b = (int*)malloc(sizeof(int) * length);
    c = (int*)malloc(sizeof(int) * length);

    /* Fill both inputs with 0..length-1, so c[i] should be 2*i. */
    for (i = 0; i < length; ++i)
    {
        a[i] = i;
        b[i] = i;
    }

    VectAdd(a, b, c, length);

    for (i = 0; i < length; i++)
    {
        printf("%d ", c[i]);
    }
    printf("\n");

    free(a);
    free(b);
    free(c);
    return 0;
}

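A .cu file is in fact compiled according to C++ syntax rules, so the problem above is really the problem of calling a function defined in a .cpp file from a .c file. This is why VectAdd is declared extern "C" in VectorAdd.cu: without it, the C++ compiler would mangle the function name and the C code could not link against it. For this discussion, the .cu file is compiled with nvcc and the .c file with gcc. The exact commands are shown in the following makefile: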

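default: libcuda run

CUDA_DIR=/usr/local/cuda
SDK_DIR=/home/NVIDIA_GPU_Computing_SDK/C
CC=nvcc
C = gcc
CPP = g++
SOURCE = main.c
DEST = main

# Compile the CUDA source with nvcc and pack the object file into a static library.
# INC and LIB are not set in this makefile, so they expand to nothing.
libcuda:
	$(CC) $(INC) $(LIB) -c VectorAdd.cu -o VectorAddCu.o
	ar cr libVectorAddCu.a VectorAddCu.o

# Link main.c against the static library and the CUDA runtime.
# With gcc, the C++ runtime has to be added explicitly with -lstdc++.
run:
	$(C) $(SOURCE) -lstdc++ -o $(DEST) -L$(CUDA_DIR)/lib64 -lcudart libVectorAddCu.a
# Alternative: link with g++ instead. g++ compiles main.c as C++, so main.c
# must then contain an extern "C" declaration of VectAdd.
#	$(CPP) $(SOURCE) -o $(DEST) -L$(CUDA_DIR)/lib64 -lcudart libVectorAddCu.a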
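Because nvcc compiles VectorAdd.cu as C++, the function that main.c calls has to be exported with C linkage. A common way to keep both sides in agreement is a shared header guarded by __cplusplus. The following is a minimal sketch and not part of the original post: the header name VectorAdd.h is hypothetical, and the loop-based VectAdd body is a CPU stand-in, added only so that the snippet compiles without a GPU.

```c
#include <assert.h>
#include <stddef.h>

/* ---- contents of a hypothetical shared header, VectorAdd.h ---- */
#ifdef __cplusplus
extern "C" {   /* seen only by nvcc/g++: turns off C++ name mangling */
#endif

void VectAdd(int *a, int *b, int *c, int length);

#ifdef __cplusplus
}
#endif
/* ---- end of header ---- */

/* CPU stand-in (an assumption, for illustration only). In the real
   project this definition comes from VectorAdd.cu via libVectorAddCu.a. */
void VectAdd(int *a, int *b, int *c, int length)
{
    assert(a != NULL && b != NULL && c != NULL && length >= 0);
    for (int i = 0; i < length; ++i)
        c[i] = a[i] + b[i];
}
```

With such a header, main.c would include it instead of relying on a hand-written prototype, and VectorAdd.cu would include it instead of repeating its own extern "C" line, so the same declaration is seen by both gcc and nvcc.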

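This article, "Calling a CUDA function from a .c file", has a lot in common with "Calling a function defined in C++ from a .c file". I will not repeat the detailed explanation here; if anything is unclear, see that article. Here I mainly cover the differences between the two.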

The Makefile above contains this fragment:

-L$(CUDA_DIR)/lib64 -lcudart libVectorAddCu.a

In my tests, leaving it out produces errors like the following:

libVectAddCu.a(VectAddCu.o): In function `__sti____cudaRegisterAll_42_tmpxft_00004317_00000000_4_VectAdd_cpp1_ii_e5583c85()':
tmpxft_00004317_00000000-1_VectAdd.cudafe1.cpp:(.text+0xe): undefined reference to `__cudaRegisterFatBinary'
tmpxft_00004317_00000000-1_VectAdd.cudafe1.cpp:(.text+0x69): undefined reference to `__cudaRegisterFunction'
libVectAddCu.a(VectAddCu.o): In function `__cudaUnregisterBinaryUtil()':
tmpxft_00004317_00000000-1_VectAdd.cudafe1.cpp:(.text+0x88): undefined reference to `__cudaUnregisterFatBinary'
libVectAddCu.a(VectAddCu.o): In function `__device_stub__Z3AddPiS_S_i(int*, int*, int*, int)':
tmpxft_00004317_00000000-1_VectAdd.cudafe1.cpp:(.text+0xb4): undefined reference to `cudaSetupArgument'
tmpxft_00004317_00000000-1_VectAdd.cudafe1.cpp:(.text+0xd0): undefined reference to `cudaSetupArgument'
tmpxft_00004317_00000000-1_VectAdd.cudafe1.cpp:(.text+0xec): undefined reference to `cudaSetupArgument'
tmpxft_00004317_00000000-1_VectAdd.cudafe1.cpp:(.text+0x108): undefined reference to `cudaSetupArgument'
libVectAddCu.a(VectAddCu.o): In function `VectAdd':
tmpxft_00004317_00000000-1_VectAdd.cudafe1.cpp:(.text+0x187): undefined reference to `cudaMalloc'
tmpxft_00004317_00000000-1_VectAdd.cudafe1.cpp:(.text+0x193): undefined reference to `cudaMalloc'
tmpxft_00004317_00000000-1_VectAdd.cudafe1.cpp:(.text+0x19f): undefined reference to `cudaMalloc'
tmpxft_00004317_00000000-1_VectAdd.cudafe1.cpp:(.text+0x1b4): undefined reference to `cudaMemcpy'
tmpxft_00004317_00000000-1_VectAdd.cudafe1.cpp:(.text+0x1c9): undefined reference to `cudaMemcpy'
tmpxft_00004317_00000000-1_VectAdd.cudafe1.cpp:(.text+0x216): undefined reference to `cudaConfigureCall'
tmpxft_00004317_00000000-1_VectAdd.cudafe1.cpp:(.text+0x243): undefined reference to `cudaMemcpy'
tmpxft_00004317_00000000-1_VectAdd.cudafe1.cpp:(.text+0x24c): undefined reference to `cudaFree'
tmpxft_00004317_00000000-1_VectAdd.cudafe1.cpp:(.text+0x255): undefined reference to `cudaFree'
tmpxft_00004317_00000000-1_VectAdd.cudafe1.cpp:(.text+0x25e): undefined reference to `cudaFree'

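What do these errors mean? They say that at link time the linker could not find the symbols listed above, such as `cudaMalloc' and `cudaFree'. I did not define the functions behind these symbols, so where do they come from? They are provided by the CUDA installation on the system and live in one of its library files. The flags -L$(CUDA_DIR)/lib64 -lcudart tell the linker to search $(CUDA_DIR)/lib64 and link against the libcudart.so file found there, which contains the definitions it is looking for.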

libVectorAddCu.a is a library file that I built myself, and it seems that it has to be placed at the very end:

$(C) $(SOURCE) -lstdc++ -o $(DEST) -L$(CUDA_DIR)/lib64 -lcudart libVectorAddCu.a
$(CPP) $(SOURCE) -o $(DEST) -L$(CUDA_DIR)/lib64 -lcudart libVectorAddCu.a

The arguments on these two lines have to appear in a particular order. I am not very familiar with Linux and do not yet fully understand why. If you know, please leave me a comment. Thanks!
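On the ordering question: the GNU linker processes its inputs from left to right, and a static archive (.a) is searched only for symbols that are still undefined at the point where the archive appears on the command line. That is consistent with libVectorAddCu.a having to come after main.c. The toy model below sketches that one rule and nothing more; it is not how the real linker is implemented. The Input struct and count_unresolved are made-up names, each input defines and references at most one symbol, and shared libraries such as libcudart.so, which are handled differently, are left out.

```c
#include <assert.h>
#include <string.h>

#define MAX_SYMS 16

/* One input on the link line (hypothetical model type). */
typedef struct {
    const char *name;
    int is_archive;        /* 1 for a static archive, 0 for an object file */
    const char *defines;   /* symbol this input defines, or NULL */
    const char *needs;     /* symbol this input references, or NULL */
} Input;

/* Index of sym in list, or -1 if it is not there. */
static int find(const char **list, int n, const char *sym)
{
    for (int i = 0; i < n; ++i)
        if (strcmp(list[i], sym) == 0)
            return i;
    return -1;
}

/* One left-to-right pass; returns how many symbols stay undefined. */
int count_unresolved(const Input *inputs, int n)
{
    const char *defined[MAX_SYMS];
    const char *undefined[MAX_SYMS];
    int n_def = 0, n_undef = 0;

    for (int i = 0; i < n; ++i) {
        const Input *in = &inputs[i];

        /* An archive that satisfies nothing right now is skipped
           and never looked at again. */
        if (in->is_archive &&
            (in->defines == NULL ||
             find(undefined, n_undef, in->defines) < 0))
            continue;

        if (in->defines != NULL) {
            int k = find(undefined, n_undef, in->defines);
            if (k >= 0)
                undefined[k] = undefined[--n_undef];  /* resolved */
            assert(n_def < MAX_SYMS);
            defined[n_def++] = in->defines;
        }
        if (in->needs != NULL &&
            find(defined, n_def, in->needs) < 0 &&
            find(undefined, n_undef, in->needs) < 0) {
            assert(n_undef < MAX_SYMS);
            undefined[n_undef++] = in->needs;
        }
    }
    return n_undef;
}
```

Given main.o (referencing VectAdd) followed by libVectorAddCu.a (defining VectAdd), count_unresolved returns 0. With the two inputs swapped it returns 1, which corresponds to an "undefined reference" error from the linker.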
