代码之家 › 专栏 › 技术社区 › Rahn

将GPU内存分配给类的成员变量的正确方法是什么?

cuda memory c++

Rahn · 技术社区 · 1 年前

基本上我希望为类分配GPU内存 Image 的成员变量 frame 具有 cudaMallocManaged :

class Image {
public:
    Color* frame;
};

但通过以下代码,我得到了 CUDA error = 1 为指针分配时 框架 :

#include <iostream>

// limited version of checkCudaErrors from helper_cuda.h in CUDA examples
#define checkCudaErrors(val) check_cuda((val), #val, __FILE__, __LINE__)

void check_cuda(cudaError_t result, char const* const func, const char* const file,
                int const line) {
    if (result) {
        std::cerr << "CUDA error = " << static_cast<unsigned int>(result) << " at " << file << ":"
                << line << " '" << func << "' \n";
        // Make sure we call CUDA Device Reset before exiting
        cudaDeviceReset();
        exit(-1);
    }
}

class Color {
public:
    double r, g, b;

    __host__ __device__ Color() : r(0.0), g(0.0), b(0.0) {
    }
};


class Image {
public:
    Color* frame;
};

int main() {
    int width = 1960;
    int height = 1080;

    Image *image;
    checkCudaErrors(cudaMallocManaged((void **)&image, sizeof(Image)));
    checkCudaErrors(cudaMallocManaged((void **)&(image->frame), sizeof(Color) * width, height));

    return 0;
}

将GPU内存分配给该成员变量的正确方法是什么?

1 回复 | 直到 1 年前

463035818_is_not_an_ai 1 年前

cudaMallocManaged 声明如下:

__host__ âcudaError_t cudaMallocManaged(void** devPtr, size_t size,
                                       unsigned int flags = cudaMemAttachGlobal)

所以你可以使用这个:

checkCudaErrors(cudaMallocManaged((void **)&(image->frame),
                                  sizeof(Color) * width * height));

注:第三个论点, flags ,在此处左侧使用默认值 cudaMemAttachGlobal 大小是通过乘以 sizeof(Color) , width 和 height .

推荐文章

AstralHex · 矩阵乘法代码工作不正常

7 月前

Baba Dan Constantin · SSE4.1在矩阵4x4乘法上比SSE3慢?

7 月前

Giogre · 为包含许多数值字段的简单“struct”重载比较运算符

8 月前

einpoklum · 定义一个并不真正提供now()函数的std::chrono Clock是“合法的”吗?

8 月前

Fishie · 作为类成员的智能指针是否仍然自动释放?[关闭]

8 月前

Die4Toast · 递归调用成员箭头运算符->

8 月前

Angle.Bracket · 如何用C++将UTF-8文件名写入MS Windows控制台?

8 月前

Anka HanÄ±m · 关于结构和动态数组地址的问题

8 月前

Adam Barnes · 我如何定义一个基于constexpr函数返回值进行限制的概念?

8 月前

user2138149 · 为什么我不能获取包含多个元素的结构体中某些元素的地址?[副本]

8 月前