GPU Developer's Guide

Copyright©️ 2025 CryptoLab, Inc. All rights reserved.



CudaTools

1.1 Basic Information

  • Namespace: HEaaN::CudaTools

  • Purpose: Provides basic utility functions for CUDA device management and monitoring

1.2 API Reference

1.2.1 Device Availability Check

bool isAvailable()
  • Function: Checks the availability of CUDA devices

  • Return Value: true if a CUDA device is available, false otherwise

  • Usage Example:

#include <HEaaN/HEaaN.hpp>
...
if(!HEaaN::CudaTools::isAvailable()) {
    return -1;
}

1.2.2 Device Synchronization

void cudaDeviceSynchronize()
  • Function: Waits until all CUDA device operations are completed

  • When to Use: Call it when the host must wait for GPU work to finish (e.g., before timing code or reading results); it blocks the host thread until all previously launched CUDA kernels have completed execution.

1.2.3 Device Information Management

int cudaGetDevice()
int cudaDeviceCount()
void cudaSetDevice(int device_id)
  • Functions:

    • cudaGetDevice() : Returns the ID of the currently active CUDA device

    • cudaDeviceCount() : Returns the number of available CUDA devices in the system

    • cudaSetDevice(int device_id) : Sets the active CUDA device

  • Usage Example:

int device_count = HEaaN::CudaTools::cudaDeviceCount();
if (device_count > 0) {
    HEaaN::CudaTools::cudaSetDevice(0); // Use the first GPU
}
  • Note: Currently gpu-run allocates a single GPU to run your binary. Using a device_id other than 0 will lead to a runtime error.

1.2.4 Profiling Tools

void nvtxPush(const char *msg)
void nvtxPop()
  • Function: Sets NVTX markers for NVIDIA profilers (e.g., Nsight Systems)

  • Purpose: Performance profiling of CUDA code

  • Usage Example:

HEaaN::CudaTools::nvtxPush("CUDA function start");
// ... perform cuda operation ...
HEaaN::CudaTools::nvtxPop();

1.2.5 Memory Information

std::pair<u64, u64> getCudaMemoryInfo()
  • Function: Retrieves memory information of the current CUDA device

  • Return Value: {free_memory, total_memory} in bytes

  • Usage Example:

auto [free, total] = HEaaN::CudaTools::getCudaMemoryInfo();



For detailed information, please refer to the official documentation of NVIDIA CUDA Runtime API.
