oneAPI DPC++ / SYCL runtime + math libraries
Selects the device, manages USM and queues, exposes device-side templates, and hands device images to the driver.
| Library | What it is | NVIDIA counterpart |
| --- | --- | --- |
| libsycl.so (DPC++ SYCL runtime) | The SYCL runtime: device discovery, queues, USM, kernel launch, and host-side template glue. Owns most runtime error messages. | libcudart.so (CUDA Runtime) + libnvrtc |
| ext::oneapi::matrix (joint_matrix<…>) | Header-only SYCL templates for warp/sub-group matrix-multiply on the XMX engines; lowered to DPAS by IGC. | nvcuda::wmma / mma.sync (Tensor Core intrinsics) |
| oneDNN (libdnnl) | Deep-learning primitive library: matmul, convolution, attention, with its own tuned GPU codegen. | cuDNN |
| oneMKL (libmkl_sycl) | Math kernel library: dense/sparse BLAS, LAPACK, FFT, RNG for the GPU. | cuBLAS / cuSOLVER / cuFFT / cuRAND |
| libimf / SVML (device math) | Intel math functions (sin, exp, …) linked into device code for accurate/vectorized transcendentals. | libdevice (NVVM math bitcode) |
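
To make the joint_matrix row more concrete: the operation such sub-group matrix templates express is a tile multiply-accumulate, D = A × B + C, over small fixed-size tiles. The block below is a minimal scalar sketch of that arithmetic in plain C++. It is not SYCL code and does not use the ext::oneapi::matrix API; the function name `tile_mad`, the row-major layout, and the runtime tile dimensions are assumptions made for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Scalar reference for a tile multiply-accumulate: D = A * B + C.
// A is m x k, B is k x n, C and D are m x n, all stored row-major.
// Hypothetical helper for illustration; not part of any oneAPI header.
std::vector<float> tile_mad(const std::vector<float>& a,
                            const std::vector<float>& b,
                            const std::vector<float>& c,
                            std::size_t m, std::size_t n, std::size_t k) {
    // Reject tiles whose storage does not match the stated dimensions.
    assert(a.size() == m * k);
    assert(b.size() == k * n);
    assert(c.size() == m * n);

    std::vector<float> d = c;  // start from the accumulator tile
    for (std::size_t i = 0; i < m; ++i) {
        for (std::size_t j = 0; j < n; ++j) {
            for (std::size_t p = 0; p < k; ++p) {
                d[i * n + j] += a[i * k + p] * b[p * n + j];
            }
        }
    }
    return d;
}
```

For example, with 2×2 tiles A = [[1, 2], [3, 4]], B = [[5, 6], [7, 8]] and C filled with ones, `tile_mad(a, b, c, 2, 2, 2)` returns [[20, 23], [44, 51]]. A loop nest like this is useful mainly as a readable specification for checking results computed by the hardware path.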