CUDALink`

CUDADot

CUDADot[cuvec₁,cuvec₂]

gives the dot product of CUDA vectors cuvec₁ and cuvec₂.

CUDADot[cumat,cuvec]

gives the matrix-vector product of CUDA matrix cumat and CUDA vector cuvec.

CUDADot[cumat,cuspvec]

gives the matrix-vector product of CUDA matrix cumat and CUDA sparse vector cuspvec.

CUDADot[cumat₁, cumat₂]

gives the matrix-matrix product of CUDA matrices cumat₁ and cumat₂.

CUDADot[cuspmat₁, cuspmat₂]

gives the matrix-matrix product of CUDA sparse matrices cuspmat₁ and cuspmat₂.

CUDADot[cuspmat, cumat]

gives the matrix-matrix product of CUDA sparse matrix cuspmat and CUDA matrix cumat.

CUDADot[vec₁,vec₂]

gives the dot product of vec₁ and vec₂.

CUDADot[mat,vec]

gives the matrix-vector product of mat and vec.

CUDADot[mat₁, mat₂]

gives the matrix-matrix product of mat₁ and mat₂.

Details and Options

The CUDALink application must be loaded using Needs["CUDALink`"].
CUDADot works on CUDA matrices and CUDA vectors of types "Real64", "ComplexReal64", "Real32" and "ComplexReal32".
CUDADot works only on general vector types such as "Float", "Double", ….
CUDADot does not work on fixed vector structure types like "Float[2]", "Integer32[2]", ….

Examples


Basic Examples (7)

First, load the CUDALink application:

This performs the dot product:
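The original input cell is not preserved here. A minimal sketch, assuming a CUDA-capable GPU is available and using plain lists (the CUDADot[vec₁,vec₂] form) rather than CUDA vector objects, could look like this:

```wolfram
(* Load CUDALink before calling any CUDA function *)
Needs["CUDALink`"]

(* Two small example vectors; the values are illustrative only *)
u = Range[10];
v = Range[10];

(* Dot product computed on the GPU *)
CUDADot[u, v]

(* Built-in CPU reference for comparison; Range[10] . Range[10] is 385 *)
u . v
```

Comparing against the built-in Dot is a simple way to check the GPU result on small inputs.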

This performs matrix-vector multiplication:

Contents of CUDAVector:

Multiply a CUDA matrix and a CUDA sparse vector:

Contents of CUDAVector:

Multiply two CUDA matrices; the result is a CUDA matrix:

Contents of CUDAMatrix:

Multiply two CUDA sparse matrices; the result is a CUDA sparse matrix:

Contents of CUDASparseMatrix:

Multiply a CUDA sparse matrix with a CUDA matrix; the result is a CUDA matrix:

Contents of CUDAMatrix:

Scope (3)

For large vectors, performing operations on the GPU can be quicker.

This creates a large CUDA vector of type "Real64":

This performs the dot product on the GPU and measures the timing:

This performs the dot product on the CPU and measures the timing:

Performing the dot product on the GPU is faster:

Performance can be improved by using a CUDA vector of type "Real32":

Performing the dot product of a CUDA vector of type "Real32" on the GPU is even faster:
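The timing comparison described above can be sketched as follows. This is an illustration under stated assumptions, not the original example: it passes ordinary lists to CUDADot instead of CUDA vector objects, the vector length n is an arbitrary choice, and the variable names are hypothetical. With list inputs, the GPU timing presumably includes the host-to-device transfer, so the measured speedup may be smaller than with data already resident on the GPU.

```wolfram
Needs["CUDALink`"]

(* Two large random real vectors; the length is an arbitrary choice *)
n = 10^7;
u = RandomReal[1, n];
v = RandomReal[1, n];

(* AbsoluteTiming returns {elapsed seconds, result} *)
{gpuTime, gpuResult} = AbsoluteTiming[CUDADot[u, v]];
{cpuTime, cpuResult} = AbsoluteTiming[u . v];

(* Compare the two timings *)
{gpuTime, cpuTime}
```

Actual timings depend on the GPU, the driver and the vector size, so no expected output is given.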

CUDADot works with vectors in list form:
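A short sketch of the list-form calls, assuming a CUDA-capable GPU. The same calling convention is documented for matrices (CUDADot[mat,vec] and CUDADot[mat₁, mat₂]), so those are included; the input values are illustrative only:

```wolfram
Needs["CUDALink`"]

(* Small example inputs given as ordinary lists *)
mat1 = {{1, 2}, {3, 4}};
mat2 = {{5, 6}, {7, 8}};
vec = {5, 6};

(* Vector-vector, matrix-vector and matrix-matrix products on the GPU *)
CUDADot[vec, vec]
CUDADot[mat1, vec]
CUDADot[mat1, mat2]

(* Built-in CPU references: 61, {17, 39} and {{19, 22}, {43, 50}} *)
vec . vec
mat1 . vec
mat1 . mat2
```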

CUDADot supports CUDAMemory:

This multiplies the two input memories together:

Memory is retrieved using CUDAMemoryGet:

Memory must be freed with CUDAMemoryUnload:
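The CUDAMemory workflow above can be sketched end to end as follows. This is a hedged illustration: it assumes that CUDADot returns a CUDAMemory handle when given CUDAMemory inputs, as the captions suggest, and the names mem1, mem2 and res are hypothetical.

```wolfram
Needs["CUDALink`"]

(* Load two vectors into GPU memory; "Float" is one of the general
   types named in the Details section *)
mem1 = CUDAMemoryLoad[{1., 2., 3.}, "Float"];
mem2 = CUDAMemoryLoad[{4., 5., 6.}, "Float"];

(* Multiply the two input memories together *)
res = CUDADot[mem1, mem2];

(* Assumption: res is a CUDAMemory handle, so its contents are
   copied back to the host with CUDAMemoryGet *)
CUDAMemoryGet[res]

(* Free every handle that was allocated *)
Scan[CUDAMemoryUnload, {mem1, mem2, res}]
```

Keeping data in CUDAMemory avoids repeated host-to-device copies when several GPU operations are chained.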
