trmv (USM Version)¶
Computes a matrix-vector product using a triangular matrix.
Description¶
The trmv
routines compute a matrix-vector product with a triangular
matrix. The operation is defined
where:
op(
A
) is one of op(A
) =A
, or op(A
) =A
T, or op(A
) =A
H,A
is ann
-by-n
unit or non-unit, upper or lower triangular band matrix,x
is a vector of lengthn
.
API¶
Syntax¶
namespace oneapi::mkl::blas::column_major {
sycl::event trmv(sycl::queue &queue,
onemkl::uplo upper_lower,
onemkl::transpose trans,
onemkl::diag unit_nonunit,
std::int64_t n,
const T *a,
std::int64_t lda,
T *x,
std::int64_t incx,
const std::vector<sycl::event> &dependencies = {})
}
namespace oneapi::mkl::blas::row_major {
sycl::event trmv(sycl::queue &queue,
onemkl::uplo upper_lower,
onemkl::transpose trans,
onemkl::diag unit_nonunit,
std::int64_t n,
const T *a,
std::int64_t lda,
T *x,
std::int64_t incx,
const std::vector<sycl::event> &dependencies = {})
}
The USM version of trmv
supports the following precisions and devices.
T |
Devices Supported |
---|---|
|
Host, CPU, and GPU |
|
Host, CPU, and GPU |
|
Host, CPU, and GPU |
|
Host, CPU, and GPU |
Input Parameters¶
- exec_queue
The queue where the routine should be executed.
- upper_lower
Specifies whether
A
is upper or lower triangular. See Data Types for more details.- trans
Specifies op(
A
), the transposition operation applied toA
. See Data Types for more details.- unit_nonunit
Specifies whether the matrix
A
is unit triangular or not. See Data Types for more details.- n
Numbers of rows and columns of
A
. Must be at least zero.- a
Pointer to input matrix
A
. The array holding input matrixA
must have size at leastlda
*n
. See Matrix Storage for more details.- lda
Leading dimension of matrix
A
. Must be at leastn
, and positive.- x
Pointer to input vector
x
. The array holding input vectorx
must be of size at least (1 + (n
- 1)*abs(incx
)). See Matrix Storage for more details.- incx
Stride of vector
x
.- dependencies
List of events to wait for before starting computation, if any. If omitted, defaults to no dependencies.
Output Parameters¶
- x
Pointer to the updated vector
x
.
Return Values¶
Output event to wait on to ensure computation is complete.