DPCT1018¶
Message¶
The <API name> was migrated, but the generated code performance may be sub-optimal due to the following reasons:
The <parameter name> could not be evaluated and may be smaller than <other parameter name>.
The <parameter name> is equal to <other parameter name>, but is greater than 1.
Detailed Help¶
This warning appears in two cases:
cublasSetMatrix
The Intel® DPC++ Compatibility Tool replaced thecublasSetMatrix
with memory copying from the host to the device. When the rows parameter of thecublasSetMatrix
is smaller than thelda
parameter, the generated code copies more data (lda*cols
) than the actual data available in the matrix (rows*cols
).To improve performance, consider changing the values of
lda
andldb
. If the rows parameter is greater than or equal tolda
, no action is required for this code.cublasSetVector
The Intel® DPC++ Compatibility Tool replaced thecublasSetVector
with memory copying from the host to the device. When theincx
parameter of thecublasSetVector
equals theincy
parameter, but is greater than 1, the generated code copies more data (incx*n
) than the actual data available in the vector (n
). To improve performance, consider changing the values ofincx
andincy
.
Suggestions to Fix¶
If the rows parameter of the cublasSetMatrix
is smaller than the lda
parameter and you observe performance issues, consider changing the values of
lda
and ldb
.
If the incx
parameter of the cublasSetVector
equals the incy
parameter,
but is greater than 1 and you observe performance issues, consider changing the
values of incx
and incy
.