Intel® oneAPI Math Kernel Library Developer Reference - Fortran
Copies from a global parallel array into a local replicated array or vice versa.
call pslacp3(m, i, a, desca, b, ldb, ii, jj, rev)
call pdlacp3(m, i, a, desca, b, ldb, ii, jj, rev)
call pclacp3(m, i, a, desca, b, ldb, ii, jj, rev)
call pzlacp3(m, i, a, desca, b, ldb, ii, jj, rev)
This is an auxiliary routine that copies from a global parallel array into a local replicated array or vice versa. Note that the entire submatrix that is copied is placed on one or more nodes. The receiving node can be specified precisely, or all nodes can receive, or just one row or column of nodes.
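The choice of receiving nodes can be modeled with a small predicate. The following is a minimal sketch of the documented selection rule for the ii and jj arguments when rev = 0, where a value of -1 acts as a wildcard. It is not the library's implementation: the module name lacp3_targets, the function receives_copy, and the 2-by-3 process grid are hypothetical and used only for illustration.

```fortran
module lacp3_targets
  implicit none
contains
  ! Hypothetical helper (not part of the library): returns .true. if the
  ! process at grid coordinates (myrow, mycol) is a destination of the
  ! copy when rev = 0. A value of -1 for ii or jj matches every process
  ! row or process column, respectively.
  pure logical function receives_copy(ii, jj, myrow, mycol)
    integer, intent(in) :: ii, jj, myrow, mycol
    receives_copy = (ii == -1 .or. ii == myrow) .and. &
                    (jj == -1 .or. jj == mycol)
  end function receives_copy
end module lacp3_targets

program show_targets
  use lacp3_targets
  implicit none
  ! Assumed 2 x 3 process grid, for illustration only.
  integer, parameter :: nprow = 2, npcol = 3
  integer :: r, c

  ! ii = -1, jj = 1: every process in process column 1 is a destination.
  do r = 0, nprow - 1
    print '(3L2)', (receives_copy(-1, 1, r, c), c = 0, npcol - 1)
  end do
end program show_targets
```

In this sketch, only the entries for process column 1 are true in each printed row, which corresponds to the case where one column of nodes receives the data.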
(global) INTEGER.
m is the order of the square submatrix that is copied.
m ≥ 0. Unchanged on exit.
(global) INTEGER. A(i, i) is the global location that the copying starts from. Unchanged on exit.
(local) REAL for pslacp3
DOUBLE PRECISION for pdlacp3
COMPLEX for pclacp3
DOUBLE COMPLEX for pzlacp3
Array of size (lld_a,LOCc(n_a)). On entry, the parallel matrix to be copied into or from.
(global and local) INTEGER array of size dlen_. The array descriptor for the distributed matrix A.
(local)
REAL for pslacp3
DOUBLE PRECISION for pdlacp3
COMPLEX for pclacp3
DOUBLE COMPLEX for pzlacp3
Array of size (lld_b,LOCc(m)). If rev = 0, this is the global portion of the matrix A(i:i+m-1, i:i+m-1). If rev = 1, this is unchanged on exit.
(local)
INTEGER.
The leading dimension of B.
(global) INTEGER. By calling the routine with rev = 0 and then with rev = 1, data can be sent out and returned again. If rev = 0, then ii is the destination row index and jj is the destination column index for the node(s) receiving the replicated B. If ii ≥ 0 and jj ≥ 0, then node (ii, jj) receives the data. If ii = -1 and jj ≥ 0, then all rows in column jj receive the data. If ii ≥ 0 and jj = -1, then all columns in row ii receive the data. If ii = -1 and jj = -1, then all nodes receive the data. If rev != 0, then ii is the source row index and jj is the source column index for the node sending the replicated B.
(global) INTEGER. Use rev = 0 to send global A into locally replicated B (on node (ii, jj)). Use rev != 0 to send locally replicated B from node (ii, jj) to its owner (which changes depending on its location in A) into the global A.
On exit, if rev = 1, a contains the copied data. a is unchanged on exit if rev = 0.
If rev = 1, b is unchanged on exit.
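The following sketch shows one way the arguments above might fit together in a round trip: a submatrix is gathered onto one node with rev = 0, modified there, and sent back with rev = 1. It is an illustration under stated assumptions, not a definitive usage pattern: the matrix size n, block size nb, start index i0, the 1-by-nprocs process grid, and the fill pattern are all chosen for the example, and the program must be linked against BLACS and ScaLAPACK and launched under MPI.

```fortran
program lacp3_roundtrip
  implicit none
  ! Illustrative sizes (assumptions): 8 x 8 global matrix, 2 x 2 blocks,
  ! 3 x 3 submatrix starting at A(2, 2).
  integer, parameter :: n = 8, nb = 2, m = 3, i0 = 2
  integer :: ictxt, iam, nprocs, nprow, npcol, myrow, mycol
  integer :: locr, locc, lld_a, info, il, jl, ig, jg
  integer :: desca(9)
  double precision, allocatable :: a(:,:)
  double precision :: b(m, m)
  integer, external :: numroc, indxl2g

  ! Set up a 1 x nprocs BLACS process grid.
  call blacs_pinfo(iam, nprocs)
  nprow = 1
  npcol = nprocs
  call blacs_get(-1, 0, ictxt)
  call blacs_gridinit(ictxt, 'Row-major', nprow, npcol)
  call blacs_gridinfo(ictxt, nprow, npcol, myrow, mycol)

  ! Local storage and array descriptor for the block-cyclic matrix A.
  locr = numroc(n, nb, myrow, 0, nprow)
  locc = numroc(n, nb, mycol, 0, npcol)
  lld_a = max(1, locr)
  allocate(a(lld_a, max(1, locc)))
  call descinit(desca, n, n, nb, nb, 0, 0, ictxt, lld_a, info)

  ! Fill A so that global entry (ig, jg) holds 10*ig + jg.
  a = 0.0d0
  do jl = 1, locc
    jg = indxl2g(jl, nb, mycol, 0, npcol)
    do il = 1, locr
      ig = indxl2g(il, nb, myrow, 0, nprow)
      a(il, jl) = dble(10*ig + jg)
    end do
  end do

  ! rev = 0: copy A(i0:i0+m-1, i0:i0+m-1) into B on node (0, 0).
  b = 0.0d0
  call pdlacp3(m, i0, a, desca, b, m, 0, 0, 0)

  if (myrow == 0 .and. mycol == 0) then
    ! B(1, 1) should now hold the value of A(i0, i0).
    print *, 'B(1,1) on node (0,0):', b(1, 1)
    b = -b   ! modify the local copy before returning it
  end if

  ! rev = 1: send B from node (0, 0) back to the owners of the
  ! corresponding entries of A.
  call pdlacp3(m, i0, a, desca, b, m, 0, 0, 1)

  deallocate(a)
  call blacs_gridexit(ictxt)
  call blacs_exit(0)
end program lacp3_roundtrip
```

Here ldb is passed as m because b is declared as an m-by-m array. The same node (0, 0) is named in both calls so that the node that received the copy is also the one that returns it.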