1. Multiplication of a matrix and its transpose : Consider matrix A of size N by M and its transpose A T of size M by N . Your task is to design and implement a parallel algorithm for multiplication...

1.
Multiplication of a matrix and its transpose: Consider matrix
A
of size
N
by
M
and its transpose
A^T

of size
M
by
N. Your task is to design and implement a parallel algorithm for multiplication of a matrix and its transpose, i.e.,
C
=
AA^T
, for distributed-memory multi-computers in which the processors are organized as a one-dimensional linear array.

In the parallel algorithm design you must consider efficiency issues, i.e., try to minimize computation and communication costs and balance the workloads among all processors. Since the resulting matrix
C
is symmetric, i.e.,
c_ij
=
c_ji
, for example, in your algorithm only the elements in the upper (or lower) triangular of the matrix need to be calculated. (In other words you must not calculate both
c_ij

and
c_ji

as they are the same.)

In the implementation
a. You must use MPI non-blocking send/recv communication functions to overlap computation and communication.
b. You can assume
N
≥
p
for
p
being the number of processes organized as a one-dimensional linear array.
c. Your program must produce correct results for
p
being greater than or equal to one.
d. For simplicity you may restrict
p
to be either an odd, or even number to achieve the best possible load balancing.
e. Your program needs to ask for the matrix sizes
N
and
M
as user defined parameters, and must print out the results in the row-wise order as shown in an example below.

c₀₀
c₀₁
c₀₂
c₀₃
c₀₄

c₁₁
c₁₂
c₁₃
c₁₄

c₂₂
c₂₃
c₂₄

c₃₃
c₃₄

c₄₄

After the parallel computation, you main program must conduct a self-checking, i.e., first perform a sequential computation using the same data set and then compare the two results.

Jun 05, 2021

SOLUTION.PDF

1. Multiplication of a matrix and its transpose : Consider matrix A of size N by M and its transpose A T of size M by N . Your task is to design and implement a parallel algorithm for multiplication...

Get Answer To This Question

Related Questions & Answers

Submit New Assignment