A.K. Oudjida, S. Titri and M. Hamerlain
Abstract
Matrix product is a compute-bound problem that can be handled efficiently by elementary systolic algorithms. From a theoretical point of view, most of these algorithms are very simple and sometimes even trivial. However, designing an efficient implementation on a fixed-connection network, such as an FPGA where resources are very limited, is more demanding and sometimes quite tedious. The objective of this paper is twofold. We first describe a full-systolic algorithm for matrix product whose merit over its existing counterparts is that it requires no preloading of input data into the elementary processors (EPs) and generates output data only from boundary EPs. The resulting architecture accepts an uninterrupted stream of input data and produces an uninterrupted output stream with a latency of 2N−1 for an N×N matrix product. This architecture is also scalable and complies with the constraint of problem-size independence (ψ). Secondly, we present a methodology for generating a family of very compact MP arrays on FPGA, based essentially upon manual mapping at the CLB level coupled with VHDL structural description.
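The paper's preloading-free EP array is not reproduced here. As an illustrative point of reference only, the sketch below simulates the textbook output-stationary systolic array for N×N matrix product, with rows of A streamed from the left boundary and columns of B from the top, each skewed by one cycle per index. The function name and the use of NumPy are mine, not the paper's; the last operand pair reaches the corner PE at cycle 3N−3 in this classic scheme, which is what the loop bound reflects.

import numpy as np

def systolic_matmul(A, B):
    """Cycle-level simulation of the classic output-stationary N x N
    systolic array: PE (i, j) keeps C[i, j] stationary and accumulates
    one product per cycle from the A/B operands streaming through it."""
    N = A.shape[0]
    a_reg = np.zeros((N, N))   # A operand registered in each PE
    b_reg = np.zeros((N, N))   # B operand registered in each PE
    C = np.zeros((N, N))       # stationary partial results

    # The last operand pair reaches PE (N-1, N-1) at cycle 3N - 3.
    for t in range(3 * N - 2):
        a_in = np.zeros((N, N))
        b_in = np.zeros((N, N))
        # Interior PEs read what their left/top neighbour held last cycle.
        a_in[:, 1:] = a_reg[:, :-1]
        b_in[1:, :] = b_reg[:-1, :]
        # Boundary PEs are fed the skewed streams:
        # A[i, k] enters row i at cycle i + k, B[k, j] enters column j at k + j.
        for i in range(N):
            k = t - i
            if 0 <= k < N:
                a_in[i, 0] = A[i, k]
        for j in range(N):
            k = t - j
            if 0 <= k < N:
                b_in[0, j] = B[k, j]
        C += a_in * b_in            # one multiply-accumulate per PE
        a_reg, b_reg = a_in, b_in   # register operands for the next cycle

    return C

if __name__ == "__main__":
    N = 4
    A, B = np.random.rand(N, N), np.random.rand(N, N)
    assert np.allclose(systolic_matmul(A, B), A @ B)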
A.K. Oudjida, S. Titri and M. Hamerlain
Abstract
The emergence of the systolic paradigm in 1978 inspired the first 2D-array parallelization of the sequential matrix multiplication algorithm. Since then, owing to its attractive features, the systolic approach has gained great momentum, to the point where all 2D-array parallelization attempts have been exclusively systolic. As a result, latency has been successively reduced several times (5N, 3N, 2N, 3N/2), where N is the matrix size. But as latency decreased, further irregularities were introduced into the array, severely compromising the implementation at either the VLSI level or the system level. The best illustration of such irregularities is the pair of designs proposed by Tsay and Chang in 1995, considered the fastest designs (3N/2) developed so far. The purpose of this paper is twofold. We first demonstrate that N+√N/2 is the minimal latency achievable with the systolic approach. We then introduce a full-parallel 2D-array algorithm with latency N and I/O bandwidth 2N. This novel algorithm is not only the fastest but also the most regular. A 3D parallel version with O(log N) latency is also presented.
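To make the successive latency reductions concrete, the short snippet below evaluates the expressions cited in the abstract for a few matrix sizes. Only the formulas (5N, 3N, 2N, 3N/2, N+√N/2, N) come from the text; the design labels other than Tsay and Chang are placeholder names of mine.

from math import sqrt

# Latency expressions quoted in the abstract, oldest to newest.
designs = {
    "early systolic":        lambda n: 5 * n,
    "later systolic":        lambda n: 3 * n,
    "later systolic (2N)":   lambda n: 2 * n,
    "Tsay and Chang (1995)": lambda n: 1.5 * n,
    "systolic lower bound":  lambda n: n + sqrt(n) / 2,
    "proposed 2D array":     lambda n: n,
}

for n in (16, 64, 256):
    print(f"N = {n}")
    for name, latency in designs.items():
        print(f"  {name:22s} ~ {latency(n):8.1f} cycles")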