IIIT Hyderabad Publications |
|||||||||
|
Accelerating LU-Decomposition of Arbitrarily Sized Matrices on FPGAsAuthor: Krishna Kumar Maram 2018122003 Date: 2023-11-06 Report no: IIIT/TH/2023/151 Advisor:Suresh Purini AbstractScientific applications like aircraft-design and machine-learning algorithms involve solving a system of linear equations. LU decomposition is helpful if the system needs to be solved repeatedly for the same Coefficient matrix but for different Constants of a Linear Equation. It also aids in other matrix operations, such as computing the determinant and inverse of a matrix. This thesis proposes an architecture of hardware accelerator for computing the LU decomposition of an input matrix. Our accelerator consists of two simple linear arrays of Processing Engines (PEs), one on each of the two SLR regions of the FPGA. All the computations arising from the block LU decomposition are simplified and scheduled on these two PE arrays. On an Alveo U50 FPGA, our design achieves a peak floating-point performance of 128 GLOPS/s and an average performance of 95 GFLOPS/s. We achieve ≈ 15× speedup on latency compared to an Intel MKL implementation on a 4-core Intel Xeon CPU. Full thesis: pdf Centre for Others |
||||||||
Copyright © 2009 - IIIT Hyderabad. All Rights Reserved. |