FPGA-Based Implementation of Low Complexity CORDIC-Based Scalable Complex QR Decomposition for MIMO-OFDM Systems

Asghariyehlou, F.; Javidan, J.

doi:10.22068/IJEEE.18.2.2206

Volume 18, Issue 2 (June 2022) IJEEE 2022, 18(2): 46-54 | Back to browse issues page

‎ 10.22068/IJEEE.18.2.2206

‎ 20.1001.1.17352827.2022.18.2.5.9

Mendeley

Zotero

RefWorks

Asghariyehlou F, Javidan J. FPGA-Based Implementation of Low Complexity CORDIC-Based Scalable Complex QR Decomposition for MIMO-OFDM Systems. IJEEE 2022; 18 (2) :46-54
URL: http://ijeee.iust.ac.ir/article-1-2206-en.html

FPGA-Based Implementation of Low Complexity CORDIC-Based Scalable Complex QR Decomposition for MIMO-OFDM Systems

F. Asghariyehlou

, J. Javidan

Abstract: (1634 Views)

This paper deals with the optimization of the CORDIC-based modified Gram-Schmidt (MGS) algorithm for QR decomposition (QRD) and presents a scalable algorithm with maximum throughput, the least possible latency, and hardware resources. The optimized algorithm is implemented on Xilinx Virtex 6 FPGA using ISE software as a fixed point with selected accuracy based on the results of MATLAB simulation. Using the loop unrolling technique with different coefficients, an attempt is made to reduce the latency and increase the throughput. In contrast, increasing the unrolling factor leads to a decrease in the frequency of the CORDIC unit as well as a decrease in the number of resources. As a result, there is a trade-off between the unrolling factor and the frequency of the CORDIC unit. By investigating the different unrolling factors, it is shown that the loop unrolling technique with a factor of 4 has the highest throughput with the value of 5.777 MQRD/s and the lowest latency with the value of 173 ns. Moreover, it is shown that throughput and latency are improved by 42.52% and 73.74% respectively compared to the not optimized case. The proposed method is also scalable for different sizes of m×m complex channel matrices, where log₂ m ∈ N.

Keywords: CORDIC Algorithm , MIMO Detection , QR Decomposition , Unrolling Technique

Full-Text [PDF 1786 kb] (1061 Downloads)

The number of the integer and fractional bits are determined by MATLAB simulation;
High throughput and low latency for CORDIC-based modified Gram-Schmidt algorithm for QR decomposition;
The number iterative of the CORDIC algorithm and unrolling coefficient is optimized for high throughput and low resource and latency.

Type of Study: Research Paper | Subject: VLSI
Received: 2021/06/06 | Revised: 2024/05/13 | Accepted: 2022/02/10

Rights and permissions
	This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

© 2022 by the authors. Licensee IUST, Tehran, Iran. This is an open access journal distributed under the terms and conditions of the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) license.

Iranian Journal of Electrical and Electronic Engineering

Iran University of Science and Technology

Aims & Scopes

Related Websites