VLSI Implementation of 2048 Point FFT

Zena Vatsa¹, Sumaya²

Department of Electronics and Communication Engineering, Sharda University, Greater Noida, India

Abstract: Orthogonal Frequency Division Multiplexing (OFDM) is a multi carrier modulation technique. OFDM provides high bandwidth efficiency because the carriers are orthogonal to each other and multiple carriers share the data among themselves. This paper focuses on the core processing blocks of an OFDM system, which are the Fast Fourier Transform (FFT) block and the Inverse Fast Fourier Transform (IFFT). The 8 points IFFT / FFT decimation-in-frequency (DIF) with radix-2 algorithm is analyzed to produce a solution that is suitable for FPGA implementation.

Keywords: Orthogonal Frequency Division Multiplexing (OFDM), Fast Fourier Transform (FFT), Inverse Fast Fourier Transform (IFFT), Field Programmable Gate Arrays (FPGA), radix-2, modulation, efficiency.

1. Introduction

A digital communication system involves the transmission of information in digital form from one point to another point. Regardless of the form of communication method, the three basic elements in a communication system consist of transmitter, channel and receiver. The source of information is the messages that are to be transmitted to the other end in the receiver. A transmitter can consist of source encoder, channel encoder and modulation. Source encoder employed an efficient representation of the information such that resources can be conserved. A channel encoder may include error detection and correction code. The aim is to increase the redundancy in the data to improve the reliability of transmission. A modulation process convert the base band signal into band pass signal before transmission. During transmission, the signal experiences impairment which attenuates the signals amplitude and distort signals phase. Also, the signals transmitting through a channel also impaired by noise, which is assumed to be Gaussian distributed component. In the receiver end, the reversed order of the steps in the transmitter is performed. Ideally, the same information must be decoded in the receiving end.

2. FFT Algorithms

FFT is mainly classified in two types, DIT (Decimation in Time) and DIF (Decimation in Frequency). General formula to compute N point FFT is,

\[ X(k) = \sum_{n=0}^{N-1} x(n)W_N^{nk}, \quad k = 0, \ldots, N-1 \]

For decimation in time algorithm \( x(n) \) is decomposed, and for decimation in frequency algorithm \( X(k) \) is decomposed. Both algorithms require same number of operations and require bit reversal at some places during the computation.

3. Radix-2 DIF Algorithm:

Assuming N is power of 2, we separate the index set of eq. (2.1) into two sets. Two proceed with the decomposition, now we separate the value of k into the sets of even values and odd values. Resolving this will led to \( X(2k) \) for k odd and \( X(2k+1) \) for k even.

Number of complex multiplication = \( \frac{N}{2}\log_2 N \)
Number of complex addition = \( N\log_2 N \)
4. Architecture

Most of the application in communication requires operation in real time. This high demand of real time systems made pipelined architecture a subject of research. DIT and DIF are two different ways one can perform FFT. DIF takes input in normal order and gives output in bit reverse order, so from now onwards we will stick to the DIF algorithm. Radix-2 algorithm is preferred for the VLSI implementation.

Architecture for VLSI Implementation

FFT/IFFT is very much symmetrical architecture. Two point butterfly unit is smallest unit of FFT/IFFT. Modules required implementing signal flow graph for radix-2 DIF algorithm are adder, subtractor, complex multiplier. A simplest architecture to implement FFT/IFFT with required modules and storage elements can be thought. Using this component for repetitive structure will acquire more area for high speed operation. For VLSI implementation we are always fighting with area, timing and power constraint. When it is required to get high speed processor in small area, only one idea strikes in everyone’s mind is parallel or pipelined architecture.

Pipeline will work with full efficiency only if it is filled and emptied at same rate. This can be thought as an assembly in a line industry, where output of one unit is supplied to the other and finally generated product is packed in particular way and ready to dispatch. Important thing to remember is when product is being packed other units are also in functioning stage and producing the output. Pipeline may be synchronous or asynchronous. A synchronous pipeline has a master clock and each stage must complete its work within one cycle. We will use synchronous pipeline structure for implementation. The minimum clock period is thus determined by the slowest stage. Asynchronous pipeline requires handshaking between stages, so that output of one stage is stored in memory till the previous outputs are not used by the next stage. Literature survey is done for different available pipelined architecture for the VLSI implementation of FFT/IFFT.

5. Implementation and Results

The FPGA implementation is performed using Very High Speed Integrated Circuit (VHSIC) Hardware Descriptive Language (VHDL). This performance of the coding is analyzed from the result of timing simulation using Altera Max Plus II.

Figure 2: Two point FFT calculation (Butterfly Structure)

Figure 3: Pipelined Architecture

Figure 4: RTL schematic of 2 point FFT
6. Conclusion

The Fast Fourier Transform (FFT) and Inverse Fast Fourier Transform (FFT) have been chosen to implement the design instead of the Discrete Fourier Transform and Inverse Discrete Fourier Transform because they offer better speed with less computational time. These methods require the odd and even samples inputs are process separately before they are combine to give the final output. The result of the computation is in integer bits which might comprises of real and imaginary components. The decimal value of the output if greater than 0.5 is approximated to 1 and vice versa.

References