Implementation of efficient carry select adder on FPGA

Similar documents
FPGA IMPEMENTATION OF LOW POWER AND AREA EFFICIENT CARRY SELECT ADDER

FPGA Implementation of Low Power and Area Efficient Carry Select Adder

An optimized implementation of 128 bit carry select adder using binary to excess-one converter for delay reduction and area efficiency

Implementation of Low Power and Area Efficient Carry Select Adder

Design of Carry Select Adder using Binary to Excess-3 Converter in VHDL

Improved 32 bit carry select adder for low area and low power

Implementation and Analysis of Area Efficient Architectures for CSLA by using CLA

Design and Implementation of Low-Power and Area-Efficient for Carry Select Adder (Csla)

ISSN:

128 BIT CARRY SELECT ADDER USING BINARY TO EXCESS-ONE CONVERTER FOR DELAY REDUCTION AND AREA EFFICIENCY

An Efficient 64-Bit Carry Select Adder With Less Delay And Reduced Area Application

Research Article Low Power 256-bit Modified Carry Select Adder

Implementation of High Speed Adder using DLATCH

128 BIT MODIFIED CARRY SELECT ADDER USING BINARY TO EXCESS-ONE CONVERTER

An Efficient Carry Select Adder

Design and Implementation of High Speed 256-Bit Modified Square Root Carry Select Adder

DESIGN OF HIGH PERFORMANCE, AREA EFFICIENT FIR FILTER USING CARRY SELECT ADDER

Design And Implimentation Of Modified Sqrt Carry Select Adder On FPGA

Pak. J. Biotechnol. Vol. 14 (Special Issue II) Pp (2017) Parjoona V. and P. Manimegalai

The main design objective in adder design are area, speed and power. Carry Select Adder (CSLA) is one of the fastest

Modified128 bit CSLA For Effective Area and Speed

Research Article Design and Implementation of High Speed and Low Power Modified Square Root Carry Select Adder (MSQRTCSLA)

Efficient Implementation of Multi Stage SQRT Carry Select Adder

DESIGN OF LOW POWER AND HIGH SPEED BEC 2248 EFFICIENT NOVEL CARRY SELECT ADDER

An MFA Binary Counter for Low Power Application

Design of Modified Carry Select Adder for Addition of More Than Two Numbers

A Review on Hybrid Adders in VHDL Payal V. Mawale #1, Swapnil Jain *2, Pravin W. Jaronde #3

Research Article VLSI Architecture Using a Modified SQRT Carry Select Adder in Image Compression

An Efficient High Speed Wallace Tree Multiplier

Design and analysis of RCA in Subthreshold Logic Circuits Using AFE

Hardware Modeling of Binary Coded Decimal Adder in Field Programmable Gate Array

Combinational Logic Design

Design and Analysis of Modified Fast Compressors for MAC Unit

An Efficient Reduction of Area in Multistandard Transform Core

Improve Performance of Low-Power Clock Branch Sharing Double-Edge Triggered Flip-Flop

LUT Optimization for Memory Based Computation using Modified OMS Technique

Clock Gating Aware Low Power ALU Design and Implementation on FPGA

University College of Engineering, JNTUK, Kakinada, India Member of Technical Staff, Seerakademi, Hyderabad

Efficient Architecture for Flexible Prescaler Using Multimodulo Prescaler

ALONG with the progressive device scaling, semiconductor

LUT OPTIMIZATION USING COMBINED APC-OMS TECHNIQUE

VLSI Technology used in Auto-Scan Delay Testing Design For Bench Mark Circuits

Design of Memory Based Implementation Using LUT Multiplier

A Novel Architecture of LUT Design Optimization for DSP Applications

Adaptive Fir Filter with Optimised Area and Power using Modified Inner-Product Block

Reconfigurable FPGA Implementation of FIR Filter using Modified DA Method

PERFORMANCE ANALYSIS OF AN EFFICIENT PULSE-TRIGGERED FLIP FLOPS FOR ULTRA LOW POWER APPLICATIONS

LUT Design Using OMS Technique for Memory Based Realization of FIR Filter

COMPUTATIONAL REDUCTION LOGIC FOR ADDERS

Midterm Exam 15 points total. March 28, 2011

Chapter 8 Functions of Combinational Logic

VLSI IEEE Projects Titles LeMeniz Infotech

Abstract 1. INTRODUCTION. Cheekati Sirisha, IJECS Volume 05 Issue 10 Oct., 2016 Page No Page 18532

A Standard Cell Based Synchronous Dual-Bit Adder with Embedded Carry Look-Ahead

Low Power Area Efficient Parallel Counter Architecture

An FPGA Implementation of Shift Register Using Pulsed Latches

Available online at ScienceDirect. Procedia Computer Science 46 (2015 ) Aida S Tharakan a *, Binu K Mathew b

EEE130 Digital Electronics I Lecture #1_2. Dr. Shahrel A. Suandi

R13 SET - 1 '' ''' '' ' '''' Code No: RT21053

Peak Dynamic Power Estimation of FPGA-mapped Digital Designs

Implementation of Memory Based Multiplication Using Micro wind Software

Low Power Approach of Clock Gating in Synchronous System like FIFO: A Novel Clock Gating Approach and Comparative Analysis

FPGA Hardware Resource Specific Optimal Design for FIR Filters

A New Family of High-Performance Parallel Decimal Multipliers*

A NOVEL APPROACH TO ACHIEVE HIGH SPEED LOW-POWER HYBRID FLIP-FLOP

Objectives. Combinational logics Sequential logics Finite state machine Arithmetic circuits Datapath

R13. II B. Tech I Semester Regular Examinations, Jan DIGITAL LOGIC DESIGN (Com. to CSE, IT) PART-A

OMS Based LUT Optimization

DIGITAL CIRCUIT LOGIC UNIT 9: MULTIPLEXERS, DECODERS, AND PROGRAMMABLE LOGIC DEVICES

Performance Evolution of 16 Bit Processor in FPGA using State Encoding Techniques

High Performance Carry Chains for FPGAs

FUNCTIONS OF COMBINATIONAL LOGIC

EFFICIENT DESIGN OF SHIFT REGISTER FOR AREA AND POWER REDUCTION USING PULSED LATCH

Synthesis Techniques for Pseudo-Random Built-In Self-Test Based on the LFSR

Dev Bhoomi Institute Of Technology Department of Electronics and Communication Engineering PRACTICAL INSTRUCTION SHEET

Advanced Devices. Registers Counters Multiplexers Decoders Adders. CSC258 Lecture Slides Steve Engels, 2006 Slide 1 of 20

Comparative Analysis of Pulsed Latch and Flip-Flop based Shift Registers for High-Performance and Low-Power Systems

CHAPTER 4 RESULTS & DISCUSSION

AN EFFICIENT LOW POWER DESIGN FOR ASYNCHRONOUS DATA SAMPLING IN DOUBLE EDGE TRIGGERED FLIP-FLOPS

High Speed 8-bit Counters using State Excitation Logic and their Application in Frequency Divider

International Journal of Engineering Trends and Technology (IJETT) - Volume4 Issue8- August 2013

TEST PATTERN GENERATION USING PSEUDORANDOM BIST

Design of Polar List Decoder using 2-Bit SC Decoding Algorithm V Priya 1 M Parimaladevi 2

OF AN ADVANCED LUT METHODOLOGY BASED FIR FILTER DESIGN PROCESS

Further Details Contact: A. Vinay , , #301, 303 & 304,3rdFloor, AVR Buildings, Opp to SV Music College, Balaji

Distributed Arithmetic Unit Design for Fir Filter

12-bit Wallace Tree Multiplier CMPEN 411 Final Report Matthew Poremba 5/1/2009

International Journal of Scientific & Engineering Research, Volume 5, Issue 9, September ISSN

Design of BIST with Low Power Test Pattern Generator

Design and Simulation of Modified Alum Based On Glut

Aging Aware Multiplier with AHL using FPGA

Design and Implementation of Partial Reconfigurable Fir Filter Using Distributed Arithmetic Architecture

International Journal of Engineering Research-Online A Peer Reviewed International Journal

A High-Speed Low-Power Modulo 2 n +1 Multiplier Design Using Carbon-Nanotube Technology

Implementation of Area Efficient Memory-Based FIR Digital Filter Using LUT-Multiplier

nmos transistor Basics of VLSI Design and Test Solution: CMOS pmos transistor CMOS Inverter First-Order DC Analysis CMOS Inverter: Transient Response

Design and Implementation of LUT Optimization DSP Techniques

Low-Power and Area-Efficient Shift Register Using Pulsed Latches

Inside Digital Design Accompany Lab Manual

gate symbols will appear in schematic Dierent of a circuit. Standard gate symbols have been diagram Figures 5-3 and 5-4 show standard shapes introduce

Transcription:

Journal From the SelectedWorks of Kirat Pal Singh Summer May 18, 2016 Implementation of efficient carry select adder on FPGA Balaji Goswami, RajLakshmi Engineering College, Tamil Nadu, India Ms. Priya, RajLakshmi Engineering College, Tamil Nadu, India B. Rajanh, RajLakshmi Engineering College, Tamil Nadu, India This work is licensed under a Creative Commons CC_BY-NC International License. Available at https//works.bepress.com/kiratpalsingh/69/

IMPEMENTATION OF EFFICIENT CARRY SELECT ADDER ON FPGA Balaji Goswami [1], Priya [2],B. Rajanh [3] Associate Professor, [1] Assistant Professor, [3] student [2] Department of Electronics and Communication Engineering, RajLakshmi Engineering College, Tamil Nadu, India {priya4@gmail.com, balagoswami@tce.edu } Abstract In this paper, we have proposed a modified carry select adder which is known as fastest adders that can perform arithmetic operations in Digital signal processors. Modification in the gate level of the Square root Carry Select Adder (SQRT CSLA) structure results in the reduction of area and power of the CSLA structure which offers a simple and efficient function. Depending upon the Regular SQRT CSLA, we have modified the structure of the adders. The proposed design for 128-bit modified CSLA has reduced area and power as compared with the regular SQRT CSLA with only a slight increase in the delay. The ex peri mental result shows that the proposed CSLA structure is better than the regular SQRT CSLA. Keywords - Application-specific integrated circuit (ASIC), Carry Select Adder (CSLA), low power. I. INTRODUCTION Design of area and power efficient high-speed data path logic systems are one of the most substantial areas of research in VLSI system design. In digital adders, the speed of addition is limited by the time required to propagate a carry through the adder. The sum for each bit position in an elementary adder is generated sequentially only after the previous bit position has been summed and a carry propagated into the next position. The CSLA is used in many computational systems to alleviate the problem of carry propagation delay by independently generating multiple carries and then select a carry to generate the sum [1]. However, the CSLA is not area efficient because it uses multiple pairs of Ripple Carry Adders (RCA) to generate partial sum and carry by considering carry input C in = 0 and C in = 1, then the final sum and carry are selected by the multiplexers (mux) The basic idea of this paper is to use Binary to Excess-1 Converter (BEC) instead of RCA with C in =1 in the regular CSLA to achieve lower area and power consumption [2] [4]. The main advantage of this BEC logic comes from the lesser number of logic gates than the -bit Full Adder (FA) structure. The details of the BEC logic are discussed in Section B. This p ap er is structured as follows. The SQRT CSLA has been chosen for comparison with the proposed design as it has a more balanced delay, and requires lower power and area [5], [6]. The delay and area evaluation methodology of the regular and modified SQRT CSLA are presented in Sections II and III, respectively. The ASIC implementation details and results are analyzed in Section IV. Finally, the work is concluded in Section V. A. THE BASIC ADDER BLOCKS The AND, OR, and Inverter (AOI) implementation of an XOR gate is shown in Fig. 1. The gates between the dotted lines are performing the operations in parallel and the numeric representation of each gate indicates the delay contributed by that gate. The delay and area evaluation methodology considers all gates to be made up of AND, OR, and Inverter, each having delay equal to 1 unit and area equal to 1 unit. We then add up the number of gates in the longest path of a logic block that contributes to the maximum delay. The area evaluation is done by counting the total number of AOI gates required for each logic block. Based on this approach, the CSLA adder blocks of 21 mux, Half Adder (HA), and FA are evaluated and listed in Table I.

TABLE II FUNCTION TABLE OF THE 4-b BEC B[30] 0000 0001 1110 1111 X[30] 0001 0010 1111 0000 Fig. 1. XOR gate. Fig. 2.4-b BEC unit TABLE I DELAY AND AREA COUNT OF THE BLOCKS OF CSLA Adder blocks Delay Area XOR 3 5 21 Mux 3 4 Half Adder 3 6 Full Adder 6 13 B. BEC BASIC As stated above the main idea of this work is to use BEC instead of the RCA with Cin= 1 in order to reduce the area and power consumption of the regular CSLA. To replace the n-bit RCA, an n+ 1-bit BEC is required. A structure and the function table of a 4-b BEC are shown in Fig. 2 and Table II, respectively. Fig. 3 illustrates how the basic function of the CSLA is obtained by using the 4-bit BEC together with the mux. One input of the 84 mux gets as it input (B3, B2, B1, and B0) and another input of the mux is the BEC output. This produces the two possible partial results in parallel and the mux is used to select either the BEC output or the direct inputs according to the control signal C in. The importance of the BEC logic stems from the large silicon area reduction when the CSLA with large number of bits are designed. II. REGULAR 128-B SQRT CSLA The structure of the 128-b regular SQRT CSLA is shown in Fig. 4. It has five groups of different size RCA. The delay and area evaluation of each group are shown in Fig. 5, in which the numerals within specify the delay values, e.g., sum2 requires 10 gate delays. The steps leading to the evaluation are as follows. i) The group2 [see Fig. 5(a)] has two sets of 2-b RCA. Based on the consideration of delay values of Table I, the arrival time of selection input C 1 is earlier than s 3 and later than s 2. Thus, sum 3 is summation of s 3 and the corresponding mux and sum2 is summation of C 1 and its corresponding mux. Fig.4. 128-b SQRT CSLA Fig. 3. 4-b BEC with 84 mux.

{c6, sum [64]} {c10, sum [107]} {cout, sum [1511]} = c3 [t=10] + mux. = c6 [t=13] + mux. = c10 [t=16] + mux iii) The one set of 2-b RCA in group2 has two FA for C in = 1 and the other set has 1 FA and 1 HA for C in = 0 based on the area count. iv) Similarly, the estimated maximum delay and area of the other groups in the regular SQRT CSLA are evaluated and listed in Table III. TABLE III DELAY AND AREA COUNT OF REGULAR SQRT CSLA GROUPS Group Delay Area Group 2 11 57 Group 3 13 87 Group 4 16 117 Group 5 19 147 The total number of gate counts in group2 is determined as follows Gate count = 57(FA+HA+MUX) FA = 39(3*13) HA = 6(1*6) MUX = 12(3*4) III. MODIFIED 128-B SQRT CSLA Fig.5. Delay and area evaluation of Regular SQRT CLSA (a)group 2, (b)group3, (c)group4, (d)group 5 ii) Except for group2, the arrival time of mux selection input is always greater than the arrival time of data outputs from the RCA s. Thus, the delay of group3 to group5 is determined, respectively as follows The structure of the proposed 128-b SQRT CSLA using BEC for RCA with C in =1 to optimize the area and power is shown in Fig. 6. We again split the structure into five groups. The delay and area estimation of each group are shown in Fig. 7. The steps leading to the evaluation are given here. i) The group2 [see Fig. 7(a)] has one 2-b RCA which has 1 FA and 1 HA for C in =0. Instead of another 2-b RCA with C in =, a 1b BEC is used which adds one to the output from 2-b RCA. ii) For the remaining group s the arrival time of mux selection input is always greater than the arrival time of data inputs from the BEC s. Thus, the delay of the remaining groups depends on the arrival time of mux selection input and the mux delay.

(iii) The area count of group2 is determined as follows Gate count = 43(FA+HA+MUX+BEC) FA = 39(3*13) HA = 6(1*6) AND = 1 NOT = 1 XOR = 10(2*5) MUX = 12(3*4) Fig.6. Modified 128-b SQRT-CSLA 4) Similarly, the estimated maximum delay and area of the other groups of the modified SQRT CSLA are evaluated and listed in Table IV. TABLE IV DELAY AND AREA COUNT OF MODIFIED SQRT CSLA Group Delay Area Group 2 13 43 Group 3 16 61 Group 4 19 84 Group 5 22 107 Fig.7. Delay and area evaluation of modified SQRT CSLA (a) group 2 (b) group 3 (c) group 4 and (d) group 5. H is a Half Adder

IV. SIMULATION RESULTS The proposed design has been developed using Verilog-HDL and simulation is done by Xilinx ISE 14.1simulator.The comparison of area, delay and power between Regular CSLA and Modified CSLA for 16-bit, 32-bit and 128-bit are shown in Table V The area indicates the reduction in number of gates in the modified SQRT CSLA and the total power is sum of the leakage power, internal power and switching power. Compared to the regular CSLA the number of gates have been reduced in the modified CSLA therefore there is reduction in the area of the modified CSLA Bit TABLE V COMPARISION OF VALUES Regular Modified Area(no. of gates) Power Area(no. of gates) Power 16 348 315 291 268 32 698 553 762 448 128 1976 960 1948 864 Fig.8 Regular SQRT CSLA Fig.9 Modified SQRT CSLA V. CONCLUSION Adders can be constructed for many numerical representations, such as BCD or Excess-1, the most common adders operate on binary numbers. In this paper, we have designed adders using Binary Excess-1 convertors. CSLA can be used for high speed multiplications, they are also used in Advanced Microprocessor design. The reduced number of gates in the modified 128 bit SQRT CSLA offers the great advantage in the reduction of area, delay and also the power. The future scope of this work is to implement 256- bit SQRT CSLA. It would be interesting to test the design of the modified for 256-b. REFERENCES [1] O.J.Bedrij, Carry-select adder, IRE Trans. Electron. Comput.,pp-340-344,1962. [2] B.Ramkumar, H.m.Kittur, and P.M.Kannan, ASIC implementation of modified faster carry save adder, Eur.J.Sci Res.,vol.42,no.1pp.53-58, 2010. [3] T.Y.Ceiang and M.J.Hsino, Carry-select adder using single ripple carry adder. Electron.Lett.,vol.34, [4] Y.Kem and L.-S.Kim, 64-bit carry-select adder with reduced area, Electron. Lett., vol.37, no.10,pp.614-615, May 2001. [5] Kirat Pal Singh, Shivani Parmar, Design of Efficient RISC Processor at 40nm Process Technology, National Conference on Advancements in Solid State Devices & Circuits, vol. 1, April 2013. [6] Kuldeep Rawat, Tarek Darwish and MagdyBayoumi, A low power and reduced area Carry Select Adder, 45 th Midwest Symposium on Circuits and Systems, vol.1.,pp.467-470,march 2002. [7] Youngjoon Kim and Lee-Sup Kim, 64-bit carry-select adder with reduced area, Electronics Letters, vol.37, issue 10,pp.614-615, May 2001. [8] Youngjoon Kim and Lee-Sup Kim, A low power carry select adder with reduced area, IEEE International Symposium on Circuits and Systems, vol.4, pp.218-221, May 2001. [9] Shivani Parmar and kirat pal singh, Design of High speed hybrid carry select adder, IEEE, 2012. [10] Z. Chen and I. Koren, Techniques for yield enhancement of VLSI adders, in Proc. Int. Conf. Appl. Specific Array Process., Strasbourg, France, Jul. 24 26, 1995, pp. 222 229. [11]Milos D. Ercegovac and Thomas Lang, Digital arthimetic, Morgan Kaufmann Elsevier INC, 2004. [12] Shivani Parmar, Kirat Pal Singh, Performance Analysis of Dual gate MOSFET Airthmatic Logic

Unit, International Conference on Recent Trends of Computer Technology in Academia, Vol 1, pp. 96-101, April 2012. [13] W.Jeong and K.Roy, robust highperformance low power adder,proc,of the Asia and South Pacific Design Automatin Conference, pp.503-506,2003. [14] Kirat Pal Singh, Technique for improving design performance of VLSI Circuits and System SSRN 2594227, 2014. [15] D.C Chen, L. M. Guerra,E. H. Ng, M.Potkonjak, D.P. Schultz and J. M. Rabaey, A integrated system for rapid prototyping of high performance algorithm specific data path, in Proc. Application specific Array Processor, pp. 134-148, Aug 1992.