An optimized implementation of 128 bit carry select adder using binary to excess-one converter for delay reduction and area efficiency

Similar documents
128 BIT CARRY SELECT ADDER USING BINARY TO EXCESS-ONE CONVERTER FOR DELAY REDUCTION AND AREA EFFICIENCY

Design of Carry Select Adder using Binary to Excess-3 Converter in VHDL

Improved 32 bit carry select adder for low area and low power

Implementation and Analysis of Area Efficient Architectures for CSLA by using CLA

128 BIT MODIFIED CARRY SELECT ADDER USING BINARY TO EXCESS-ONE CONVERTER

An Efficient 64-Bit Carry Select Adder With Less Delay And Reduced Area Application

The main design objective in adder design are area, speed and power. Carry Select Adder (CSLA) is one of the fastest

ISSN:

Implementation of High Speed Adder using DLATCH

Implementation of Low Power and Area Efficient Carry Select Adder

DESIGN OF HIGH PERFORMANCE, AREA EFFICIENT FIR FILTER USING CARRY SELECT ADDER

Implementation of efficient carry select adder on FPGA

FPGA Implementation of Low Power and Area Efficient Carry Select Adder

FPGA IMPEMENTATION OF LOW POWER AND AREA EFFICIENT CARRY SELECT ADDER

Modified128 bit CSLA For Effective Area and Speed

Design And Implimentation Of Modified Sqrt Carry Select Adder On FPGA

Design and Implementation of Low-Power and Area-Efficient for Carry Select Adder (Csla)

Research Article Low Power 256-bit Modified Carry Select Adder

An Efficient Carry Select Adder

Efficient Implementation of Multi Stage SQRT Carry Select Adder

Design and Implementation of High Speed 256-Bit Modified Square Root Carry Select Adder

DESIGN OF LOW POWER AND HIGH SPEED BEC 2248 EFFICIENT NOVEL CARRY SELECT ADDER

Pak. J. Biotechnol. Vol. 14 (Special Issue II) Pp (2017) Parjoona V. and P. Manimegalai

Research Article Design and Implementation of High Speed and Low Power Modified Square Root Carry Select Adder (MSQRTCSLA)

Design of Modified Carry Select Adder for Addition of More Than Two Numbers

A Review on Hybrid Adders in VHDL Payal V. Mawale #1, Swapnil Jain *2, Pravin W. Jaronde #3

An Efficient High Speed Wallace Tree Multiplier

Research Article VLSI Architecture Using a Modified SQRT Carry Select Adder in Image Compression

An MFA Binary Counter for Low Power Application

Design and Analysis of Modified Fast Compressors for MAC Unit

Objectives. Combinational logics Sequential logics Finite state machine Arithmetic circuits Datapath

LUT Optimization for Memory Based Computation using Modified OMS Technique

Bit Swapping LFSR and its Application to Fault Detection and Diagnosis Using FPGA

LUT OPTIMIZATION USING COMBINED APC-OMS TECHNIQUE

High Performance Carry Chains for FPGAs

Optimization of memory based multiplication for LUT

COMPUTATIONAL REDUCTION LOGIC FOR ADDERS

International Journal of Engineering Research-Online A Peer Reviewed International Journal

12-bit Wallace Tree Multiplier CMPEN 411 Final Report Matthew Poremba 5/1/2009

Performance Evolution of 16 Bit Processor in FPGA using State Encoding Techniques

ALONG with the progressive device scaling, semiconductor

Design of Memory Based Implementation Using LUT Multiplier

A Parallel Area Delay Efficient Interpolation Filter Architecture

LUT Design Using OMS Technique for Memory Based Realization of FIR Filter

Distributed Arithmetic Unit Design for Fir Filter

THE USE OF forward error correction (FEC) in optical networks

A Novel Architecture of LUT Design Optimization for DSP Applications

VLSI Technology used in Auto-Scan Delay Testing Design For Bench Mark Circuits

Low Power Area Efficient Parallel Counter Architecture

Hardware Implementation of Viterbi Decoder for Wireless Applications

ANALYSIS OF POWER REDUCTION IN 2 TO 4 LINE DECODER DESIGN USING GATE DIFFUSION INPUT TECHNIQUE

EEC 116 Fall 2011 Lab #5: Pipelined 32b Adder

International Journal of Scientific & Engineering Research, Volume 5, Issue 9, September ISSN

EFFICIENT DESIGN OF SHIFT REGISTER FOR AREA AND POWER REDUCTION USING PULSED LATCH

Implementation of Memory Based Multiplication Using Micro wind Software

IN DIGITAL transmission systems, there are always scramblers

PERFORMANCE ANALYSIS OF AN EFFICIENT PULSE-TRIGGERED FLIP FLOPS FOR ULTRA LOW POWER APPLICATIONS

Half-Adders. Ch.5 Summary. Chapter 5. Thomas L. Floyd

WINTER 15 EXAMINATION Model Answer

International Journal of Engineering Trends and Technology (IJETT) - Volume4 Issue8- August 2013

Design and Implementation of Partial Reconfigurable Fir Filter Using Distributed Arithmetic Architecture

A NOVEL DESIGN OF COUNTER USING TSPC D FLIP-FLOP FOR HIGH PERFORMANCE AND LOW POWER VLSI DESIGN APPLICATIONS USING 45NM CMOS TECHNOLOGY

TEST PATTERN GENERATION USING PSEUDORANDOM BIST

An Improved Recursive and Non-recursive Comb Filter for DSP Applications

[Krishna*, 4.(12): December, 2015] ISSN: (I2OR), Publication Impact Factor: 3.785

An Enhancement of Decimation Process using Fast Cascaded Integrator Comb (CIC) Filter

DESIGN AND ANALYSIS OF COMBINATIONAL CODING CIRCUITS USING ADIABATIC LOGIC

Design and FPGA Implementation of 100Gbit/s Scrambler Architectures for OTN Protocol Chethan Kumar M 1, Praveen Kumar Y G 2, Dr. M. Z. Kurian 3.

Area Efficient Pulsed Clock Generator Using Pulsed Latch Shift Register

OF AN ADVANCED LUT METHODOLOGY BASED FIR FILTER DESIGN PROCESS

CHAPTER 4 RESULTS & DISCUSSION

LFSR Counter Implementation in CMOS VLSI

FPGA Hardware Resource Specific Optimal Design for FIR Filters

Design and analysis of RCA in Subthreshold Logic Circuits Using AFE

8. Design of Adders. Jacob Abraham. Department of Electrical and Computer Engineering The University of Texas at Austin VLSI Design Fall 2017

University College of Engineering, JNTUK, Kakinada, India Member of Technical Staff, Seerakademi, Hyderabad

R13 SET - 1 '' ''' '' ' '''' Code No: RT21053

SA4NCCP 4-BIT FULL SERIAL ADDER

Aging Aware Multiplier with AHL using FPGA

Optimization of Multi-Channel BCH Error Decoding for Common Cases. Russell Dill Master's Thesis Defense April 20, 2015

CHAPTER 6 DESIGN OF HIGH SPEED COUNTER USING PIPELINING

Hardware Modeling of Binary Coded Decimal Adder in Field Programmable Gate Array

AN OPTIMIZED IMPLEMENTATION OF MULTI- BIT FLIP-FLOP USING VERILOG

Chapter 8 Functions of Combinational Logic

Abstract 1. INTRODUCTION. Cheekati Sirisha, IJECS Volume 05 Issue 10 Oct., 2016 Page No Page 18532

Design of BIST with Low Power Test Pattern Generator

REDUCING DYNAMIC POWER BY PULSED LATCH AND MULTIPLE PULSE GENERATOR IN CLOCKTREE

CHAPTER 6 ASYNCHRONOUS QUASI DELAY INSENSITIVE TEMPLATES (QDI) BASED VITERBI DECODER

COE328 Course Outline. Fall 2007

MODEL QUESTIONS WITH ANSWERS THIRD SEMESTER B.TECH DEGREE EXAMINATION DECEMBER CS 203: Switching Theory and Logic Design. Time: 3 Hrs Marks: 100

Efficient Architecture for Flexible Prescaler Using Multimodulo Prescaler

R13. II B. Tech I Semester Regular Examinations, Jan DIGITAL LOGIC DESIGN (Com. to CSE, IT) PART-A

Research Article Ring Counter Based ATPG for Low Transition Test Pattern Generation

Design of a Low Power and Area Efficient Flip Flop With Embedded Logic Module

A High- Speed LFSR Design by the Application of Sample Period Reduction Technique for BCH Encoder

A Low Power Delay Buffer Using Gated Driver Tree

Low Power Approach of Clock Gating in Synchronous System like FIFO: A Novel Clock Gating Approach and Comparative Analysis

FPGA Implementation of DA Algritm for Fir Filter

A Symmetric Differential Clock Generator for Bit-Serial Hardware

An Efficient Reduction of Area in Multistandard Transform Core

Transcription:

Journal From the SelectedWorks of Journal December, 2014 An optimized implementation of 128 bit carry select adder using binary to excess-one converter for delay reduction and area efficiency P. Manga Rao K. Ashok Kumar This work is licensed under a Creative Commons CC_BY-NC International License. Available at: https://works.bepress.com/article/44/

AN OPTIMIZED IMPLEMENTATION OF 128 BIT CARRY SELECT ADDER USING BINARY TO EXCESS-ONE CONVERTER FOR DELAY REDUCTION AND AREA EFFICIENCY 1 P.Manga Rao, PG Scholar in ECE, 2 K. Ashok Kumar, Associate Professor, ECE,Head of the Department, 1 mangarao0436@gmail.com 2 akkonduru@gmail.com, 1, 2 Annamacharya Institute of Technology and Science, Hyderabad, Telangana. Abstract: In many data-processing processors Carry Select Adder (CSLA) is one a fastest adders used to perform arithmetic functions. The upcoming technologies depicts that there is a scope for reducing the area and power consumption in the CSLA. This work uses a simple gate level modification to significantly reduce the area and power of the CSLA. Based on this modification CSLA architecture have been developed and can be compared with the regular CSLA architecture. The proposed design has reduced area and power as compared with the regular CSLA with only a slight increase in the delay. This work evaluates the performance of the proposed designs in terms of delay, area, power, and their products by hand with logical effort and through custom design and layout in 0.18-m CMOS process technology. The results analysis shows that the proposed CSLA structure is better than the regular CSLA. Keywords- Delay; Area; Array Multiplier, low power, VHDL Modeling & Simulation. I.Introduction Area and power reduction in data path logic systems are the main area of research in VLSI system design. High-speed addition and multiplication has always been a fundamental requirement of high performance processors and systems. In digital adders, the speed of addition is limited by the time required to propagate a carry through the adder. The sum for each bit position in an elementary adder is generated sequentially only after the previous bit position has been summed and a carry propagated into the next position. The major speed limitation in any adder is in the production of carries and many authors have considered the addition problem. The CSLA is used in many computational systems to moderate the problem of carry propagation delay by independently generating multiple carries and then select a carry to generate the sum. However, the CSLA is not area efficient because it uses multiple pairs of Ripple Carry Adders (RCA) to generate partial sum and carry by considering carry input and then the final sum and carry are selected by the multiplexers (mux). To overcome above problem, the basic idea of the proposed work is by using n- bit binary to excess-1 code converters (BEC) to improve the speed of addition. This logic can be implemented with any type of adder to further improve the speed. Using Binary to Excess -1 Converter (BEC) instead of RCA in the regular CSLA to achieve lower area and power consumption. The main advantage of this BEC logic comes from the lesser number of logic gates than the Full Adder (FA) structure. II. Basic Adder Block On the basis of requirements such as area, delay and power consumption some of the complex adders are Ripple Carry Adder, Carry look-ahead Adder and Carry Select Adder. Ripple Carry Adder (RCA) shows the compact design but their computation time is longer. Time critical applications make use of Carry Look-Ahead Adder (CLA) to derive fast results but it leads to increase in area. But the carry select adder provides a compromise between the small areas but longer delay of RCA and large area with small delay of Carry Look Ahead adder. Ripple Carry Adder consists of cascaded N single

bit full adders. Output carry of previous adder becomes the input carry of next full adder. Therefore, the carry of this adder traverses longest path called worst case delay path through N stages. Fig. 1 shows the block diagram of ripple carry adder. Now as the value of N increases, delay of adder will also increase in a linear way. Therefore, RCA has the lowest speed amongst all the adders because of large propagation delay but it occupies the least area. Now CSLA provides a way to get around this linear dependency is to anticipate all possible values of input carry i.e. 0 and 1 and evaluate the result in advance. Once the original value of carry is known, result can be selected using the multiplexer stage. Fig. 2. 4-b BEC Therefore the conventional CSLA makes use of Dual RCA s to generate the partial sum and carry by considering input carry Cin=0 and Cin=1, then the final sum and carry are selected by multiplexers. Fig. 3. 4-b BEC with 8:4 mux. TABLE I TRUTH TABLE OF 4-BIT BINARY TO EXCESS-1 CONVERTER Fig. 1 4-bit Ripple Carry Adder III. BEC The basic idea of this work is to use Binary to Excess- 1 converter (BEC) instead of RCA with Cin=1 in conventional CSLA in order to reduce the area and power.bec uses less number of logic gates than N-bit full adder structure. To replace N-bit RCA, an N+1 bit BEC is required. Therefore, Modified CSLA has low power and less area than conventional CSLA. SQRT CSLA has been chosen for comparison with modified design using BEC as it has more balanced delay, less area and low power. Regular SQRT CSLA also uses dual RCAs. In order to reduce the delay, area and power, the design is modified by using BEC instead of RCA with Cin=1. Therefore, the modified SQRT CSLA occupies less area, delay and low power. Further also, the parameters like delay, area and power can be reduced. bits are designed. The Boolean expressions of the 8-bit

Fig. 4 shows the 16-bit Conventional CSLA. The conventional CSLA is area consuming due to the use of dual RCA s. BEC is listed improved up to 40% to 90%, by performing the additions in parallel, and reducing the maximum carry delay. The structure of the 128-b regular SQRT CSLA is shown in Fig. 6. It has five groups of different size RCA. The delay and area evaluation of each group are shown in Fig. 3, in which the numerals with in specify the delay values, e.g., sum2 requires 10 gate delays. Fig. 4 16-bit conventional carry select adder Modified SQRT CSLA is similar to that of regular SQRT CSLA, the only difference is we replace RCA with Cin=1 with BEC. This replaced BEC performs the same operation as that of the replaced RCA with Cin=1. Fig. 5 shows the block diagram of modified SQRT CSLA. This structure consumes less area; delay and power than regular SQRT CSLA because of less number of transistors are used. Fig.6 Architecture of Regular 128-b CSLA V. Modified Architecture of 128-bit CSLA This architecture is similar to regular 64-bit SQRT CSLA, the only change is that, we replace RCA with Cin=1 among the two available RCAs in a group with a BEC. This BEC has a feature that it can perform the similar operation as that of the replaced RCA with Cin=1. Fig 7 shows the Modified block diagram of 64-bit SQRT CSLA. Fig. 5 Modified 16-bit SQRT CSLA IV. Architecture of 128-bit CSLA A 16-bit carry select adder can be developed in two different sizes namely uniform block size and variable block size. Similarly a 32, 64 and 128-bit can also be developed in two modes of different block sizes. Ripple-carry adders are the simplest and most compact full adders, but their performance is limited by a carry that must propagate from the least significant bit to the most-significant bit. The various 16, 32, 64 and 128-bit CSLA can also be developed by using ripple carry adders. The importance of the BEC logic stems from the large silicon area reduction when the CSLA with large number of The number of bits required for BEC logic is 1 bit more than the RCA bits. The modified block diagram is also divided into various groups of variable sizes of bits with each group having the ripple carry adders, BEC and corresponding mux. As shown in the Fig.4, Group 0 contain one RCA only which is having input of lower significant bit and carry in bit and produces result of sum[1:0] and carry out which is acting as mux selection line for the next group, similarly the procedure continues for higher groups but they includes BEC logic instead of RCA with Cin=1. different sized adders. The similar design flow is followed

for both the regular and modified CSLA of different sizes. VII. CONCLUSION Fig.7 Modified128 bit CSLA VI Comparison of Regular and Modified128-bit CSLA Percentage of delay overhead exhibits a similarly decreasing trend with bit size. The delay overhead for the 8, 16, and 32-b is 14%, 9.8%, and 5.63% respectively, whereas for the 64-b it reduces to only 4.75%. The power delay product of the proposed 8-b is higher than that of the regular CSLA by 5.2% and the area -delay product is lower by 2.9%. However, the power-delay product of the proposed 16-b CSLA reduces by 1.76% and for the 32-b and 64-b by as much as 8.18%, and 12.28% respectively. Similarly the area-delay product of the proposed design for 16-, 32-, 64-b and 128-b is also reduced by 6.7%, 11%, and 14.4% respectively. Table 2: Comparison values Power, delay and area are the constituent factors in VLSI design that limits the performance of any circuit. This work presents a simple approach to reduce the area, delay and power of CSLA architecture. The conventional carry select adder has the disadvantage of more power consumption and occupying more chip area. The proposed SQRT CSLA using common Boolean logic has low power, less delay and reduced area than all the other adder structures. It is also little bit faster than all the other adders. In this way, the transistor count of proposed SQRT CSLA is reduced having less area and low power which makes it simple and efficient for VLSI hardware implementations. REFERENCES [1] O. J. Bedrij, Carry-select adder, IRE Trans. Electron. Comput. 1962. [2] B. Ramkumar, H.M. Kittur, and P. M. Kannan, ASIC implementation of modified faster carry save adder, Eur. J. Sci. Res., vol. 41, no. 1, 2009. [3] T. Y. Ceiang and M. J. Hsiao, Carry-select adder using VI.RESULT The implemented design in this work has been simulated using Verilog-HDL (Modelsim). The adders (of various size 16, 32, 64 and 128) are designed and simulated using Modelsim. After simulation the different size codes are synthesized using Xilinx ISE 10.1. The simulated files are imported into the synthesized tool and corresponding values of delay and area are noted. The synthesized reports contain area and delay values for single ripple carry adder, Electron. Lett. vol. 34, no. 22, pp. 2101 2103, Oct. 1998. [4] Y. Kim and L.-S. Kim, 64-bit carry-select adder with reduced area, Electron. Lett. vol. 36, no. 9, May 2000. [5] J. M. Rabaey, Digtal Integrated Circuits A Design

Perspective. Upper Saddle River, NJ: Prentice-Hall, 2001. [6] Parmar, Shivani, and Kirat Pal Singh. "Design of high speed hybrid carry select adder." In Advance Computing Conference (IACC), 2013 IEEE 3rd International, pp. 1656-1663. IEEE, 2013. [7] I-Chyn Wey, Cheng-Chen Ho, Yi-Sheng Lin, and Chien- Chang Peng, An Area-Efficient Carry Select Adder Design by Sharing the Common Boolean Logic Term, Proceedings of the International MultiConference of Engineers and Computer Scientist 2012 Vol II,IMCES 2012,HongKong,March 14-16 2012. [8] Ms. S.Manjui, Mr. V. Sornagopae, An Efficient SQRT Architecture of Carry Select Adder Design by Common Boolean Logic,IEEE, 2013. [9] Youngjoon Kim and Lee-Sup Kim, 64-bit carry- select adder with reduced area, Electronics Letters, vol.37, issue 10, pp.614-615, May 2001. [10] Yajuan He, Chip-Hong Chang and Jiangmin Gu, An area efficient 64-bit square root Carry-Select Adder for low power applications, IEEE International Symposium on Circuits and Systems,vol.4, pp.4082-4085, May 2005. 118