Efficient Implementation of Multi Stage SQRT Carry Select Adder

Similar documents
Implementation of High Speed Adder using DLATCH

Implementation of Low Power and Area Efficient Carry Select Adder

128 BIT CARRY SELECT ADDER USING BINARY TO EXCESS-ONE CONVERTER FOR DELAY REDUCTION AND AREA EFFICIENCY

An optimized implementation of 128 bit carry select adder using binary to excess-one converter for delay reduction and area efficiency

Design And Implimentation Of Modified Sqrt Carry Select Adder On FPGA

Implementation and Analysis of Area Efficient Architectures for CSLA by using CLA

Design and Implementation of High Speed 256-Bit Modified Square Root Carry Select Adder

Research Article Low Power 256-bit Modified Carry Select Adder

ISSN:

An Efficient 64-Bit Carry Select Adder With Less Delay And Reduced Area Application

Design of Carry Select Adder using Binary to Excess-3 Converter in VHDL

DESIGN OF HIGH PERFORMANCE, AREA EFFICIENT FIR FILTER USING CARRY SELECT ADDER

Research Article Design and Implementation of High Speed and Low Power Modified Square Root Carry Select Adder (MSQRTCSLA)

The main design objective in adder design are area, speed and power. Carry Select Adder (CSLA) is one of the fastest

Modified128 bit CSLA For Effective Area and Speed

An Efficient Carry Select Adder

Design and Implementation of Low-Power and Area-Efficient for Carry Select Adder (Csla)

128 BIT MODIFIED CARRY SELECT ADDER USING BINARY TO EXCESS-ONE CONVERTER

Improved 32 bit carry select adder for low area and low power

Pak. J. Biotechnol. Vol. 14 (Special Issue II) Pp (2017) Parjoona V. and P. Manimegalai

FPGA IMPEMENTATION OF LOW POWER AND AREA EFFICIENT CARRY SELECT ADDER

FPGA Implementation of Low Power and Area Efficient Carry Select Adder

Implementation of efficient carry select adder on FPGA

DESIGN OF LOW POWER AND HIGH SPEED BEC 2248 EFFICIENT NOVEL CARRY SELECT ADDER

Design of Modified Carry Select Adder for Addition of More Than Two Numbers

Efficient Architecture for Flexible Prescaler Using Multimodulo Prescaler

An MFA Binary Counter for Low Power Application

Welch Gong (Wg) 128 Bit Stream Cipher For Encryption and Decryption Algorithm

A Review on Hybrid Adders in VHDL Payal V. Mawale #1, Swapnil Jain *2, Pravin W. Jaronde #3

LUT Optimization for Memory Based Computation using Modified OMS Technique

Design and Analysis of Modified Fast Compressors for MAC Unit

Design of Memory Based Implementation Using LUT Multiplier

LUT OPTIMIZATION USING COMBINED APC-OMS TECHNIQUE

Low Power Area Efficient Parallel Counter Architecture

Implementation of Memory Based Multiplication Using Micro wind Software

Improve Performance of Low-Power Clock Branch Sharing Double-Edge Triggered Flip-Flop

ALONG with the progressive device scaling, semiconductor

Arithmetic Unit Based Reconfigurable Approximation Technique for Video Encoding

Abstract 1. INTRODUCTION. Cheekati Sirisha, IJECS Volume 05 Issue 10 Oct., 2016 Page No Page 18532

Research Article VLSI Architecture Using a Modified SQRT Carry Select Adder in Image Compression

Implementation of BIST Test Generation Scheme based on Single and Programmable Twisted Ring Counters

An Efficient Reduction of Area in Multistandard Transform Core

An Efficient High Speed Wallace Tree Multiplier

Efficient Test Pattern Generation Scheme with modified seed circuit.

Memory efficient Distributed architecture LUT Design using Unified Architecture

LUT Design Using OMS Technique for Memory Based Realization of FIR Filter

A Novel Architecture of LUT Design Optimization for DSP Applications

Objectives. Combinational logics Sequential logics Finite state machine Arithmetic circuits Datapath

OMS Based LUT Optimization

HIGH PERFORMANCE AND LOW POWER ASYNCHRONOUS DATA SAMPLING WITH POWER GATED DOUBLE EDGE TRIGGERED FLIP-FLOP

High Speed 8-bit Counters using State Excitation Logic and their Application in Frequency Divider

Optimization of memory based multiplication for LUT

A High- Speed LFSR Design by the Application of Sample Period Reduction Technique for BCH Encoder

Design and Implementation of Partial Reconfigurable Fir Filter Using Distributed Arithmetic Architecture

An FPGA Implementation of Shift Register Using Pulsed Latches

Distributed Arithmetic Unit Design for Fir Filter

A Parallel Area Delay Efficient Interpolation Filter Architecture

High Performance Carry Chains for FPGAs

COMPUTATIONAL REDUCTION LOGIC FOR ADDERS

Performance Evolution of 16 Bit Processor in FPGA using State Encoding Techniques

Reconfigurable FPGA Implementation of FIR Filter using Modified DA Method

TEST PATTERN GENERATION USING PSEUDORANDOM BIST

International Journal of Engineering Trends and Technology (IJETT) - Volume4 Issue8- August 2013

Low Power High Speed Voltage Level Shifter for Sub- Threshold Operations

Performance Analysis and Behaviour of Cascaded Integrator Comb Filters

An Lut Adaptive Filter Using DA

REDUCING DYNAMIC POWER BY PULSED LATCH AND MULTIPLE PULSE GENERATOR IN CLOCKTREE

SIC Vector Generation Using Test per Clock and Test per Scan

Advanced Devices. Registers Counters Multiplexers Decoders Adders. CSC258 Lecture Slides Steve Engels, 2006 Slide 1 of 20

Combinational Logic Design

Design and Implementation of LUT Optimization DSP Techniques

Adaptive Fir Filter with Optimised Area and Power using Modified Inner-Product Block

AN EFFICIENT LOW POWER DESIGN FOR ASYNCHRONOUS DATA SAMPLING IN DOUBLE EDGE TRIGGERED FLIP-FLOPS

Design of a High Frequency Dual Modulus Prescaler using Efficient TSPC Flip Flop using 180nm Technology

A Power Efficient Flip Flop by using 90nm Technology

Figure.1 Clock signal II. SYSTEM ANALYSIS

Power Optimization by Using Multi-Bit Flip-Flops

International Journal of Emerging Technologies in Computational and Applied Sciences (IJETCAS)

PERFORMANCE ANALYSIS OF AN EFFICIENT PULSE-TRIGGERED FLIP FLOPS FOR ULTRA LOW POWER APPLICATIONS

Area Efficient Pulsed Clock Generator Using Pulsed Latch Shift Register

Novel Correction and Detection for Memory Applications 1 B.Pujita, 2 SK.Sahir

Design and Simulation of Modified Alum Based On Glut

Design of a Low Power and Area Efficient Flip Flop With Embedded Logic Module

Design of an Efficient Low Power Multi Modulus Prescaler

2e 23-1 Peta Bits Per Second (Pbps) PRBS HDL Design for Ultra High Speed Applications/Products

EFFICIENT DESIGN OF SHIFT REGISTER FOR AREA AND POWER REDUCTION USING PULSED LATCH

A Low Power Delay Buffer Using Gated Driver Tree

Design and analysis of RCA in Subthreshold Logic Circuits Using AFE

MODEL QUESTIONS WITH ANSWERS THIRD SEMESTER B.TECH DEGREE EXAMINATION DECEMBER CS 203: Switching Theory and Logic Design. Time: 3 Hrs Marks: 100

VLSI Technology used in Auto-Scan Delay Testing Design For Bench Mark Circuits

Keywords Xilinx ISE, LUT, FIR System, SDR, Spectrum- Sensing, FPGA, Memory- optimization, A-OMS LUT.

R13 SET - 1 '' ''' '' ' '''' Code No: RT21053

Figure 1.LFSR Architecture ( ) Table 1. Shows the operation for x 3 +x+1 polynomial.

DESIGN AND IMPLEMENTATION OF SYNCHRONOUS 4-BIT UP COUNTER USING 180NM CMOS PROCESS TECHNOLOGY

Design of Polar List Decoder using 2-Bit SC Decoding Algorithm V Priya 1 M Parimaladevi 2

International Journal of Engineering Research-Online A Peer Reviewed International Journal

Efficient Method for Look-Up-Table Design in Memory Based Fir Filters

Midterm Exam 15 points total. March 28, 2011

TIME SCHEDULE. MODULE TOPICS PERIODS 1 Number system & Boolean algebra 17 Test I 1 2 Logic families &Combinational logic

Available online at ScienceDirect. Procedia Computer Science 46 (2015 ) Aida S Tharakan a *, Binu K Mathew b

Transcription:

International Journal of Research Studies in Science, Engineering and Technology Volume 2, Issue 8, August 2015, PP 31-36 ISSN 2349-4751 (Print) & ISSN 2349-476X (Online) Efficient Implementation of Multi Stage SQRT Carry Select Adder Kadudula Raju 1, N. Malathi 2, Dr. D Subba Rao 3 1 Department of ECE, Siddhartha Institute of Engineering and Technology, Hyderabad, India (PG Scholar) 2 Department of ECE, Siddhartha Institute of Engineering and Technology, Hyderabad, India (Associate Professor) 3 Department of ECE, Siddhartha Institute of Engineering and Technology, Hyderabad, India (Head of the Department for Electronics and Communication Engineering) Abstract: To study the data dependence and to identify redundant logic operations, we are analysing logic operations involved in conventional carry select adder (CSLA) and binary to excess-1 converter (BEC)-based CSLA. A new logic formulation for CSLA has been proposed by eliminating all the redundant logic operations present in conventional CSLA. In the proposed scheme calculation of final-sum is scheduled after carry select (CS) operation we have eliminated all the redundant logic operations present in the conventional CSLA and proposed a new logic formulation for CSLA. Bit patterns of two anticipating carry words (corresponding to cin =0and1) and fixed in bits are used for logic optimization of CS and generation units, by using optimized logic units an efficient CSLA design is obtained The proposed CSLA design involves significantly less area and delay than the recently proposed BEC-based CSLA. The proposed CSLA design is best for square-root (SQRT) CSLA because of the small carry-output delay. Keywords: conventional carry select adder, binary to excess-1 converter (BEC), proposed CSLA. 1. INTRODUCTION LOW-POWER, area-efficient, and high-performance VLSI systems are increasingly used in portable and mobile devices, multi standard wireless receivers, and biomedical instrumentation [1], [2]. An adder is the main component of an arithmetic unit. A complex digital signal processing (DSP) system involves several adders. An efficient adder design essentially improves the performance of a complex DSP system. A ripple carry adder (RCA) uses a simple design, but carry propagation delay (CPD) is the main concern in this adder. Carry look-ahead and carry select (CS) methods have been suggested to reduce the CPD of adders. A conventional carry select adder (CSLA) is an RCA RCA configuration that generates a pair of sum words and output carry bits corresponding the anticipated input-carry (c in =0and1) and selects one out of each pair for final-sum and final-output-carry [3]. A conventional CSLA has less CPD than an RCA, but the design is not attractive since it uses a dual RCA. Few attempts have been made to avoid dual use of RCA in CSLA design. Kim and Kim [4] used one RCA and one add-one circuit instead of two RCAs, where the add-one circuit is implemented using a multiplexer (MUX). Heet al.[5] proposed a square-root (SQRT)-CSLA to implement large bit-width adders with less delay. In a SQRT CSLA, CSLAs with increasing size are connected in a cascading structure. The main objective of SQRT-CSLA design is to provide a parallel path for carry propagation that helps to reduce the overall adder delay. Ram Kumar and Kittur [6] suggested a binary to BEC-based CSLA. The BEC-based CSLA involves less logic resources than the conventional CSLA, but it has marginally higher delay. A CSLA based on common Boolean logic (CBL) is also proposed in [7] and [8]. The CBL-based CSLA of [7] involves significantly less logic resource than the conventional CSLA but it has longer CPD, which is almost equal to that of the RCA. To overcome this problem, a SQRT-CSLA based on CBL was proposed in [8]. However, the CBL-based SQRTCSLA design of [8] requires more logic resource and delay than the BEC-based SQRT-CSLA of [6]. We observe that logic optimization largely depends on availability of redundant operations in the formulation, whereas adder delay mainly depends on data dependence. In the existing designs, logic is optimized without giving any consideration to the data dependence. In this brief, we made an analysis on logic operations involved in conventional and BEC-based CSLAs to study the data dependence and to identify redundant logic operations. Based on this analysis, we have International Journal of Research Studies in Science, Engineering and Technology [IJRSSET] 31

Efficient Implementation of Multi Stage SQRT Carry Select Adder proposed a logic formulation for the CSLA. The main contribution in this brief is logic formulation based on data dependence and optimized carry generator (CG) and CS design based on the proposed logic formulation; we have derived an efficient logic design for CSLA. Due to optimized logic Units. 2. EXISTING CSLA The structure of the 16-bit regular SQRT CSLA is shown in Fig. 4. It has 5 groups of different size RCA. Each group contains dual RCA and Mux. The linear carry select adder has two disadvantages there are high area usage and high time delay. SQRT CSLA can rectify these disadvantages of linear carry select adder. It is an improved one of linear CSLA. The time delay of the linear adder can decrease by having one more input into each set of adders than in the previous set. This is called a Square Root Carry Select Adder. Square Root carry select adder is constructed by equalizing the delay through two carry chains and the block-multiplexer signal from previous stage. The steps leading to the evaluations are given here. In the regular SQRT CSLA, the group2 has two sets of 2-bit RCA. The selection input of 3:2 Mux is c1. If the c1 = 0, the Mux select first RCA output otherwise it select second RCA output. The output of group2 is Sum [3:2] and carryout, c3. Then the area count of group2 is determined as follows: Gate count = 57 (FA + HA + Mux) FA = 39 (3 * 13) HA = 6 (1 * 6) Mux = 12 (3 * 4) The structure of the 16-bit modified SQRT CSLA is shown in Fig. It has 5 groups of different size RCA and BEC. Each group contains one RCA, one BEC and MUX. In the modified SQRT CSLA, the group2 has one 2-bit RCA which has 1 FA and 1 HA for carry in = 0. Instead of another 2-bit RCA with carry in = 1 a 3-bit BEC is used which adds one to the output from 2-bit RCA. The selection input of 6:3 Mux is c3. If the c3 = 0, the Mux select RCA output otherwise it select BEC output. The output of group2 is Sum [3:2] and carryout, c3. Then the area count of group2 is determined as follows: Gate count = 43 (FA + HA + Mux + BEC) FA = 13 (1 * 13) HA = 6 (1 * 6) Mux = 12 (3 * 4) NOT = 1 AND = 1 XOR = 10 (2 * 5) BEC (3-BIT) = NOT + AND + XOR = 12 Regular 16-Bit SQRT CSLA International Journal of Research Studies in Science, Engineering and Technology [IJRSSET] 32

Kadudula Raju et al. 3. PROPOSED CSLA Modified 16-Bit SQRT CSLA The CSLA has two units: 1) the sum and carry generator unit (SCG) and 2) the sum and carry selection unit [9]. The SCG unit consumes most of the logic resources of CSLA and significantly contributes to the critical path. Different logic designs have been suggested for efficient implementation of the SCG unit. We made a study of the logic designs suggested for the SCG unit of conventional and BEC-based CSLAs of [6] by suitable logic expressions. The main objective of this study is to identify redundant logic operations and data dependence. Accordingly, we remove all redundant logic operations and sequence logic operations based on their data dependence. The proposed CSLA consists of one HSG unit, one FSG unit, one CG unit, and one CS unit. The CG unit is composed of two CGs (CG0and CG1) corresponding to input-carry 0 and 1. The HSG receives two n-bit operands (A and B) and generatehalf-sumwords0andhalf-carryword c0 of width n bits each. Both CG0and CG1receives0andc0 from the HSG unit and generate two n-bit full-carry wordsc01 andc11 corresponding to input-carry 0 and 1 respectively. The logic circuits of CG0and CG1are optimized to take advantage of the fixed input-carry bits. The optimized designs of CG0and CG1.The multipath carry propagation feature of the CSLA are fully exploited in the SQRT-CSLA [5], which is composed of a chain of CSLAs. CSLAs of increasing size are used in the SQRT-CSLA to extract the maximum concurrence in the carry propagation path Proposed CS adder design Gate-level design of the HSG International Journal of Research Studies in Science, Engineering and Technology [IJRSSET] 33

Efficient Implementation of Multi Stage SQRT Carry Select Adder Gate-level optimized design of (CG0) for input-carry=0. Gate-level optimized design of (CG1) for input-carry=1 Gate-level design of the CS unit Gate-level design of the final-sum generation (FSG) unit Proposed SQRT-CSLA for 16 bit Using the SQRT-CSLA design, large-size adders are implemented with significantly less delay than a single-stage CSLA of same size. However, carry propagation delay between the CSLA stages of SQRT-CSLA is critical for the overall adder delay. Due to early generation of output-carry with multipath carry propagation feature, the proposed CSLA design is more favourable than the existing CSLA designs for area delay efficient implementation of SQRT-CSLA. A 16-bit SQRT-CSLA design using the proposed CSLA is shown in Fig where the 2-bit RCA, 2-bit CSLA, 3-bit CSLA, 4-bit CSLA, and 5-bit CSLA are used. We have considered the cascaded configuration of (2-bit RCA and 2-, 3-, 4-, 6-, 7-and 8-bit CSLAs) and (2-bit RCA and 2-, 3-, 4-, 6-, 7-, 8-, 9-, 11-, and 12-bit CSLAs), respectively, for the 32-bit SQRTCSLA and the 64-bit SQRT-CSLA International Journal of Research Studies in Science, Engineering and Technology [IJRSSET] 34

Kadudula Raju et al. Proposed SQRT-CSLA for 16 bit The logic operations of the RCA is shown in split form, where HSG, HCG, FSG, and FCG represent half-sum generation, half-carry generation, full-sum generation, and full-carry generation, respectively 4. RESULTS Proposed SQRT-CSLA for 16 Bit Simulation Proposed SQRT-CSLA for 64 bit Proposed SQRT-CSLA for 64 Bit Simulation 5. CONCLUSION We have analysed the logic operations involved in the conventional and BEC-based CSLAs to study the data dependence and to identify redundant logic operations. We have eliminated all the redundant logic operations of the conventional CSLA and proposed a new logic formulation for the CSLA. In the proposed scheme, the CS operation is scheduled before the calculation of final-sum, which is different from the conventional approach. Carry words corresponding to input-carry 0 and International Journal of Research Studies in Science, Engineering and Technology [IJRSSET] 35

Efficient Implementation of Multi Stage SQRT Carry Select Adder 1 generated by the CSLA based on the proposed scheme follow a specific bit pattern, which is used for logic optimization of the CS unit. Fixed input bits of the CG unit are also used for logic optimization. Based on this, an optimized design for CS and CG units are obtained Using these optimized logic units, an efficient design is obtained for the CSLA. The proposed CSLA design involves significantly less area and delay than the recently proposed BEC-based CSLA. Due to the small carry output delay, the proposed CSLA design is a good candidate for the SQRT adder REFERENCES [1] K.K.Parhi, VLSI Digital Signal Processing. New York, NY, USA: Wiley, 1998. [2] A. P. Chandrakasan, N. Verma, and D. C. Daly, Ultralow-power electronics for biomedical applications, Annu. Rev. Biomed. Eng., vol. 10, pp. 247 274, Aug. 2008. [3] O. J. Bedrij, Carry-select adder, IRE Trans. Electron. Comput., vol. EC-11, no. 3, pp. 340 344, Jun. 1962. [4] Y. Kim and L.-S. Kim, 64-bit carry-select adder with reduced area, Electron. Lett., vol. 37, no. 10, pp. 614 615, May 2001. [5] Y. He, C. H. Chang, and J. Gu, An area-efficient 64-bit square root carry select adder for low power application, in Proc. IEEE Int. Symp. Circuits Syst., 2005, vol. 4, pp. 4082 4085. [6] B. Ramkumar and H. M. Kittur, Low-power and area-efficient carry-select adder, IEEE Trans. Very Large Scale Integer. (VLSI) Syst., vol. 20, no. 2,pp. 371 375, Feb. 2012. [7] I.-C. Wey, C.-C. Ho, Y.-S. Lin, and C. C. Peng, An area-efficient carry select adder design by sharing the common Boolean logic term, inproc.imecs, 2012, pp. 1 4. [8] S. Manju and V. Sornagopal, An efficient SQRT architecture of carry select adder design by common Boolean logic, inproc. VLSI ICEVENT, 2013,pp. 1 5. [9] B. Parhami, Computer Arithmetic: Algorithms and Hardware Designs, 2nd ed. New York, NY, USA: Oxford Univ. Press, 2010. AUTHOR S BIOGRAPHY Kadudula raju has completed his B.Tech in Electronics and Communication Engineering from SVS INSTITUTE OF ENGINEERING & TECHNOLOGY, J.N.T.U.H Affiliated College in 2013. He is pursuing his M.Tech in VLSI System Design from SIDDHARTHA COLLEGE OF ENGINEERING AND TECHNOLOGY, J.N.T.U.H Affiliated College. N.Malathi is an Associate Professor at Siddhartha Institute of Engineering and Technology, Hyderabad in ECE Department. He received his B.Tech degree in Electronics and Communication Engineering from SREE NIDHI INSTITUTE OF ENGINEERING AND TECHNOLOGY, Hyderabad and M.Tech degree in VLSI System Design from TKR COLLEGE OF ENGINEERINGE, Hyderabad. His research interest is VLSI Technology and Design and Communication Systems. And Digital electronics Dr. D Subba Rao is a proficient Ph.D person in the research area of Image Processing from Vel-Tech University, Chennai along with initial degrees of Bachelor of Technology in Electronics and Communication Engineering (ECE) from Dr. S G I E T, Markapur and Master of Technology in Embedded Systems from SRM University, Chennai. He has 13 years of teaching experience and has published 12 Papers in International Journals, 2 Papers in National Journals and has been noted under 4 International Conferences. He has a fellowship of The Institution of Electronics and Telecommunication Engineers (IETE) along with a Life time membership of Indian Society for Technical Education (ISTE). He is currently bounded as an Associate Professor and is being chaired as Head of the Department for Electronics and Communication Engineering discipline at Siddhartha Institute of Engineering and Technology, Ibrahimpatnam, Hyderabad. International Journal of Research Studies in Science, Engineering and Technology [IJRSSET] 36