Low-Power Design of Sequential Circuits Using a Quasi-Synchronous Derived Clock *

Similar documents
SYNCHRONOUS DERIVED CLOCK AND SYNTHESIS OF LOW POWER SEQUENTIAL CIRCUITS *

DESIGN OF A NEW MODIFIED CLOCK GATED SENSE-AMPLIFIER FLIP-FLOP

HIGH PERFORMANCE AND LOW POWER ASYNCHRONOUS DATA SAMPLING WITH POWER GATED DOUBLE EDGE TRIGGERED FLIP-FLOP

AN EFFICIENT LOW POWER DESIGN FOR ASYNCHRONOUS DATA SAMPLING IN DOUBLE EDGE TRIGGERED FLIP-FLOPS

Dual Slope ADC Design from Power, Speed and Area Perspectives

DUAL EDGE-TRIGGERED D-TYPE FLIP-FLOP WITH LOW POWER CONSUMPTION

Power Optimization by Using Multi-Bit Flip-Flops

PERFORMANCE ANALYSIS OF AN EFFICIENT PULSE-TRIGGERED FLIP FLOPS FOR ULTRA LOW POWER APPLICATIONS

REDUCING DYNAMIC POWER BY PULSED LATCH AND MULTIPLE PULSE GENERATOR IN CLOCKTREE

International Journal of Emerging Technologies in Computational and Applied Sciences (IJETCAS)

Figure.1 Clock signal II. SYSTEM ANALYSIS

A Power Efficient Flip Flop by using 90nm Technology

Low Power D Flip Flop Using Static Pass Transistor Logic

Design of New Dual Edge Triggered Sense Amplifier Flip-Flop with Low Area and Power Efficient

DESIGN OF DOUBLE PULSE TRIGGERED FLIP-FLOP BASED ON SIGNAL FEED THROUGH SCHEME

Leakage Current Reduction in Sequential Circuits by Modifying the Scan Chains

DIFFERENTIAL CONDITIONAL CAPTURING FLIP-FLOP TECHNIQUE USED FOR LOW POWER CONSUMPTION IN CLOCKING SCHEME

Design of a Low Power Four-Bit Binary Counter Using Enhancement Type Mosfet

ECE321 Electronics I

High Performance Dynamic Hybrid Flip-Flop For Pipeline Stages with Methodical Implanted Logic

Modifying the Scan Chains in Sequential Circuit to Reduce Leakage Current

Abstract 1. INTRODUCTION. Cheekati Sirisha, IJECS Volume 05 Issue 10 Oct., 2016 Page No Page 18532

Design of a Low Power and Area Efficient Flip Flop With Embedded Logic Module

LOW POWER AND HIGH PERFORMANCE SHIFT REGISTERS USING PULSED LATCH TECHNIQUE

An FPGA Implementation of Shift Register Using Pulsed Latches

Low Power VLSI Circuits and Systems Prof. Ajit Pal Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur

EL302 DIGITAL INTEGRATED CIRCUITS LAB #3 CMOS EDGE TRIGGERED D FLIP-FLOP. Due İLKER KALYONCU, 10043

DESIGN AND ANALYSIS OF LOW POWER STS PULSE TRIGGERED FLIP-FLOP USING 250NM CMOS TECHNOLOGY

Design of Low Power D-Flip Flop Using True Single Phase Clock (TSPC)

Design of Fault Coverage Test Pattern Generator Using LFSR

EEC 118 Lecture #9: Sequential Logic. Rajeevan Amirtharajah University of California, Davis Jeff Parkhurst Intel Corporation

MASSACHUSETTS INSTITUTE OF TECHNOLOGY Department of Electrical Engineering and Computer Science

Sequencing. Lan-Da Van ( 范倫達 ), Ph. D. Department of Computer Science National Chiao Tung University Taiwan, R.O.C. Fall,

Area Efficient Pulsed Clock Generator Using Pulsed Latch Shift Register

Low Power Approach of Clock Gating in Synchronous System like FIFO: A Novel Clock Gating Approach and Comparative Analysis

LOW POWER LEVEL CONVERTING FLIP-FLOP DESIGN BY USING CONDITIONAL DISCHARGE TECHNIQUE

Sequential Circuit Design: Part 1

CHAPTER 6 ASYNCHRONOUS QUASI DELAY INSENSITIVE TEMPLATES (QDI) BASED VITERBI DECODER

ELE2120 Digital Circuits and Systems. Tutorial Note 7

Project 6: Latches and flip-flops

Sequential Circuit Design: Part 1

POWER OPTIMIZED CLOCK GATED ALU FOR LOW POWER PROCESSOR DESIGN

Design And Analysis Of Implicit Pulsed Double Edge Triggered Clocked Latch For Low Power Applications

A Low Power Delay Buffer Using Gated Driver Tree

A Modified Static Contention Free Single Phase Clocked Flip-flop Design for Low Power Applications

Research Article Ultra Low Power, High Performance Negative Edge Triggered ECRL Energy Recovery Sequential Elements with Power Clock Gating

LFSR Counter Implementation in CMOS VLSI

A Low-Power CMOS Flip-Flop for High Performance Processors

Efficient Architecture for Flexible Prescaler Using Multimodulo Prescaler

Improve Performance of Low-Power Clock Branch Sharing Double-Edge Triggered Flip-Flop

Reduction of Clock Power in Sequential Circuits Using Multi-Bit Flip-Flops

Design of a High Frequency Dual Modulus Prescaler using Efficient TSPC Flip Flop using 180nm Technology

Dual Edge Adaptive Pulse Triggered Flip-Flop for a High Speed and Low Power Applications

P.Akila 1. P a g e 60

A Novel Low-overhead Delay Testing Technique for Arbitrary Two-Pattern Test Application

CMOS DESIGN OF FLIP-FLOP ON 120nm

Minimization of Power for the Design of an Optimal Flip Flop

Asynchronous Data Sampling Within Clock-Gated Double Edge-Triggered Flip-Flops

Chapter 4. Logic Design

EEC 116 Fall 2011 Lab #5: Pipelined 32b Adder

EFFICIENT POWER REDUCTION OF TOPOLOGICALLY COMPRESSED FLIP-FLOP AND GDI BASED FLIP FLOP

A Symmetric Differential Clock Generator for Bit-Serial Hardware

ISSN:

Comparative study on low-power high-performance standard-cell flip-flops

Retiming Sequential Circuits for Low Power

Low Power and Reduce Area Dual Edge Pulse Triggered Flip-Flop Based on Signal Feed-Through Scheme

Design of a Ternary Edge-Triggered D Flip-Flap-Flop for Multiple-Valued Sequential Logic

Final Exam review: chapter 4 and 5. Supplement 3 and 4

Design of Conditional-Boosting Flip-Flop for Ultra Low Power Applications

Power Efficient Design of Sequential Circuits using OBSC and RTPG Integration

Asynchronous Model of Flip-Flop s and Latches for Low Power Clocking

Design and Simulation of a Digital CMOS Synchronous 4-bit Up-Counter with Set and Reset

LOW POWER DOUBLE EDGE PULSE TRIGGERED FLIP FLOP DESIGN

Lecture 21: Sequential Circuits. Review: Timing Definitions

Design of Pulse Triggered Flip Flop Using Conditional Pulse Enhancement Technique

Lecture 8: Sequential Logic

Memory elements. Topics. Memory element terminology. Variations in memory elements. Clock terminology. Memory element parameters. clock.

New Single Edge Triggered Flip-Flop Design with Improved Power and Power Delay Product for Low Data Activity Applications

Energy Recovery Clocking Scheme and Flip-Flops for Ultra Low-Energy Applications


data and is used in digital networks and storage devices. CRC s are easy to implement in binary

Gated Driver Tree Based Power Optimized Multi-Bit Flip-Flops

Low-Power and Area-Efficient Shift Register Using Pulsed Latches

2.6 Reset Design Strategy

Design of low power 4-bit shift registers using conditionally pulse enhanced pulse triggered flip-flop

SEQUENTIAL CIRCUITS SEQUENTIAL CIRCUITS

Current Mode Double Edge Triggered Flip Flop with Enable

CS8803: Advanced Digital Design for Embedded Hardware

DESIGN OF LOW POWER TEST PATTERN GENERATOR

LOW-POWER CLOCK DISTRIBUTION IN EDGE TRIGGERED FLIP-FLOP

Dynamic Power Reduction in Sequential Circuit Using Clock Gating

II. ANALYSIS I. INTRODUCTION

Sequential Logic. E&CE 223 Digital Circuits and Systems (A. Kennings) Page 1

Parametric Optimization of Clocked Redundant Flip-Flop Using Transmission Gate

Use of Low Power DET Address Pointer Circuit for FIFO Memory Design

A NOVEL DESIGN OF COUNTER USING TSPC D FLIP-FLOP FOR HIGH PERFORMANCE AND LOW POWER VLSI DESIGN APPLICATIONS USING 45NM CMOS TECHNOLOGY

1. What does the signal for a static-zero hazard look like?

Power Optimization Techniques for Sequential Elements Using Pulse Triggered Flip-Flops with SVL Logic

AN EFFICIENT DOUBLE EDGE TRIGGERING FLIP FLOP (MDETFF)

Low Power Digital Design using Asynchronous Logic

Transcription:

Low-Power esign of Sequential Circuits Using a uasi-synchronous erived Clock * Xunwei Wu, Jian Wei Institute of Circuits and Systems Ningbo University Ningbo, Zhejiang 5, CHINA el: 86-574-76-5785 Fax: 86-574-76-459 email: {xunweiwu, weijian}@mail.hz.zj.cn Abstract his paper presents a novel circuit design technique to reduce the power dissipation in sequential circuits by generating a quasi-synchronous derived clock from the master clock and using it to isolate the flip flops in the circuit from the unwanted triggering action of the master clock. An example design of a decimal counter demonstrates the large power saving and improved performance of the resulting circuit. I. INROUCION In the past, the major concerns of the VLSI designer were area, performance, and cost; power consumption considerations were mostly of secondary concern. In recent years, however, this trend has begun to change and, increasingly, power consumption is being given comparable weight to area and speed in VLSI design []. One reason is that the continuing increase in the chip scale integration and the operating frequency has made power consumption a major design issue in VLSI circuits. he excessive power dissipation in integrated circuits not only discourages their use in a portable device, but also causes overheating, which degrades performance and reduces the circuit lifetime. All of these factors drive designers to devote significant resources to reduce the circuit power dissipation. Indeed, the Semiconductor Industry Association has identified low-power design as a critical technological direction []. In CMOS circuits, the dominant term of power dissipation is that which is required to charge or discharge the capacitors in the circuit. he power dissipation of a node in the circuit is expressed by the following equation: P =.5C L V fclk ESW where C L is the physical capacitance at the node, V is the supply voltage, f CLK is the clock frequency, E SW (referred to as the average switching activity) is the average number of output transitions per clock cycle /f CLK. he sequential circuit elements in a CMOS circuit are considered major contributors to the power dissipation since one input of sequential circuit elements is the clock, which is the only signal that switches all the time. In addition, the clock signal tends to be highly loaded. o distribute the clock and control the clock skew, one must construct a clock network (often a clock tree) with clock buffers. * his work was supported in part by NNSF in China under contract #69774 and NSF in USA under contract # MIP-968999. Massoud Pedram, ing Wu epartment of Electrical Engineering-Systems University of Southern California Los Angeles, CA 989, USA el: --74-4458 Fax: --74-79 email: {massoud, qingwu}@zugros.usc.edu All of this adds to the total node capacitance of the clock net, which also happens to have the largest activity (two transitions per cycles) in a synchronous circuit (ignoring possible hazard activity on same signal lines). Recent studies indicate that the clock signals in digital computers consume a large (5% - 45%) percentage of the system power. hus, reducing the clock power dissipation can greatly reduce the power dissipation in digital VLSI circuits. Most efforts for clock power reduction have focused on issues such as voltage swing reduction, buffer insertion and clock routing []. In many cases switching of the clock causes a lot of unnecessary gate activity. For that reason, reducing or suppressing the unwanted switching of the clock becomes an important way to reduce the power dissipation of sequential circuits. his goal can be achieved by two means. he first method is to eliminate the wasted power dissipation caused by the clock s switching in the non-triggering direction. he flip-flops now available are all Single-Edge riggered (SE), for example, they are only sensitive to the clock s falling edges, thus the power dissipation caused by the clock s rising edge is wasteful. For this reason, ouble-edge riggered (E) flip-flops have been developed which switch at both the falling and the rising edges of the clock. Consequently the clock frequency can be reduced by half while keeping the same data rate resulting in 5% power savings in the flip-flops [4,5]. he second method is to block the clock to the flip-flops during their holding states so as to reduce the power dissipation. In this case the clock received by the flip-flop should not be the chip master clock. his means that other clocks must be derived from the master clock which, based on certain conditions, can be slowed down or stopped completely with respect to the master clock. Obviously, this scheme results in power savings due to the following factors: i) Load on the master clock as well as the number of required buffers in the clock tree is decreased. herefore, the power dissipation of the clock tree is reduced. ii) he flip-flop receiving the derived clock is not triggered in idle cycles, hence, the corresponding dynamic power dissipation is saved. iii) he excitation function of the flip-flop triggered by a derived clock may be simplified since it has a don t-care condition in the cycle when the flip-flop is not triggered by the

derived clock. Based on the above discussion, this paper describes how to generate a secondary clock, which is derived from the clock tree and meets all design requirements, such as being glitch-free and having no additional skew. Next, we show how to use a quasi-synchronous derived clock for designing sequential circuits, which have lower dissipation and simpler combinational logic. Circuit simulation is used to check the quality of the derived clock and its ability to reduce power dissipation of sequential circuits. In conventional design, we are interested in the next state of a flip-flop, hence, flip-flops are the natural choice. In low power design, we are more concerned with whether or not the next state changes, hence, flip-flops become the preferred choice. he clock power dissipation occurs during both = and =. It is however desirable to eliminate the clock power dissipation during =. his can be accomplished by using the excitation function for input to control the clock. his idea is very effective in reducing the clock power and, as we will show during the design of a decimal counter, the use of flip-flops (instead of flip-flops) leads to simpler combinational logic, and hence further power reduction. II. GAING HE CLOCK BY USING A NO-RIGGER SIGNAL If there are flip-flops whose inputs are unchanged when a sequential circuit goes from one state to next, we can produce a not-trigger signal from the original state to cutoff the path from the master clock to these flip-flops. As a result, these flip-flops are not subjected to the clock signal and their power dissipation is accordingly reduced. Without loss of generality, consider that the flip-flops in the sequential circuit are sensitive to the clock s falling edge. Fig.(a) shows that is directly used to control the OR gate to cutoff the master clock, and the derived clock is ' =. Fig.(d) shows the timing relationship of, and '. his scheme however fails because of the following reasons: i) Suppose that is produced in cycle S and disappears in cycle S, that is, the output of the flip-flop should remain unchanged during S S and S S transitions. waveform in Fig.(d) however shows that the falling transition of in cycle S causes an unwanted transition that should have been avoided since = when S S. Besides, ' has a long delay of t f td t g with respect to the master clock, where t f is the delay time of flip-flop, t d is the delay time of the combinational circuit for generating the signal, and t g is the delay time of the OR gate in Fig.(a). ii) If the combinational circuit for generating has race hazards, these hazards may propagate to the clock signal of the flip-flop through OR gate in Fig.(a). In [6], the authors propose a scheme whereby a latch is used to filter out hazards and synchronize with the master clock as shown in Fig(b). Because the latch is in the storage state during =, the incidental glitch of can be filtered ' out and the derived is able to cutoff the master clock during the state transitions S S and S S. he derived clock '' is obtained as shown in Fig.(d). However, this scheme has the following shortcomings: i) he added latch increases the circuit complexity. ii) Since is connected to the newly added latch, the extra power dissipation of offsets some of the power saving due to the non-triggering of the flip-flop. iii) here is still the t g delay between the derived clock '' and the master clock, which results in the clock skew. As a result, the sequential circuit is not safely synchronized. In fact, we can cutoff the transmission of the clock by using the trigger signal to control an AN gate, as shown in Fig.(c). he derived clock ''' is shown in Fig.(d). his scheme has an obvious advantage due to the omission of the appended latch, but does not solve the clock skew problem. A simple idea is to appropriately delay the, and then use it as the master clock. In this way the delayed master clock will be quasi-synchronous with respect to ''', which means that the clock skew between ''' and can be made very small by appropriate sizing of transistors in the NOR gates. his leads to another idea where the presence of both and in the clock tree can be exploited. In particular, if we rewrite ' '' = as ' ' ' =, where is taken from the previous stage of the clock tree, then we can design a quasi-synchronous derived clock based on the NOR gate, as shown in Fig.. So long as we control delay of the inverter to make it nearly the same as that of the NOR gate in Fig., the skew between and ' ' ' will be very small, thus the derived clock ' ' ' will be quasi-synchronous with respect to. In Fig., we also present the circuit to produce the quasi-synchronous derived clock based on a NAN gate which is controlled by. his design is suitable for the flip-flops which trigger on the rising clock edge. Notice that the switching of the output of a flip-flop is controlled by the excitation input.

(a) (b) latch ' ' ' ' (d) ' ' ' ' S S S (c) ' '' ' '' Fig. Not-trigger signal controls the master clock (a) Cutoff the master clock through an OR gate, (b) Cutoff the master clock through a latch, (c) Cutoff the master clock through an AN gate, (d) he logic waveforms III. ESIGN OF SEUENIAL CIRCUIS USING UASI-SYNCHRONOUS ERIVE CLOCKS he flip-flop is widely used in the design of CMOS sequential circuits. However, when the not-trigger signal is used to gate the master clock s triggering action to the flip-flops, the flip-flop function is a better choice. he next-state equation of a flip-flop is =, therefore we set =, which indicates that the flip-flop switches or holds state during = or =, respectively. his equation matches with the method of using the not-trigger signal to gate the master clock as mentioned in previous section. So we just need to use the function from the design of a flip-flop to gate the master clock. aking a decimal counter as an example, the next state of the counter is shown in able II. If flip-flops are used as is typically the case, we will obtain Karnaugh maps for the excitation functions,, and from the next states in ab., as shown in Fig.(a). In these maps, an empty box represents the don t-care condition. he optimized excitation functions are: =, = ( ), =, =. ' '' ' '' (master clock) Fig. uasi-synchronous clocks derived from the master clock. he corresponding circuit realization is shown in Fig.(b) which is a traditional synchronous design for a decimal counter. Now we use flip-flops in the design instead. Since =, from the next-state Karnaugh maps in Fig.(a), the Karnaugh maps of the excitation functions,, and are obtained as shown in Fig.4(a). he optimized excitation functions are: = = (, )

=, =, =. At the first glance, the above excitation functions are simpler than those of flip-flops. However, if we construct a flip-flop from flip-flop an extra XOR gate has to be attached to produce =. hus, we didn t adopt flip-flops in usual sequential designs. ABLE I SAE ABLE OF A ECIMAL COUNER (b) (a) Fig. Synchronous design of a decimal counter. (a) Karnaugh maps of,, and, (b) circuit realization Assuming that the flip-flops are sensitive to the falling edge of the clock signal, we can adopt the method of producing derived clock based on the NOR gate in Fig.. he clock signals for these four flip-flops are: = = =, = = =, = = =, = = = From the clock functions, we construct the circuit realizations shown in Fig.4(b). Notice that in this circuit, to. derive, the n part of the CMOS configuration for realizing is composed of two series nmos transistors parallel-connected with an nmos transistor, and the p part is composed of three pmos transistors with the dual configuration [7]. So the complexity of the circuit for realizing is equal to that of a -input NOR gate. From this, the circuit construction in Fig.4(a) is clearly simpler than its counterpart in Fig(a). he simple construction of the combinational circuit results in low power dissipation. However the low power

dissipation property is mostly achieved as a result of gating the clock. Besides flip-flop, the three flip-flops, and have no dynamic power dissipation when there is no triggering of the clock. We simulated the new design in Fig.4(b) by PSPICE with.5μ CMOS technology and the following MOS parameters: nmos Ö level= phi=.7 tox=9.6e-9 xj=.u tpg= vto=.6566 delta=6.9e- ld=4.79e-8 kp=.9647e-4 uo=546. theta=.684e- (a) (b) Fig. 4 uasi-synchronous design of a decimal counter. (a) Karnaugh maps of,, and, (b) he circuit realization rsh=.5e gamma=.5976 nsub=.9e7 nfs=5.99e vmax=.8e5 eta=.78e- kappa=.898e- cgdo=.55e- cgso=.55e- cgbo=4.9e- cj=5.6e-4 mj=.559 cjsw=5.e- mjsw=.5 pb=.99 pmos Ö level= phi=.7 tox=9.6e-9 xj=.u tpg=- vto=-.9 delta=.875e- ld=.57e-8 kp=4.874e-5 uo=5.5 theta=.87e- rsh=.e- gamma=.467 nsub=8.5e6 nfs=6.5e vmax=.54e5 eta=.45e- kappa=7.958e cgdo=.9e- cgso=.9e- cgbo=.7579e- cj=9.5e-4 mj=.468 cjsw=.89e- mjsw=.55 pb=.99. he transient analysis shown in Fig.5 proves that the new design has the expected logic operation. he -to- average delays for the four flip-flops are listed in able II, which shows that the flip-flops in Fig.4(b) can synchronously be switched. For comparison, the delays of the synchronous design in Fig(b) are also given in the table. As we can see, the average delays are comparable. ABLE II AVERAGE ELAY OF FLIP-FLOPS (ns) esign in Fig.4(b).5.59.6.76 esign in Fig.(b).46.45.4.4 Fig. 5 ransient analysis.

with respect to the master clock and can be used to isolate the triggered flip-flops from the master clock in their idle cycles. he achieved power saving is significant as shown by the example design of a decimal counter. he circuit simulation proved the quality of the new derived clock and its ability to reduce power dissipation. In this paper we only provided a few examples to illustrate the basic ideas. Our purpose is to introduce the design method and indicate that the engineering issues related to the use of gated clocks could be resolved for practical applications, opening the path for adoption of the clock-gating technique in the design of low power sequential circuits. REFERENCES Fig. 6 Energy dissipation versus time. (i) uasi-synchronous design of a decimal counter (ii) Synchronous design of a decimal counter We also measure the power dissipation of the synchronous design in Fig.(b) and the quasi-synchronous design in Fig.4(b). he energy dissipation diagrams are shown in Fig.6, and prove that the later design reduces the power dissipation by 5%. his is expected since à and waveforms in Fig.5 show that in a decimal counting cycle flip-flops, and are triggered only 4, 4 and times, respectively. Furthermore, we know that they have no dynamic power dissipation when not triggered, and that the simpler combinational circuit in Fig.4(b) has simpler structure which results in low power dissipation. IV. CONCLUSION We presented a procedure for gating the clock by means of the not-trigger signal. he derived clock is quasi-synchronous []. J. Rabaey and M. Pedram, Low Power esign Methodologies, Kluwer Academic Publishers, Norwell, 996. []. he National echnology Roadmap for Semiconductors, echnology needs, Semiconductor Industry Association, pp.7-8, the 997 Edition. []. G. Friedman, Clock distribution design in VLSI circuits: an overview, Proc. IEEE ISCAS, San Jose, pp. 475-478, 994. [4]. R. Hossain, L.. Wronski and A. Albicki, Low power design using double edge triggered flip-flops, IEEE rans. VLSI Systems, vol., no., pp. 6-65, June 994. [5]. M. Pedram,. Wu and X. Wu, A new design of double edge triggered flip-flops, Proc. ASP-AC, Yokohama, pp. 47-4, Feb. 998. [6]. L. Benini and G. e Micheli, ransformation and synthesis of FSMs for low power gated clock implementation, Proc. Int. Symp. Workshop on Low Power esign, pp.-6, Apr. 995. [7]. N. H. E. West, K. Eshraghian, Principles of CMOS VLSI esign: A System Perspective, nd Edition, Addison-Wesley Publishing Co., New Work, 99.