A Design Of A Low Power Delay Buffer Using Ring Counter Addressing Schemes

Similar documents
A Low Power Delay Buffer Using Gated Driver Tree

Gated Driver Tree Based Power Optimized Multi-Bit Flip-Flops

Low-Power Delay Buffer Design Using Asymmetric C-Element Gated Clock Strategy

AN OPTIMIZED IMPLEMENTATION OF MULTI- BIT FLIP-FLOP USING VERILOG

AN EFFICIENT LOW POWER DESIGN FOR ASYNCHRONOUS DATA SAMPLING IN DOUBLE EDGE TRIGGERED FLIP-FLOPS

LFSR Counter Implementation in CMOS VLSI

LOW POWER AND HIGH PERFORMANCE SHIFT REGISTERS USING PULSED LATCH TECHNIQUE

Use of Low Power DET Address Pointer Circuit for FIFO Memory Design

Area Efficient Pulsed Clock Generator Using Pulsed Latch Shift Register

ISSN Vol.08,Issue.24, December-2016, Pages:

FAULT SECURE ENCODER AND DECODER WITH CLOCK GATING

HIGH PERFORMANCE AND LOW POWER ASYNCHRONOUS DATA SAMPLING WITH POWER GATED DOUBLE EDGE TRIGGERED FLIP-FLOP

Abstract 1. INTRODUCTION. Cheekati Sirisha, IJECS Volume 05 Issue 10 Oct., 2016 Page No Page 18532

An FPGA Implementation of Shift Register Using Pulsed Latches

Chapter 6. Flip-Flops and Simple Flip-Flop Applications

PERFORMANCE ANALYSIS OF AN EFFICIENT PULSE-TRIGGERED FLIP FLOPS FOR ULTRA LOW POWER APPLICATIONS

A NOVEL DESIGN OF COUNTER USING TSPC D FLIP-FLOP FOR HIGH PERFORMANCE AND LOW POWER VLSI DESIGN APPLICATIONS USING 45NM CMOS TECHNOLOGY

VLSI Technology used in Auto-Scan Delay Testing Design For Bench Mark Circuits

A Power Efficient Flip Flop by using 90nm Technology

Improve Performance of Low-Power Clock Branch Sharing Double-Edge Triggered Flip-Flop

EFFICIENT DESIGN OF SHIFT REGISTER FOR AREA AND POWER REDUCTION USING PULSED LATCH

Design of Fault Coverage Test Pattern Generator Using LFSR

data and is used in digital networks and storage devices. CRC s are easy to implement in binary

Design of a Low Power and Area Efficient Flip Flop With Embedded Logic Module

Objectives. Combinational logics Sequential logics Finite state machine Arithmetic circuits Datapath

DIFFERENTIAL CONDITIONAL CAPTURING FLIP-FLOP TECHNIQUE USED FOR LOW POWER CONSUMPTION IN CLOCKING SCHEME

Fully Static and Compressed Topology Using Power Saving in Digital circuits for Reduced Transistor Flip flop

A Modified Static Contention Free Single Phase Clocked Flip-flop Design for Low Power Applications

Power Optimization by Using Multi-Bit Flip-Flops

Design of New Dual Edge Triggered Sense Amplifier Flip-Flop with Low Area and Power Efficient

Random Access Scan. Veeraraghavan Ramamurthy Dept. of Electrical and Computer Engineering Auburn University, Auburn, AL

Figure.1 Clock signal II. SYSTEM ANALYSIS

Sequencing. Lan-Da Van ( 范倫達 ), Ph. D. Department of Computer Science National Chiao Tung University Taiwan, R.O.C. Fall,

Dual Edge Adaptive Pulse Triggered Flip-Flop for a High Speed and Low Power Applications

REDUCING DYNAMIC POWER BY PULSED LATCH AND MULTIPLE PULSE GENERATOR IN CLOCKTREE

Combinational vs Sequential

Chapter 4. Logic Design

IT T35 Digital system desigm y - ii /s - iii

Design Low-Power and Area-Efficient Shift Register using SSASPL Pulsed Latch

Optimized Design and Simulation of 4-Bit Johnson Ring Counter Using 90nm Technology

High Speed 8-bit Counters using State Excitation Logic and their Application in Frequency Divider

LUT Optimization for Memory Based Computation using Modified OMS Technique

Power Efficient Design of Sequential Circuits using OBSC and RTPG Integration

DESIGN OF LOW POWER TEST PATTERN GENERATOR

A Symmetric Differential Clock Generator for Bit-Serial Hardware

Vignana Bharathi Institute of Technology UNIT 4 DLD

Low-Power and Area-Efficient Shift Register Using Pulsed Latches

DESIGN OF A NEW MODIFIED CLOCK GATED SENSE-AMPLIFIER FLIP-FLOP

P.Akila 1. P a g e 60

Analysis of Digitally Controlled Delay Loop-NAND Gate for Glitch Free Design

International Journal of Emerging Technologies in Computational and Applied Sciences (IJETCAS)

Low Power Approach of Clock Gating in Synchronous System like FIFO: A Novel Clock Gating Approach and Comparative Analysis

11. Sequential Elements

University College of Engineering, JNTUK, Kakinada, India Member of Technical Staff, Seerakademi, Hyderabad

Low Power Area Efficient Parallel Counter Architecture

Optimized Magnetic Flip-Flop Combined With Flash Architecture for Memory Unit Based On Sleep Transistor

An Efficient Power Saving Latch Based Flip- Flop Design for Low Power Applications

Design of a High Frequency Dual Modulus Prescaler using Efficient TSPC Flip Flop using 180nm Technology

Efficient Architecture for Flexible Prescaler Using Multimodulo Prescaler

Design of Pulse Triggered Flip Flop Using Conditional Pulse Enhancement Technique

Reduction of Clock Power in Sequential Circuits Using Multi-Bit Flip-Flops

Logic Design. Flip Flops, Registers and Counters

Power Reduction Techniques for a Spread Spectrum Based Correlator

COMP2611: Computer Organization. Introduction to Digital Logic

SIC Vector Generation Using Test per Clock and Test per Scan

Implementation of Low Power and Area Efficient Carry Select Adder

Dual Edge Triggered Flip-Flops Based On C-Element Using Dual Sleep and Dual Slack Techniques

Serial In/Serial Left/Serial Out Operation

Comparative Analysis of Pulsed Latch and Flip-Flop based Shift Registers for High-Performance and Low-Power Systems

LOW POWER DOUBLE EDGE PULSE TRIGGERED FLIP FLOP DESIGN

POWER OPTIMIZED CLOCK GATED ALU FOR LOW POWER PROCESSOR DESIGN

Reduction of Area and Power of Shift Register Using Pulsed Latches

International Journal Of Global Innovations -Vol.6, Issue.I Paper Id: SP-V6-I1-P46 ISSN Online:

CHAPTER 6 DESIGN OF HIGH SPEED COUNTER USING PIPELINING

DESIGN OF DOUBLE PULSE TRIGGERED FLIP-FLOP BASED ON SIGNAL FEED THROUGH SCHEME

HIGH SPEED CLOCK DISTRIBUTION NETWORK USING CURRENT MODE DOUBLE EDGE TRIGGERED FLIP FLOP WITH ENABLE

GLITCH FREE NAND BASED DCDL IN PHASE LOCKED LOOP APPLICATION

WINTER 15 EXAMINATION Model Answer

D Latch (Transparent Latch)

VTU NOTES QUESTION PAPERS NEWS RESULTS FORUMS Registers

NH 67, Karur Trichy Highways, Puliyur C.F, Karur District UNIT-III SEQUENTIAL CIRCUITS

DESIGN OF EFFICIENT SHIFT REGISTERS USING PULSED LATCHES

INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET)

Implementation of BIST Test Generation Scheme based on Single and Programmable Twisted Ring Counters

Logic Design Viva Question Bank Compiled By Channveer Patil


Current Mode Double Edge Triggered Flip Flop with Enable

Low Power VLSI Circuits and Systems Prof. Ajit Pal Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur

Modified Ultra-Low Power NAND Based Multiplexer and Flip-Flop

Design a Low Power Flip-Flop Based on a Signal Feed-Through Scheme

Counter dan Register

Name Of The Experiment: Sequential circuit design Latch, Flip-flop and Registers

CMOS Low Power, High Speed Dual- Modulus32/33Prescalerin sub-nanometer Technology

Dual Slope ADC Design from Power, Speed and Area Perspectives

LOW-POWER CLOCK DISTRIBUTION IN EDGE TRIGGERED FLIP-FLOP

Design of Low Power D-Flip Flop Using True Single Phase Clock (TSPC)

CMOS DESIGN OF FLIP-FLOP ON 120nm

Design of Low Power and Area Efficient Pulsed Latch Based Shift Register

DUAL EDGE-TRIGGERED D-TYPE FLIP-FLOP WITH LOW POWER CONSUMPTION

Low Power D Flip Flop Using Static Pass Transistor Logic

Transcription:

A Design Of A Low Power Delay Buffer Using Ring Counter Addressing Schemes B.R.B Jaswanth St.Theresa Institute of Engineering and Technology Gudiwada, India Abstract This work presents circuit design of a low-power delay buffer. The proposed delay buffer uses several new techniques to reduce its power consumption. Since delay buffers are accessed sequentially, it adopts a ring-counter addressing scheme. In the ring counter, double-edge-triggered (DET) flipflops are utilized to reduce the operating frequency by half and the C-element gated-clock strategy is proposed. A novel gatedclock-driver tree is then applied to further reduce the activity along the clock distribution network. Moreover, the gated-drivertree idea is also employed in the input and output ports of the memory block to decrease their loading, thus saving even more power. Keywords- Low Power Delay Buffer, Double Edge Triggered Flip Flop, Ring Counter. I. INTRODUCTION The skyrocketing increasing transistor count and circuit density of modern very large scale integrated (VLSI) circuits have made them extremely difficult and expensive to test comprehensively. In a digital processing chip of mobile communications, the delay buffer takes up a large portion of the circuit layout. If the power consumption of the delay buffer could be reduced significantly, the overall power consumption of the digital processing chip could be reduced significantly as well. On the other hand, as these chips are working at even higher operation frequencies, a new, low-power delay buffer should be operable under high frequencies. FIG. 1 is a schematic diagram showing a conventional delay buffer having a length N and data width W bits using shift registers. As illustrated, the delay buffer contains N times. W shift registers 10, arranged between the input and the output in N stages, each with W shift registers. The N times W shift registers are triggered by a same clock signal CLK. For every clock period of CLK, W-bit data is shifted from W shift registers of a previous stage to those of a next stage, and so on. A W-bit data input N clock periods ago therefore would be delayed and output after N clock periods. The clock signal CLK is provided to all N times W shift registers, contributing to the high power consumption. Moreover, the N times W shift registers would also take up a large die area. Therefore, in real life, delay buffer such as the one. One of the common delay buffer implementation is a dualport SRAM memory whose operation is different from that of the shift-register-based delay buffer. For an N times W SRAMbased delay buffer, there is no data movement between stages. Instead, at every clock period, a W-bit data is written to one of R.V.S Rayudu, K.Mani babu, R.Himaja, L.Veda kumar V.K.R, V.N.B & A.G.K College of Engineering Gudiwada, India the N times W storage locations of the SRAM-based delay buffer, and another W-bit data that is written N clock periods ago is output. The power consumption of a SRAM-based delay buffer is mainly from the address decoder and the drivers for its input and output ports. As memory related technology has already quite mature and satisfactory results in terms of layout area and speed are achievable. Therefore in reality a delay buffer is often implemented using SRAM memory. II. MEMORY ORGANIZATION Memory organization is two-fold. First we discuss the hardware (physical) organization, then the internal architecture. The type of computer and its size do not reflect the type of memories that the computer uses. Some computers have a mixture of memory types. For example, they may use some type of magnetic memory (core or film) and also a semiconductor memory (static or dynamic). They also have a read-only memory which is usually a part of the CPU. Memory in a computer can vary from one or more modules to one or more pcb s, depending on the computer type. The larger mainframe computers use the modular arrangement, multiple modules (four or more), to make up their memories whereas, minicomputers and microcomputers use chassis or assemblies, cages or racks, and motherboard or backplane arrangements. Minis and micros use multiple components on one PCB or groups of PCB have to form the memory. There are several ways to organize memories with respect to the way they are connected to the cache: one-word-wide memory organization wide memory organization interleaved memory organization independent memory organization III. DELAY BUFFER This section describes PJMEDIA's implementation of delay buffer. Delay buffer works quite similarly like a fixed jitter buffer, that is it will delay the frame retrieval by some interval so that caller will get continuous frame from the buffer. This can be useful when the operations are not evenly interleaved, for example when caller performs burst of put() operations and then followed by burst of operations. With using this delay buffer, the buffer will put the burst frames into a buffer so that get() operations will always get a frame from the buffer (assuming that the number of get() and put() are matched). 99 www.ijtel.org

The buffer is adaptive, that is it continuously learns the optimal delay to be applied to the audio flow at run-time. Once the optimal delay has been learned, the delay buffer will apply this delay to the audio flow, expanding or shrinking the audio samples as necessary when the actual audio samples in the buffer are too low or too high. A. EXISTING TECHNIQUE Figure 1. Memory Buffer Figure 2. Bloch Diagram of Existing Technique 1) INPUT BUFFER The Input buffer is also commonly known as the input area or input block. When referring to computer memory, the input buffer is a location that holds all incoming information before it continues to the CPU for processing. Input buffer can be also used to describe various other hardware or software buffers used to store information before it is processed. Some scanners (such as those which support include files) require reading from several input streams. As flex scanners do a large amount of buffering, one cannot control where the next input will be read from by simply writing a YY_INPUT() which is sensitive to the scanning context. YY_INPUT() is only called when the scanner reaches the end of its buffer, which may be a long time after scanning a statement such as an include statement which requires switching the input source. Figure 3. Input Buffer 2) MEMORY BLOCK Random-access memory (RAM) is a form of computer data storage. Today, it takes the form of integrated circuits that allow stored data to be accessed in any order (that is, at random). "Random" refers to the idea that any piece of data can be returned in a constant time, regardless of its physical location and whether it is related to the previous piece of data. The word "RAM" is often associated with volatile types of memory (such as DRAM memory modules), where the information is lost after the power is switched off. Many other types of memory are RAM as well, including most types of ROM and a type of flash memory called NOR-Flash. Scan design has been the backbone of design for testability (DFT) in industry for about three decades because scan-based design can successfully obtain controllability and observability for flip-flops. Serial Scan design has dominated the test architecture because it is convenient to build. However, the serial scan design causes unnecessary switching activity during testing which induce unnecessarily enormous power dissipation. The test time also increases dramatically with the continuously increasing number of flip-flops in large sequential circuits even using multiple scan chain architecture. An alternate to serial scan architecture is Random Access Scan (RAS). In RAS, flip-flops work as addressable memory elements in the test mode which is a similar fashion as random access memory (RAM). This approach reduces the time of setting and observing the flip-flop states but requires a large overhead both in gates and test pins. Despite of these drawbacks, the RAS was paid attention by many researchers in these years. This paper takes a view of recently published papers on RAS and rejuvenates the random access scan as a DFT method that simultaneously address three limitations of the traditional serial scan namely, test data volume, test application time, and test power. 3) RING COUNTER A ring counter is a type of counter composed of a circular shift register. The output of the last shift register is fed to the input of the first register. There are two types of ring counters: A straight ring counter or Overbeck counter connects the output of the last shift register to the first shift register input and circulates a single one (or zero) bit around the ring. For example, in a 4- register one-hot counter, with initial register values of 1000, the repeating pattern is: 1000, 0100, 0010, 0001, 1000.... Note that one of the registers must be pre-loaded with a 1 (or 0) in order to operate properly. A twisted ring counter (also called Johnson counter or Moebius counter) connects the complement of the output of the last shift register to its input and circulates a stream of ones followed by zeros around the ring. For example, in a 4-register counter, with initial register values of 0000, the repeating pattern is: 0000, 1000, 1100, 1110, 1111, 0111, 0011, 0001, 0000.... If the output of a shift register is fed back to the input. a ring counter results. The data pattern contained within the shift register will re-circulate as long as clock pulses are applied. For example, the data pattern will repeat every four clock pulses in the figure below. However, we must load a data pattern. All 0's 100 www.ijtel.org

or all 1's doesn't count. Is a continuous logic level from such a condition useful? We make provisions for loading data into the parallel-in/ serial-out shift register configured as a ring counter below. Any random pattern may be loaded. The most generally useful pattern is a single 1. B. PROPOSED TECHNIQUE: Figure 7. Block Diagram of Proposed Technique 1) GATED DRIVER TREE: Figure 4. Parallel-in Serial-Out Shift Register Loading binary 1000 into the ring counter, above, prior to shifting yields a viewable pattern. The data pattern for a single stage repeats every four clock pulses in our 4-stage example. The waveforms for all four stages look the same, except for the one clock time delay from one stage to the next. See figure below. Figure 8. Gated Driver Tree Figure 5. 4-Stage Ring Counter and Shift It is derived from the same clock gating signals of the blocks that they drive. Thus, in a quad-tree clock distribution network, the gate signal of the th gate driver at the th level (CKE) should be asserted when the active DET flip-flop. 2) MODIFIED RING COUNTER: Figure 6. Ring counter with SR flip-flops The above block diagram shows the power controlled Ring counter. First, total block is divided into two blocks. Each block is having one SR FLIPFLOP controller. Figure 9. Modified Ring Counter 101 www.ijtel.org

DET :(Double edge triggered flip-flops: double-edgetriggered (DET) flip-flops are utilized to reduce the operating frequency by half The logic construction of a doubleedge-triggered (DET) flip-flop, which can receive input signal at two levels the clock, is analyzed and a new circuit design of CMOS DET In this paper, we propose to use doubleedge-triggered (DET) flip-flops instead of traditional DFFs in the ring counter to halve the operating clock frequency. Double edge-triggered flipflops are becoming a popular technique for low-power designs since they effectively enable a halving of the clock frequency. The paper by Hossain et al[1] showed that while a single-edge triggered flipflop can be implemented by two transparent latches in series, a double edge-triggered flipflop can be implemented by two transparent latches in parallel; the circuit in Fig. 1 was given for the static flipflop implementation. The clock signal is assumed to be inverted locally. In high noise or low-voltage environments, Hossain et al noted that the p-type pass-transistors may be replaced by n- types or that all pass-transistors may be replaced by transmission gates. IV. SIMULATION AND SYNTHESIS To obtain a further insight about the scalability of the proposed delay buffer architecture in nanometer CMOS technology, simulations of the proposed buffer with several different lengths in 90-nm and 65-nm CMOS technology are run. To alleviate the leakage power problem, dual-vt MOS transistors are used in the 90-nm technology. The low Vt MOS transistors only exist in write-enable and read-enable pass gates between bit lines and memory cells to provide enough drying, the supply voltages are 1V and 0.85V in 90-nm cases, respectively, and the operating frequency is 200 MHz. The total power consumption in normal operation mode is not logarithmically proportional to the length of the delay buffer in snorted. Instead, due to the quad tree structure for all the driving circuitry, delay buffers of length and have approximate dynamic power because basically these two cases activate the same number of drivers. One can see that the superiority of the proposed circuit is still obvious in 90-nm technology in that the leakage power is almost negligible. A. SIMULATION AND SYNTHESIS POWER REPORTS OF THE EXISTING DELAY BUFFER Even in the more advanced 65-nm technology, the leakage power can be controlled to within unacceptable level for medium-length delay buffers with the dual-vt approach. For longer-length delay buffers and for more advanced technology, other leakage reduction techniques such as the sleep transistors in SRAM (Latch). To highlight the result of the proposed delay buffer, the simulation of the previous existing buffer is of paramount importance. Hence the simulation and the synthesis results of the existing buffer are preceded by the simulation and synthesis power reports of the proposed delay buffer. Figure 10. Simulation Power Report of the Existing Delay Buffer The existing methodology is simulated using the synthesis and simulation tools and the analysis is drawn out as shown above which tells that the power levels and the time taken are significantly high and the relative comparison is done by simulating the present, proposed methodology. Figure 11. The Simulation Project Report of the Proposed Delay Buffer 102 www.ijtel.org

B. PROPOSED Figure 13. Existing Technique over All Power Report Figure 12. Synthesis Power Report of the Existing Delay Buffer The properties and simulation results are analyzed by implementing on the tool simulator Xilinx with Spartan and device XC25100 with FG256 package. The synthesis tool is XST (verilog/vhdl) and the simulation tool is ISE simulator(vhdl/verilog) To highlight the result of the proposed delay buffer, the simulation of the previous existing buffer is of paramount importance. Hence the simulation and the synthesis results of the existing buffer are processed by the simulation and synthesis power reports of the proposed delay buffer. The power report represents the consumption criteria and dissipation criteria of the existing and the proposed present techniques. The relative comparison is made based on the two methodologies using the same synthesis and simulation tools and the proportionate results speak that the robustness of the present proposed design methodology is in the lime light. A. EXISTING V. OVER-ALL POWER REPORT Figure 14. Proposed Technique over All Power Report VI. CONCLUSION We have analyzed the existing method and proposed method using the same synthesis and simulator tool. The simulation and the synthesis results of the existing buffer are processed by the simulation and synthesis power reports of the proposed delay buffer and the proportionate results speak that the robustness of the present proposed design methodology is in the lime light. REFERENCES [1] W. Eberle et al., 80-Mb/s QPSK and 72-Mb/s 64-QAM flexible and scalable digital OFDM transceiver ASICs for wireless local area Networks in the 5-GHz band, IEEE J. Solid-State Circuits, vol. 36, no. 11, pp. 1829 1838, Nov. 2001. [2] M. L. Liou, P. H. Lin, C. J. Jan, S. C. Lin, and T. D. Chiueh, Design of an OFDM baseband receiver with space diversity, IEE Proc. Commun., vol. 153, no. 6, pp. 894 900, Dec. 2006. [3] G. Pastuszak, A high-performance architecture for embedded block coding in JPEG 2000, IEEE Trans. Circuits Syst. Video Technol., vol. 15, no. 9, pp. 1182 1191, Sep. 2005. [4] W. Li and L.Wanhammar, A pipeline FFT processor, in Proc. Workshop Signal Process. Syst. Design Implement., 1999, pp. 654 662. [5] E. K. Tsern and T. H. Meng, A low-power video-rate pyramid VQ decoder, IEEE J. Solid-State Circuits, vol. 31, no. 11, pp. 1789 1794, Nov. 1996. [6] N. Shibata, M.Watanabe, and Y. Tanabe, A current-sensed high-speed and low-power first-in-first-out memory using a wordline/bitlineswapped dual-port SRAM cell, IEEE J. Solid-State circuits, vol. 37, no. 6, pp. 735 750, Jun. 2002. [7] E. Sutherland, Micropipelines, Commun. ACM, vol. 32, no. 6, pp. 720 738, Jun. 1989. [8] A Low-Power Delay Buffer Using Gated Driver Tree Po-chun hsieh, jing-siang jhuang, pei-yun tsai, member,ieee, and tzi-dar chiueh, senior member, IEEE. 103 www.ijtel.org