Lecture 26: Multipliers. Final presentations May 8, 1-5pm, BWRC Final reports due May 7 Final exam, Monday, May :30pm, 241 Cory

Similar documents
EE241 - Spring 2007 Advanced Digital Integrated Circuits. Announcements

Lecture 21: Sequential Circuits. Review: Timing Definitions

Homework 3 posted this week, due after Spring break Quiz #2 today Midterm project report due on Wednesday No office hour today

EE-382M VLSI II FLIP-FLOPS

ISSCC 2003 / SESSION 19 / PROCESSOR BUILDING BLOCKS / PAPER 19.5

II. ANALYSIS I. INTRODUCTION

Lecture 6. Clocked Elements

11. Sequential Elements

Digital System Clocking: High-Performance and Low-Power Aspects

High Performance Dynamic Hybrid Flip-Flop For Pipeline Stages with Methodical Implanted Logic

Comparative study on low-power high-performance standard-cell flip-flops

Energy-Delay Space Analysis for Clocked Storage Elements Under Process Variations

Topic 8. Sequential Circuits 1

EE241 - Spring 2005 Advanced Digital Integrated Circuits

ECE321 Electronics I

MASSACHUSETTS INSTITUTE OF TECHNOLOGY Department of Electrical Engineering and Computer Science

Hardware Design I Chap. 5 Memory elements

Sequential Circuit Design: Part 1

Sequencing. Lan-Da Van ( 范倫達 ), Ph. D. Department of Computer Science National Chiao Tung University Taiwan, R.O.C. Fall,

Sequential Circuit Design: Part 1

EEC 118 Lecture #9: Sequential Logic. Rajeevan Amirtharajah University of California, Davis Jeff Parkhurst Intel Corporation

A NOVEL APPROACH TO ACHIEVE HIGH SPEED LOW-POWER HYBRID FLIP-FLOP

Memory elements. Topics. Memory element terminology. Variations in memory elements. Clock terminology. Memory element parameters. clock.

Design of a Low Power and Area Efficient Flip Flop With Embedded Logic Module

EE241 - Spring 2013 Advanced Digital Integrated Circuits. Announcements. Lecture 14: Statistical timing Latches

AN EFFICIENT DOUBLE EDGE TRIGGERING FLIP FLOP (MDETFF)

A Power Efficient Flip Flop by using 90nm Technology

EE141-Fall 2010 Digital Integrated Circuits. Announcements. Homework #8 due next Tuesday. Project Phase 3 plan due this Sat.

UNIT III COMBINATIONAL AND SEQUENTIAL CIRCUIT DESIGN

Load-Sensitive Flip-Flop Characterization

Area Efficient Pulsed Clock Generator Using Pulsed Latch Shift Register

A Unified Approach in the Analysis of Latches and Flip-Flops for Low-Power Systems

Clocking Spring /18/05

ECEN454 Digital Integrated Circuit Design. Sequential Circuits. Sequencing. Output depends on current inputs

Modeling and designing of Sense Amplifier based Flip-Flop using Cadence tool at 45nm

Clock - key to synchronous systems. Lecture 7. Clocking Strategies in VLSI Systems. Latch vs Flip-Flop. Clock for timing synchronization

Clock - key to synchronous systems. Topic 7. Clocking Strategies in VLSI Systems. Latch vs Flip-Flop. Clock for timing synchronization

Low-Power and Area-Efficient Shift Register Using Pulsed Latches

data and is used in digital networks and storage devices. CRC s are easy to implement in binary

EE141-Fall 2010 Digital Integrated Circuits. Announcements. Synchronous Timing. Latch Parameters. Class Material. Homework #8 due next Tuesday

ECE 341. Lecture # 2

CPE/EE 427, CPE 527 VLSI Design I Sequential Circuits. Sequencing

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2011

Lecture 11: Sequential Circuit Design

(CSC-3501) Lecture 7 (07 Feb 2008) Seung-Jong Park (Jay) CSC S.J. Park. Announcement

Sequential Logic. References:

55:131 Introduction to VLSI Design Project #1 -- Fall 2009 Counter built from NAND gates, timing Due Date: Friday October 9, 2009.

D Latch (Transparent Latch)

Energy Recovery Clocking Scheme and Flip-Flops for Ultra Low-Energy Applications

Digital System Clocking: High-Performance and Low-Power Aspects

12-bit Wallace Tree Multiplier CMPEN 411 Final Report Matthew Poremba 5/1/2009

An efficient Sense amplifier based Flip-Flop design

An Optimized Implementation of Pulse Triggered Flip-flop Based on Single Feed-Through Scheme in FPGA Technology

CMOS Latches and Flip-Flops

Asynchronous Data Sampling Within Clock-Gated Double Edge-Triggered Flip-Flops

DESIGN OF DOUBLE PULSE TRIGGERED FLIP-FLOP BASED ON SIGNAL FEED THROUGH SCHEME

Digital Logic Design ENEE x. Lecture 19

Dual Edge Adaptive Pulse Triggered Flip-Flop for a High Speed and Low Power Applications

cascading flip-flops for proper operation clock skew Hardware description languages and sequential logic

An FPGA Implementation of Shift Register Using Pulsed Latches

Optimization of Scannable Latches for Low Energy

CS/EE 6710 Digital VLSI Design CAD Assignment #3 Due Thursday September 21 st, 5:00pm

International Journal of Engineering Research in Electronics and Communication Engineering (IJERECE) Vol 1, Issue 6, June 2015 I.

Asynchronous Model of Flip-Flop s and Latches for Low Power Clocking

EE 447/547 VLSI Design. Lecture 9: Sequential Circuits. VLSI Design EE 447/547 Sequential circuits 1

SYNCHRONOUS DERIVED CLOCK AND SYNTHESIS OF LOW POWER SEQUENTIAL CIRCUITS *

EECS150 - Digital Design Lecture 3 - Timing

CprE 281: Digital Logic

Unit 11. Latches and Flip-Flops

Low Power and Reduce Area Dual Edge Pulse Triggered Flip-Flop Based on Signal Feed-Through Scheme

High performance and Low power FIR Filter Design Based on Sharing Multiplication

Comparative Analysis of Pulsed Latch and Flip-Flop based Shift Registers for High-Performance and Low-Power Systems

Lecture 10: Sequential Circuits

Chapter 6. Flip-Flops and Simple Flip-Flop Applications

Improve Performance of Low-Power Clock Branch Sharing Double-Edge Triggered Flip-Flop

More on Flip-Flops Digital Design and Computer Architecture: ARM Edition 2015 Chapter 3 <98> 98

DESIGN AND ANALYSIS OF LOW POWER STS PULSE TRIGGERED FLIP-FLOP USING 250NM CMOS TECHNOLOGY

Advanced Devices. Registers Counters Multiplexers Decoders Adders. CSC258 Lecture Slides Steve Engels, 2006 Slide 1 of 20

Tutorial Outline. Design Levels

EFFICIENT TIMING ELEMENT DESIGN FEATURING LOW POWER VLSI APPLICATIONS

Abstract 1. INTRODUCTION. Cheekati Sirisha, IJECS Volume 05 Issue 10 Oct., 2016 Page No Page 18532

Timing Error Detection: An Adaptive Scheme To Combat Variability EE241 Final Report Nathan Narevsky and Richard Ott {nnarevsky,

Experiment 8 Introduction to Latches and Flip-Flops and registers

Improved Sense-Amplifier-Based Flip-Flop: Design and Measurements

HIGH PERFORMANCE AND LOW POWER ASYNCHRONOUS DATA SAMPLING WITH POWER GATED DOUBLE EDGE TRIGGERED FLIP-FLOP

New Single Edge Triggered Flip-Flop Design with Improved Power and Power Delay Product for Low Data Activity Applications

Chapter 2. Digital Circuits

Digital Integrated Circuits EECS 312

Design And Analysis of Clocked Subsystem Elements Using Leakage Reduction Technique

Counter dan Register

GLITCH FREE NAND BASED DCDL IN PHASE LOCKED LOOP APPLICATION

The outputs are formed by a combinational logic function of the inputs to the circuit or the values stored in the flip-flops (or both).

Digital System Clocking: High-Performance and Low-Power Aspects. Microprocessor Examples

Logic Design. Flip Flops, Registers and Counters

EL302 DIGITAL INTEGRATED CIRCUITS LAB #3 CMOS EDGE TRIGGERED D FLIP-FLOP. Due İLKER KALYONCU, 10043

CSE115: Digital Design Lecture 23: Latches & Flip-Flops

A Low-Power CMOS Flip-Flop for High Performance Processors

Asynchronous (Ripple) Counters

Power Optimization Techniques for Sequential Elements Using Pulse Triggered Flip-Flops with SVL Logic

Sequential logic. Circuits with feedback. How to control feedback? Sequential circuits. Timing methodologies. Basic registers

Transcription:

EE241 - Spring 2008 Advanced Digital Integrated Circuits Lecture 26: Multipliers Latches Announcements Homework 5 Due today Wrapping-up the class: Final presentations May 8, 1-5pm, BWRC Final reports due May 7 Final exam, Monday, May 12 3-4:30pm, 241 Cory Presentations 12 minutes (max 10 slides) + 3 minutes for & A 2 1

Agenda Wrap up multipliers Latches and flip-flops 3 Multipliers 2

Generalized Counters Stenzel, Trans on Comp 10/77 5 Generalized Counters 6 3

Generalized Counters 32x32b using (5,5,4) with (3,2) in the last stage 7 4:2 Counters (Compressors) 4-2 carry-save module Weinberger, IBM J. ResDev 1/81 Santoro, Horowitz, JSSC 4/89 8 4

4:2 Compressors Built of CSAs Pipelined version compresses 8 partial products per cycle 9 4:2 Compressors Interconnect can be more regular than in Wallace tree 10 5

Three Dimensional Optimization Oklobdzija, Villeger, Liu, Trans on Comp 3/96 11 Vertical Slices in TDM 12 6

Final Addition 13 Final Addition 14 7

Example: CPL Multiplier Block Diagram Yano, JSSC 4/90 Critical Path 15 Example: DPL Multiplier Ohkubo, JSSC 3/95 16 8

Example: DPL Multiplier Booth encoder Partial product generator 17 Example: DPL Multiplier FA-based 4:2 Modified 4:2 18 9

Latches and flip-flops flops Latches: Reading Rabaey et al, Chapters 7 and 10 Chapter 10 in Chandrakasan et al, by Partovi Stojanovic, Oklobdzija, JSSC 4/99 20 10

Latch vs. Flip-Flop Latch stores data when clock is low Flip-Flop (register) stores data when clock rises D D D D 21 Latch vs. Flip-Flop Courtesy of IEEE Press, New York. 2000 22 11

Latch Pair vs. Flip-Flop Performance metrics Delay metrics Delay penalty Clock skew penalty Inclusion of logic Inherent race immunity Power/Energy Metrics Power/energy PDP, EDP Design robustness 23 Latches Transmission-Gate Latch C 2 MOS Latch D D 24 12

Latches Courtesy of IEEE Press, New York. 2000 25 Latch Pair as a Flip-Flop 26 13

Requirements for the Flip-Flop Design High speed of operation: Small -Output delay Small setup time Small hold time Inherent race immunity Low power Small clock load High driving capability Integration of logic into flip-flop Multiplexed or clock scan Robustness 27 Sources of Noise Courtesy of IEEE Press, New York. 2000 28 14

Gate Isolation Courtesy of IEEE Press, New York. 2000 29 Flip-Flop Robustness Robustness of the storage node Input isolation Data stored statically, max resistance limit Min capacitance limit Preventing storage node exposure 30 15

Types of Flip-Flops Latch Pair (Master-Slave) Pulse-Triggered Latch Data L1 L2 L Data D D D 31 Flip-Flop Delay Sum of setup time and -output delay is the true measure of the performance with respect to the system speed T = T - + T Logic + T setup (ignoring skew) D Logic D N T - T Logic T Setup 32 16

Delay vs. Setup/Hold Times 350 Minimum Data-Output 300 250 -Output [ps] Setup 200 150 100 Hold 50 0-200 -150-100 -50 0 50 100 150 200 Data- [ps] 33 Master-Slave Latch Pairs Positive setup times Two clock phases:» distributed globally» generated locally Small penalty in delay for incorporating MUX Some circuit tricks needed to reduce the overall delay 34 17

Master-Slave Latch Pairs Case 1: PowerPC 603 (Gerosa, JSSC 12/94) Vdd Vdd D b b 35 T-G Master-Slave Latch Feedback added for static operation Unbuffered Ubff d input input capacitance depends on the phase of the clock over-shoot and under-shoot with long routes wirelength must be restricted at the input Clock load is high Low power Small clk-output delay, but positive setup 36 18

Master-Slave Latches Case 2: C 2 MOS Vdd Vdd D Ck Ckb Ckb Ck Vdd Vdd Vdd Vdd Ck Ckb Vdd Ck Vdd Feedback added for static operation Locally generated clock Poor driving capability Ck Ckb 37 Pulse-Triggered Latches First stage is a pulse generator generates a pulse (glitch) on a rising edge of the clock Second stage is a latch captures the pulse generated in the first stage Pulse generation results in a negative setup time Frequently exhibit a soft edge property Note: power is always consumed in the pulse generator 38 19

Pulsed Latch Simple pulsed latch Kozu, ISSCC 96 39 Intel/HP Itanium 2 Naffziger, ISSCC 02 40 20

Pulse-Triggered Latches Hybrid Latch Flip-Flop, AMD K-6 Partovi, ISSCC 96 Vdd D 41 HLFF Operation 1-0 and 0-1 transitions at the input with 0ps setup time 42 21

Hybrid Latch Flip-Flop Skew absorption Partovi et al, ISSCC 96 43 Pulse-Triggered Latches AMD K-7 Courtesy of IEEE Press, New York. 2000 44 22

Pulse-Triggered Latches Semi-Dynamic Flip-Flop (SDFF), Sun UltraSparc III, Klass, VLSI Circuits 98 Vdd Vdd D Pulse generator is dynamic, cross-coupled latch is added for robustness. Loses soft edge on rising transition Latch has one transistor less in stack - faster than HLFF, but 1-1 glitch exists Small penalty for adding logic 45 Pulse-Triggered Latches 7474, from early 1960 s S R D 46 23

Pulse-Triggered Latches Case 4: Sense-amplifier-based flip-flop, Matsui 1992. DEC Alpha 21264, StrongARM 110 First stage is a sense amplifier, precharged to high, when = 0 After rising edge of the clock sense amplifier generates the pulse on S or R The pulse is captured in S-R latch Cross-coupled NAND has different propagation delays of rising and falling edges 47 Sense Amplifier-Based Flip-Flop Courtesy of IEEE Press, New York. 2000 48 24

Sampling Window Comparison Naffziger, JSSC 11/02 49 Local Clock Gating D CKI 0.85 0.85 D I 05 0.5 0.85 0.5 0.5 CKIB CKIB 0.5 2 1.2 0.5 Data-Transition Look-Ahead 0.85 0.5 0.85 0.5 Pulse Generator XNOR CP 0.85 0.5 CKIB CKI Clock on demand Flip-flop 50 25

Next Lecture Timing 51 26