EE241 - Spring 2008 Advanced Digital Integrated Circuits Lecture 26: Multipliers Latches Announcements Homework 5 Due today Wrapping-up the class: Final presentations May 8, 1-5pm, BWRC Final reports due May 7 Final exam, Monday, May 12 3-4:30pm, 241 Cory Presentations 12 minutes (max 10 slides) + 3 minutes for & A 2 1
Agenda Wrap up multipliers Latches and flip-flops 3 Multipliers 2
Generalized Counters Stenzel, Trans on Comp 10/77 5 Generalized Counters 6 3
Generalized Counters 32x32b using (5,5,4) with (3,2) in the last stage 7 4:2 Counters (Compressors) 4-2 carry-save module Weinberger, IBM J. ResDev 1/81 Santoro, Horowitz, JSSC 4/89 8 4
4:2 Compressors Built of CSAs Pipelined version compresses 8 partial products per cycle 9 4:2 Compressors Interconnect can be more regular than in Wallace tree 10 5
Three Dimensional Optimization Oklobdzija, Villeger, Liu, Trans on Comp 3/96 11 Vertical Slices in TDM 12 6
Final Addition 13 Final Addition 14 7
Example: CPL Multiplier Block Diagram Yano, JSSC 4/90 Critical Path 15 Example: DPL Multiplier Ohkubo, JSSC 3/95 16 8
Example: DPL Multiplier Booth encoder Partial product generator 17 Example: DPL Multiplier FA-based 4:2 Modified 4:2 18 9
Latches and flip-flops flops Latches: Reading Rabaey et al, Chapters 7 and 10 Chapter 10 in Chandrakasan et al, by Partovi Stojanovic, Oklobdzija, JSSC 4/99 20 10
Latch vs. Flip-Flop Latch stores data when clock is low Flip-Flop (register) stores data when clock rises D D D D 21 Latch vs. Flip-Flop Courtesy of IEEE Press, New York. 2000 22 11
Latch Pair vs. Flip-Flop Performance metrics Delay metrics Delay penalty Clock skew penalty Inclusion of logic Inherent race immunity Power/Energy Metrics Power/energy PDP, EDP Design robustness 23 Latches Transmission-Gate Latch C 2 MOS Latch D D 24 12
Latches Courtesy of IEEE Press, New York. 2000 25 Latch Pair as a Flip-Flop 26 13
Requirements for the Flip-Flop Design High speed of operation: Small -Output delay Small setup time Small hold time Inherent race immunity Low power Small clock load High driving capability Integration of logic into flip-flop Multiplexed or clock scan Robustness 27 Sources of Noise Courtesy of IEEE Press, New York. 2000 28 14
Gate Isolation Courtesy of IEEE Press, New York. 2000 29 Flip-Flop Robustness Robustness of the storage node Input isolation Data stored statically, max resistance limit Min capacitance limit Preventing storage node exposure 30 15
Types of Flip-Flops Latch Pair (Master-Slave) Pulse-Triggered Latch Data L1 L2 L Data D D D 31 Flip-Flop Delay Sum of setup time and -output delay is the true measure of the performance with respect to the system speed T = T - + T Logic + T setup (ignoring skew) D Logic D N T - T Logic T Setup 32 16
Delay vs. Setup/Hold Times 350 Minimum Data-Output 300 250 -Output [ps] Setup 200 150 100 Hold 50 0-200 -150-100 -50 0 50 100 150 200 Data- [ps] 33 Master-Slave Latch Pairs Positive setup times Two clock phases:» distributed globally» generated locally Small penalty in delay for incorporating MUX Some circuit tricks needed to reduce the overall delay 34 17
Master-Slave Latch Pairs Case 1: PowerPC 603 (Gerosa, JSSC 12/94) Vdd Vdd D b b 35 T-G Master-Slave Latch Feedback added for static operation Unbuffered Ubff d input input capacitance depends on the phase of the clock over-shoot and under-shoot with long routes wirelength must be restricted at the input Clock load is high Low power Small clk-output delay, but positive setup 36 18
Master-Slave Latches Case 2: C 2 MOS Vdd Vdd D Ck Ckb Ckb Ck Vdd Vdd Vdd Vdd Ck Ckb Vdd Ck Vdd Feedback added for static operation Locally generated clock Poor driving capability Ck Ckb 37 Pulse-Triggered Latches First stage is a pulse generator generates a pulse (glitch) on a rising edge of the clock Second stage is a latch captures the pulse generated in the first stage Pulse generation results in a negative setup time Frequently exhibit a soft edge property Note: power is always consumed in the pulse generator 38 19
Pulsed Latch Simple pulsed latch Kozu, ISSCC 96 39 Intel/HP Itanium 2 Naffziger, ISSCC 02 40 20
Pulse-Triggered Latches Hybrid Latch Flip-Flop, AMD K-6 Partovi, ISSCC 96 Vdd D 41 HLFF Operation 1-0 and 0-1 transitions at the input with 0ps setup time 42 21
Hybrid Latch Flip-Flop Skew absorption Partovi et al, ISSCC 96 43 Pulse-Triggered Latches AMD K-7 Courtesy of IEEE Press, New York. 2000 44 22
Pulse-Triggered Latches Semi-Dynamic Flip-Flop (SDFF), Sun UltraSparc III, Klass, VLSI Circuits 98 Vdd Vdd D Pulse generator is dynamic, cross-coupled latch is added for robustness. Loses soft edge on rising transition Latch has one transistor less in stack - faster than HLFF, but 1-1 glitch exists Small penalty for adding logic 45 Pulse-Triggered Latches 7474, from early 1960 s S R D 46 23
Pulse-Triggered Latches Case 4: Sense-amplifier-based flip-flop, Matsui 1992. DEC Alpha 21264, StrongARM 110 First stage is a sense amplifier, precharged to high, when = 0 After rising edge of the clock sense amplifier generates the pulse on S or R The pulse is captured in S-R latch Cross-coupled NAND has different propagation delays of rising and falling edges 47 Sense Amplifier-Based Flip-Flop Courtesy of IEEE Press, New York. 2000 48 24
Sampling Window Comparison Naffziger, JSSC 11/02 49 Local Clock Gating D CKI 0.85 0.85 D I 05 0.5 0.85 0.5 0.5 CKIB CKIB 0.5 2 1.2 0.5 Data-Transition Look-Ahead 0.85 0.5 0.85 0.5 Pulse Generator XNOR CP 0.85 0.5 CKIB CKI Clock on demand Flip-flop 50 25
Next Lecture Timing 51 26