EE241 - Spring 2005 Advanced Digital Integrated Circuits

Similar documents
Clock Generation and Distribution for High-Performance Processors

Timing EECS141 EE141. EE141-Fall 2011 Digital Integrated Circuits. Pipelining. Administrative Stuff. Last Lecture. Latch-Based Clocking.

EITF35: Introduction to Structured VLSI Design

Clocking Spring /18/05

Lecture 26: Multipliers. Final presentations May 8, 1-5pm, BWRC Final reports due May 7 Final exam, Monday, May :30pm, 241 Cory

Digital System Clocking: High-Performance and Low-Power Aspects

EE178 Lecture Module 4. Eric Crabill SJSU / Xilinx Fall 2005

11. Sequential Elements

EE178 Spring 2018 Lecture Module 5. Eric Crabill

EE241 - Spring 2007 Advanced Digital Integrated Circuits. Announcements

EECS150 - Digital Design Lecture 18 - Circuit Timing (2) In General...

Lecture 6. Clocked Elements

Asynchronous inputs. 9 - Metastability and Clock Recovery. A simple synchronizer. Only one synchronizer per input

EE273 Lecture 11 Pipelined Timing Closed-Loop Timing November 2, Today s Assignment

EE141-Fall 2010 Digital Integrated Circuits. Announcements. Homework #8 due next Tuesday. Project Phase 3 plan due this Sat.

VARIABLE FREQUENCY CLOCKING HARDWARE

Chapter 6. Flip-Flops and Simple Flip-Flop Applications

EECS150 - Digital Design Lecture 10 - Interfacing. Recap and Topics

Lecture 21: Sequential Circuits. Review: Timing Definitions

Clock - key to synchronous systems. Lecture 7. Clocking Strategies in VLSI Systems. Latch vs Flip-Flop. Clock for timing synchronization

Clock - key to synchronous systems. Topic 7. Clocking Strategies in VLSI Systems. Latch vs Flip-Flop. Clock for timing synchronization

ECEN620: Network Theory Broadband Circuit Design Fall 2014

A MISSILE INSTRUMENTATION ENCODER

More on Flip-Flops Digital Design and Computer Architecture: ARM Edition 2015 Chapter 3 <98> 98

EE141-Fall 2010 Digital Integrated Circuits. Announcements. Synchronous Timing. Latch Parameters. Class Material. Homework #8 due next Tuesday

INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET)

Outline. EECS150 - Digital Design Lecture 27 - Asynchronous Sequential Circuits. Cross-coupled NOR gates. Asynchronous State Transition Diagram

Dual Link DVI Receiver Implementation

Logic Devices for Interfacing, The 8085 MPU Lecture 4

A low jitter clock and data recovery with a single edge sensing Bang-Bang PD

ISSCC 2003 / SESSION 19 / PROCESSOR BUILDING BLOCKS / PAPER 19.5

CSE115: Digital Design Lecture 23: Latches & Flip-Flops

ISSCC 2006 / SESSION 18 / CLOCK AND DATA RECOVERY / 18.6

A Symmetric Differential Clock Generator for Bit-Serial Hardware

Introduction to Sequential Circuits

IT T35 Digital system desigm y - ii /s - iii

An FPGA Implementation of Shift Register Using Pulsed Latches

Asynchronous IC Interconnect Network Design and Implementation Using a Standard ASIC Flow

Exercises. 162 CHAPTER THREE Sequential Logic Design

Efficient 500 MHz Digital Phase Locked Loop Implementation sin 180nm CMOS Technology

DEPARTMENT OF ELECTRICAL &ELECTRONICS ENGINEERING DIGITAL DESIGN

Digital System Clocking: High-Performance and Low-Power Aspects. Microprocessor Examples

EE-382M VLSI II FLIP-FLOPS

Digital Logic & Computer Design CS Professor Dan Moldovan Spring Chapter 3 :: Sequential Logic Design

A FOUR GAIN READOUT INTEGRATED CIRCUIT : FRIC 96_1

ESE 570 STATIC SEQUENTIAL CMOS LOGIC CELLS. Kenneth R. Laker, University of Pennsylvania, updated 25Mar15

Asynchronous (Ripple) Counters

HIGH SPEED CLOCK DISTRIBUTION NETWORK USING CURRENT MODE DOUBLE EDGE TRIGGERED FLIP FLOP WITH ENABLE

Figure 30.1a Timing diagram of the divide by 60 minutes/seconds counter

Chapter 5 Flip-Flops and Related Devices

psasic Timing Generator

Clock Domain Crossing. Presented by Abramov B. 1

Administrative issues. Sequential logic

Synchronization in Asynchronously Communicating Digital Systems

SYNCHRONOUS DERIVED CLOCK AND SYNTHESIS OF LOW POWER SEQUENTIAL CIRCUITS *

DEDICATED TO EMBEDDED SOLUTIONS

EE 447/547 VLSI Design. Lecture 9: Sequential Circuits. VLSI Design EE 447/547 Sequential circuits 1

Topic 8. Sequential Circuits 1

L4: Sequential Building Blocks (Flip-flops, Latches and Registers)

Chapter 3 Unit Combinational

data and is used in digital networks and storage devices. CRC s are easy to implement in binary

ECEN454 Digital Integrated Circuit Design. Sequential Circuits. Sequencing. Output depends on current inputs

Lecture 11: Sequential Circuit Design

Combinational vs Sequential

Logic Design. Flip Flops, Registers and Counters

Flip-Flops and Sequential Circuit Design

Energy Recovery Clocking Scheme and Flip-Flops for Ultra Low-Energy Applications

EE273 Lecture 14 Synchronizer Design November 11, Today s Assignment

The outputs are formed by a combinational logic function of the inputs to the circuit or the values stored in the flip-flops (or both).

Modeling and designing of Sense Amplifier based Flip-Flop using Cadence tool at 45nm

VLSI Clock Domain Crossing

EE273 Lecture 15 Synchronizer Design

Design of High Speed Phase Frequency Detector in 0.18 μm CMOS Process for PLL Application

LOW POWER DIGITAL EQUALIZATION FOR HIGH SPEED SERDES. Masum Hossain University of Alberta

Homework 3 posted this week, due after Spring break Quiz #2 today Midterm project report due on Wednesday No office hour today

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2011

SDA 3302 Family. GHz PLL with I 2 C Bus and Four Chip Addresses

Lecture 13: Clock and Synchronization. TIE Logic Synthesis Arto Perttula Tampere University of Technology Spring 2017

CHAPTER 6 ASYNCHRONOUS QUASI DELAY INSENSITIVE TEMPLATES (QDI) BASED VITERBI DECODER

EECS150 - Digital Design Lecture 3 Synchronous Digital Systems Review. Announcements

Digital Logic Design Sequential Circuits. Dr. Basem ElHalawany

Counters

Lecture 10: Sequential Circuits

Metastability Analysis of Synchronizer

AD9884A Evaluation Kit Documentation

Universal Asynchronous Receiver- Transmitter (UART)

Chapter 4: One-Shots, Counters, and Clocks

Registers and Counters

Chapter 7 Sequential Circuits


A NOVEL APPROACH TO ACHIEVE HIGH SPEED LOW-POWER HYBRID FLIP-FLOP

MC9211 Computer Organization

Texas Instruments TNETE2201 Ethernet Transceiver Circuit Analysis

(CSC-3501) Lecture 7 (07 Feb 2008) Seung-Jong Park (Jay) CSC S.J. Park. Announcement

Sequencing. Lan-Da Van ( 范倫達 ), Ph. D. Department of Computer Science National Chiao Tung University Taiwan, R.O.C. Fall,

Introduction. NAND Gate Latch. Digital Logic Design 1 FLIP-FLOP. Digital Logic Design 1

Advanced Devices. Registers Counters Multiplexers Decoders Adders. CSC258 Lecture Slides Steve Engels, 2006 Slide 1 of 20

2.6 Reset Design Strategy

CAD for VLSI Design - I Lecture 38. V. Kamakoti and Shankar Balachandran

Sequential Circuit Design: Part 1

Transcription:

EE241 - Spring 2005 Advanced Digital Integrated Circuits Lecture 21: Asynchronous Design Synchronization Clock Distribution Self-Timed Pipelined Datapath Req Ack HS Req Ack HS Req Ack HS Req Ack Start Done Start Done Start Done In R1 F1 R2 F2 R3 F3 Out t pf1 t pf2 t pf3 2 1

Hand-Shaking Protocol Req Ack Req 2 SENDER Data RECEIVER Ack 3 (a) Sender-receiver configuration Data 1 1 Two Phase Handshake cycle 1 cycle 2 Sender s action Receiver s action (b) Timing diagram 3 Event Logic The Muller-C Element A B C F A B F n+1 0 0 1 1 0 1 0 1 0 F n F n 1 (a) Schematic (b) Truth table V DD V DD V DD A B S R Q F A B B B F A F (a) Logic A B B (b) Majority Function (c) Dynamic 4 2

2-Phase Handshake Protocol Sender logic Data ready Data Receiver logic Data accepted C Req Ack Handshake logic Advantage : FAST - minimal # of signaling events (important for global interconnect) Disadvantage : edge - sensitive, has state 5 Example: Self-timed FIFO In R1 R2 R3 Out En Done Req i C C C Req 0 Ack i Ack o All 1s or 0s -> pipeline empty Alternating 1s and 0s -> pipeline full 6 3

2-Phase Protocol 7 Example From [Horowitz] 8 4

Example 9 Example 10 5

Example 11 4-Phase Handshake Protocol Req 2 4 Sender s action Receiver s action Ack 3 5 Data 1 1 Cycle 1 Cycle 2 Also known as RTZ Slower, but unambiguous 12 6

4-Phase Handshake Protocol Implementation using Muller-C elements Sender logic Data Receiver logic Data ready Data accepted C C S Req Ack Handshake logic 13 Self-Resetting Logic Precharged Logic Block (L1) completion detection (L1) Precharged Logic Block (L2) completion detection (L2) Precharged Logic Block (L3) completion detection (L3) V DD int out Post-charge logic A B C 14 7

Asynchronous-Synchronous Interface Asynchronous system f in Synchronous system f CLK Synchronization 15 Synchronizers and Arbiters Arbiter: Circuit to decide which of 2 events occurred first Synchronizer: Arbiter with clock φ as one of the inputs Problem: Circuit HAS to make a decision in limited time - which decision is not important Caveat: It is impossible to ensure correct operation But, we can decrease the error probability at the expense of delay 16 8

A Simple Synchronizer CLK D int I 1 Q CLK I 2 Data sampled on rising edge of the clock Latch will eventually resolve the signal value, but... this might take infinite time! 17 Synchronizer: Output Trajectories 2.0 V out 1.0 0.0 0 100 200 300 time [ps] Single-pole model for a flip-flop 18 9

Mean Time to Failure 19 Example T f = 10 nsec = T T signal = 50 nsec t r = 1 nsec t = 310 psec V IH - V IL = 1 V (V DD = 5 V) N(T) = 3.9 10-9 errors/sec MTF (T) = 2.6 10 8 sec = 8.3 years MTF (0) = 2.5 µsec 20 10

Influence of Noise p(v) Uniform distribution around VM T logarithmic reduction 0 V IL V IH Initial Distribution Still Uniform Low amplitude noise does not influence synchronization behavior 21 Typical Synchronizers 2 phase clocking circuit Q φ2 φ1 Q φ2 φ1 Using delay line 22 11

Cascaded Synchronizers Reduce MTF In O 1 O 2 Out Sync Sync Sync φ 23 Arbiters Req1 Req2 Arbiter Ack1 Ack2 Req1 A B Ack2 (a) Schematic symbol Req2 Ack1 Req1 (b) Implementation Req2 V A T gap B metastable Ack1 t (c) Timing diagram 24 12

PLL-Based Synchronization Digital System Chip 1 Data Chip 2 Digital System f system = N x f crystal PLL Divider reference clock PLL Clock Buffer Crystal Oscillator f crystal, 200<Mhz 25 Clock Distribution Goal: Minimization of uncertainty Clock skew (spatial uncertainty) Systematic Clock jitter (temporal uncertainty) Random cycle-to-cycle changes 26 13

Reading Chapter 13, (Chandrakasan et al), Clock Distribution by Bailey Chapter 12, (Chandrakasan et al), PLLs and DLLs by Maneatis Chapter 10, Rabaey et al. 27 Clock Distribution Tree Common, e.g. IBM S/390 Clock grid» DEC Alpha Length-matched Serpentines» Intel P6 28 14

Clock Distribution CLOCK Example: PowerPC 603 Gerosa, JSSC 12/94 H-Tree Network Observe: Only Relative Skew is Important 29 Clock Network with Distributed Buffering Local Area Module Module secondary clock drivers Module Module Module Module main clock driver CLOCK Reduces absolute delay, and makes Power-Down easier Sensitive to variations in Buffer Delay 30 15

Predriver Binary tree H - tree X - tree Arbitrary matched tree 31 Example IBM S/390 Clock skew Webb, JSSC 11/97 32 16

Clock Tree Delays Restle, VLSI 98 33 Impact of clock network sizing 34 17

Impact of clock network sizing 35 Final Stage: Tree vs. Grid RC-matched Tree Grid Courtesy of IEEE Press, New York. 2000 36 18

IR Emission Images Central buffer Clock repeaters Sector buffers Local clocks Sanda, ISSCC 99 37 Example: DEC Alpha 21164 Clock Drivers 38 19

Clock Skew in Alpha Processor 39 DEC Alpha Evolution Clock driver placements 21064 21164 21164 Gronowski, JSSC 5/98 40 20

Clock Skews 21064 21164 21264 41 Hybrid Grid DEC Alpha 21264 Bailey JSSC 11/98 42 21

Alpha 21264 43 Alpha 21264 Grids Global clock Major clock grids 44 22

Data-Dependent Gate Loading 45 Multi-GHz Clock Networks Phillip Restle, IBM Research IEEE SSCTC Workshop on Design for Multi-GigaHertz Processors, San Fransico, Feb. 7, 2000 http://www.research.ibm.com/people/r/restle/mghz.html http://www.research.ibm.com/people/r/restle/animations/dac01top.html 46 23

Clock Generation Delay-Locked Loop (Delay Line Based) f REF Phase Det U D Charge Pump Filter DL f O Phase-Locked Loop (VCO-Based) f REF U N PD D CP VCO Filter f O 47 Phase-Locked Loop Based Clock Generator Up Down Reference clock Phase detector Up Charge pump Loop filter V contr VCO Local clock Down Clock decode & buffer Divide by N φ 1 φ 2... Acts also as Clock Multiplier 48 24

Loop Components Phase Comparator Produces UP/DN pulses corresponding to phase difference Charge Pump Sources/sinks current for duration of UP/DN pulses Loop Filter Integrates current to produce control voltage Voltage-Controlled Delay Line Changes delay proportionally to voltage Voltage-Controlled Oscillator Generates frequency proportional to control voltage 49 PLL Jitter 50 25

DLL Locking Courtesy of IEEE Press, New York. 2000 51 Clock Deskewing Two clock spines, two DLLs, and a PD that controls them Geannopoulos, ISSCC 98 52 26

Clock Ring Clocks routed in parallel, opposite directions LCG aligns to the middle Shibayama, ISSCC 98 53 Synchronous Distributed Oscillators VCOs # of nearest neighbors Mizuno, ISSCC 98 54 27

Distributed PLLs Gutnik, ISSCC 2000 55 Intel Itanium TM Rusu, ISSCC 2000 56 28

Intel Itanium TM 57 29