TAU 2013 Variation Aware Timing Analysis Contest

Similar documents
Soft Error Derating Computation in Sequential Circuits

GRABLINKTM. FullTM. - DualBaseTM. - BaseTM. GRABLINK Full TM. GRABLINK DualBase TM. GRABLINK Base TM

Chapter 1: Introduction

Panel-mounted Thermostats

Safety Relay Unit G9SB

Application Support. Product Information. Omron STI. Support Engineers are available at our USA headquarters from

Chapter 5. Synchronous Sequential Logic. Outlines

Safety Relay Unit G9SB

Chapter 3: Sequential Logic Design -- Controllers

WE SERIES DIRECTIONAL CONTROL VALVES

Successful Transfer of 12V phemt Technology. Taiwan 333, ext 1557 TRANSFER MASK

ECE 274 Digital Logic. Digital Design. Datapath Components Registers. Datapath Components Register with Parallel Load

Reverse Iterative Deepening for Finite-Horizon MDPs with Large Branching Factors

arxiv: v2 [cs.sd] 13 Dec 2016

Explosion protected add-on thermostat

Engineer To Engineer Note

Have they bunched yet? An exploratory study of the impacts of bus bunching on dwell and running times.

ECE 274 Digital Logic. Digital Design. Sequential Logic Design Controller Design: Laser Timer Example

A Proposed Keystream Generator Based on LFSRs. Adel M. Salman Baghdad College for Economics Sciences

ViaLiteHD RF Fibre Optic Link

Sequencer devices. Philips Semiconductors Programmable Logic Devices

CPE 200L LABORATORY 2: DIGITAL LOGIC CIRCUITS BREADBOARD IMPLEMENTATION UNIVERSITY OF NEVADA, LAS VEGAS GOALS:

Novel Pulsed-Latch Replacement Based on Time Borrowing and Spiral Clustering

Mapping Arbitrary Logic Functions into Synchronous Embedded Memories For Area Reduction on FPGAs

ARCHITECTURAL CONSIDERATION OF TOPS-DSP FOR VIDEO PROCESSING. Takao Nishitani. Tokyo Metropolitan University

style type="text/css".wpb_animate_when_almost_visible { opacity: 1; }/style

Outline. Circuits & Layout. CMOS VLSI Design

ViaLite SatComs Fibre Optic Link

The Official IDENTITY SYSTEM. A Manual Concerning Graphic Standards and Proper Implementation. As developed and established by the

Train times. Monday to Sunday. Stoke-on-Trent. Crewe

DIGITAL EFFECTS MODULE OWNER'S MANUAL

VOCAL MUSIC I * * K-5. Red Oak Community School District Vocal Music Education. Vocal Music Program Standards and Benchmarks

Lecture 3: Circuits & Layout

Before Reading. Introduce Everyday Words. Use the following steps to introduce students to Nature Walk.

Generating lyrics with the variational autoencoder and multi-modal artist embeddings

Standards Overview (updated 7/31/17) English III Louisiana Student Standards by Collection Assessed on. Teach in Collection(s)

92.507/1. EYR 203, 207: novaflex universal controller. Sauter Systems

Standard Databases for Recognition of Handwritten Digits, Numerical Strings, Legal Amounts, Letters and Dates in Farsi Language

A New Concept of Providing Telemetry Data in Real Time

INPUT CAPTURE WITH ST62 16-BIT AUTO-RELOAD TIMER

LCD Data Projector VPL-S500U/S500E/S500M

Evaluation of the Suitability of Acoustic Characteristics of Electronic Demung to the Original Demung

Automatic Repositioning Technique for Digital Cell Based Window Comparators and Implementation within Mixed-Signal DfT Schemes

Predicted Movie Rankings: Mixture of Multinomials with Features CS229 Project Final Report 12/14/2006

A Fast Approach for Static Timing Analysis Covering All PVT Corners Sari Onaissi

CSCB58 - Lab 4. Prelab /3 Part I (in-lab) /1 Part II (in-lab) /1 Part III (in-lab) /2 TOTAL /8

4.25-GBPS CABLE AND PC BOARD EQUALIZER

Minstruments for Analog Audio Signals

Digital Integrated Circuits Lecture 19: Design for Testability

EL302 DIGITAL INTEGRATED CIRCUITS LAB #3 CMOS EDGE TRIGGERED D FLIP-FLOP. Due İLKER KALYONCU, 10043

1. Connect the wall transformer to the mating connector on the Companion. Plug the transformer into a power outlet.

Homework 3 posted this week, due after Spring break Quiz #2 today Midterm project report due on Wednesday No office hour today

Applications to Transistors

MILWAUKEE ELECTRONICS NEWS

DRAFT. Vocal Music AOS 2 WB 3. Purcell: Music for a While. Section A: Musical contexts. How is this mood achieved through the following?

High Performance Low Swing Clock Tree Synthesis with Custom D Flip-Flop Design

lookbook Higher Education

UNIT IV CMOS TESTING. EC2354_Unit IV 1

University College of Engineering, JNTUK, Kakinada, India Member of Technical Staff, Seerakademi, Hyderabad

Static Timing Analysis for Nanometer Designs

Politecnico di Torino. Porto Institutional Repository

Phosphor: Explaining Transitions in the User Interface Using Afterglow Effects

TAP 413-1: Deflecting electron beams in a magnetic field

Homework 1. Homework 1: Measure T CK-Q delay

THE SOLAR NEIGHBORHOOD. XV. DISCOVERY OF NEW HIGH PROPER MOTION STARS WITH 0B4 yr 1 BETWEEN DECLINATIONS 47 AND 00

NORTHWESTERN UNIVERSITY TECHNOLOGICAL INSTITUTE

DS /211 ED SOLENOID OPERATED DIRECTIONAL CONTROL VALVE. Q max 150 l/min SERIES 12 SUBPLATE MOUNTING ISO (CETOP 05) p max 320 bar

Structural and functional asymmetry of lateral Heschl s gyrus reflects pitch perception preference

PRACTICE FINAL EXAM T T. Music Theory II (MUT 1112) w. Name: Instructor:

Contents. English. English. Your remote control 2

Logic processing Catalogue PDE2619TCUK July 2009

Pro Series White Toner and Neon Range

Unit V Design for Testability

CPSC 121: Models of Computation Lab #2: Building Circuits

Good Evening! Welcome!

Power Optimization of Linear Feedback Shift Register (LFSR) using Power Gating

Complete technical information, including CAD and PDF drawings, is available at:

EE241 - Spring 2013 Advanced Digital Integrated Circuits. Announcements. Lecture 14: Statistical timing Latches

Binaural and temporal integration of the loudness of tones and noises

Clock Tree Power Optimization of Three Dimensional VLSI System with Network

Interactions of Folk Melody and Transformational (Dis)continuities in Chen Yi s Ba Ban

Chapter 2 Social Indicators Research and Health-Related Quality of Life Research

CAN THO UNIVERSITY JOURNAL OF SCIENCE INSTRUCTIONS FOR AUTHORS

COPY RIGHT. To Secure Your Paper As Per UGC Guidelines We Are Providing A Electronic Bar Code

EN2911X: Reconfigurable Computing Topic 01: Programmable Logic. Prof. Sherief Reda School of Engineering, Brown University Fall 2014

Lecture 23 Design for Testability (DFT): Full-Scan

lookbook Transportation - Airports

Power-Driven Flip-Flop p Merging and Relocation. Shao-Huan Wang Yu-Yi Liang Tien-Yu Kuo Wai-Kei Tsing Hua University

Corporate Logo Guidelines

LOGICAL FOUNDATION OF MUSIC

Retiming Sequential Circuits for Low Power

CPS 505 HIGH VOLTAGE PROBE INSTRUCTION MANUAL

Lecture 17: Introduction to Design For Testability (DFT) & Manufacturing Test

Association of blood lipids with Alzheimer s disease: A comprehensive lipidomics analysis

Flip-flop Clustering by Weighted K-means Algorithm

The main website for the Royal and McPherson Theatres Society is: Section view of the stage and auditorium

Reproducible music for 3, 4 or 5 octaves handbells or handchimes. by Tammy Waldrop. Contents. Performance Suggestions... 3

Electrospray Ionization Ion MoMlity Spectrometry

Outline. Annual Sales. A Brief History. Transistor Types. Invention of the Transistor. Lecture 1: Circuits & Layout. Introduction to CMOS VLSI Design

Faculty of Electrical & Electronics Engineering BEE3233 Electronics System Design. Laboratory 3: Finite State Machine (FSM)

Transcription:

TAU 2013 Vrition Awre Timing Anlysis Contest Debjit Sinh 1, Luís Guerr e Silv 2, Ji Wng 3, Shesh Rghunthn 4, Dileep Netrbile 5, nd Ahmed Shebit 6 1;5 IBM Systems nd Technology Group, 1 Hopewell Junction/ 5 Essex Junction, USA 2 INESC-ID / IST - TU Lisbon, Portugl 3 Illinois Institute of Technology, Chicgo, USA 4 IBM Systems nd Technology Group, Bnglore, Indi 6 Synopsys, Sunnyvle, USA TAU/ISPD joint session, Stteline, NV Mrch 26, 2013 TAU Sponsors: 1

Vrition wre timing Timing nlysis key component of chip design closure flow Pre/post route optimiztion, timing sign-off Incresing significnce of vribility Performnce BC WC Too smll performnce improvement Mrgin for vribility Gurnteed performnce Technology genertion Vribility wre timing nlysis essentil Growing chip sizes, complexity: Impcts timing nlysis run-time Trde-offs between modeling ccurcy/complexity nd run-time 2

TAU 2013 vrition wre timing nlysis contest Gol: Seek novel ides for fst, vrition wre timing nlysis by mens of the following Increse wreness of vrition wre timing nlysis, provide insight into some chllenging spects Encourge novel prlleliztion techniques (including multi-threding) Fcilitte cretion of publicly vilble vrition wre timing nlysis frmework nd benchmrks for reserch/future contests Trde-offs for timing model complexity Wnted focus on vrition wre timing, understnding chllenges for vrition wre timing, tool performnce Feedbck from prior contest committee: Tems spend too much time on infrstructure (e.g., prsers, fixing librry/benchmrk file bugs) Chose to expnd on single corner timing nlysis contest from 2011 3

Timing nlysis contest rchitecture Pre-processing Provided to contestnts Contest objective Multiple benchmrks Benchmrk file(s) (internl formt ++ ) Design Multiple.lib (different PVT * conditions) Plce nd Route (Cdence tool) Design Assertions Prsitics Librry file (internl formt ++ ) Vrition wre timing nlysis tool development Timing report Output file (internl formt ++ ).lib Mnully sserted prmeter sensitivity (rndom, metl) Prmeterized gte librry Dely, Slew, Test mrgin (gurd time) Monte Crlo bsed vrition wre timer Finl Evlution Golden timing report * PVT Process, Voltge, Temperture ++ Formt identicl/extension to/of PATMOS 11 contest 4

Sources of vribility (Prmeters) Six globl (inter-chip) sources of vribility Environmentl: Voltge (V), Temperture (T) Front end of line process: Chnnel length (L), Device width (W), Voltge threshold (H) Bck end of line process: Metl (M) All metl lyers ssumed perfectly correlted Rndom vribility (R) Intr-chip (cross chip) systemtic vribility ignored M Hne et l., SISPAD 2003 www.emerldinsight.com 5

Vribility modeling Prmetric liner model * µ + v V + t T + L + W H Ech prmeter ( V, T,, R) ssumed s unit norml Gussin Ech sensitivity ( v, t,, r ) denotes first-order per-sigm sensitivity Prmeter my vry between [-3, 3] sigms l w + h + m M + r R Encourged novel vribility wre timing nlysis techniques Sttisticl timing [Fewer runs, pessimism relief for rndom vribility, modeling inccurcies/simplifictions] Multi-corner timing [Less complexity, fster nlysis nd potentilly more ccurte t ech corner, lrge number of corners, pessimistic for rndom vribility] Monte Crlo bsed timing [Less complexity, most ccurte, very long run-times] Golden timer s pproch: Used for ccurcy evlutions only Hybrid/novel pproch for vribility wre timing [?] * non-liner with prmetric slews 6

Interconnect (wire) modeling Interconnects modeled s resistnce cpcitnce (RC) network Single source (port), one or more sinks (tps) No coupling cpcitnces, no grounded resistnces Single corner timing model Elmore dely model First moment vlue of impulse response Single corner dely from port to tp 5: Multi step output slew computtion Combintion of input (port) slew, dely, nd second moment vlue of impulse response Introduces non-linerity [Kshyp et l., TCAD 04] s o 2 i = s + (2β o d 2 o ) Output slew Input slew Elmore dely Second moment vlue of impulse response 7

Interconnect (wire) modeling considering vribility Wire prsitics (RC vlues) function of metl prmeter ( M) Provided sigm (corner) specific scle fctors for prsitics Tp cpcitnce contribution from gte input pin unffected Prmetric input slew Prmetric wire dely nd output-slew First order sensitivity to metl nd other prmeters my be computed vi model-fitting (e.g. finite-differencing) for sttisticl timer Complex prmetric output slew computtion Corner dely t nominl metl corner (0 sigm): First order metl sensitivity: Corner dely t thick metl corner (σ sigm): Tp gte pin cp Prmetric wire dely: Corner specific scle fctors 8

Combintionl gte (cell) modeling Extended liner gte dely/slew model from PATMOS 11 contest to vrition wre model Sensitivities (to prmeters, input slew, lod) provided in gte librry Lumped lod model (no effective Cpcitnce/current source models) Note: Input slew (S i ) nd lod (C L ) re prmetric models 9

Sequentil gte (flip-flop) modeling Test (Setup/hold) mrgin or gurd-times liner function of slews t clock nd dt points Sensitivities (to input slews) provided in gte librry Prmetric slews Prmetric gurd-time 10

Prmetric timing nlysis Trditionl timing nlysis/propgtion Forwrd propgtion of signl rrivl times (t), nd slews Bckwrd propgtion of signl required rrivl times (rt) Slck computtion Nunces Worst slew propgtion (when 2+ signls meet t point) Seprte propgtion for erly, lte modes, nd rise, fll trnsitions Needs mximum, minimum opertions on prmetric quntities Could be expensive for sttisticl timing, inccurcy concerns Single clock domin No coupling, common pth pessimism reduction, loops 11

Projection techniques nd tool output Projection of prmetric vlues: 3 modes required for contest µ + v V + t T + l L + w W + h H + m M + r R MEAN: Nominl vlue (µ) SIGMA_ONLY: Stndrd devition ( ) WORST_CASE: Worst 3 sigm projection of metl prmeter, rndom prmeter, nd ll other prmeters combined together (vi root sum squre) + + + Required tool output Must report projected vlues (bsed on shell vrible $TAU_PROJECTION) Set of lines specifying for ech primry output in design Arrivl times nd slews (for erly/lte/rise/fll combintions) Set of lines specifying for subset of pins in design Slcks 12

Contest timeline Dte Oct 12, 2012 Feb 8, 2013 Feb 20, 2013 Feb 28, 2013 (~4+ months) Mr 27, 2013 Activity Contest nnounced, webpge online (https://sites.google.com/site/tucontest2013) Detiled 22 pge.pdf contest rules document provided Benchmrk suite ver1.0 provided (24 testcses) Vrition wre gte librry provided Interconnect network prser nd viewer utility provided (debug id) Informed tht source code of winning tool from PATMOS 11 contest (thnks to Prof. Chng s tem from NTU, Tiwn) vilble upon request to void infrstructure development (optionlly re-use prsers, etc.) Detiled clcultions for toy benchmrk provided Monte Crlo results for benchmrks ver1.0 provided New vrition wre gte librry provided Updted contest rules document Contestnts requested to provide erly binries of tool for comptibility testing 5 new lrge benchmrks provided (lrgest benchmrk ~88K gtes) Finl tool binries due for evlution Results nnounced 13

Tems 8 tems Chin (1) Greece (1) Indi (2) Singpore (1) Tiwn (2) USA (1) Registered tem Contest committee member 14

Interesting tool chrcteristics Sttisticl, Monte Crlo bsed No multi-corner timers Sttisticl mximum/minimum (mx/min) opertion nunces Used Clrk s [Opertions Reserch 61] moment clcultion pproch, nd Visweswrih et l. pproch [DAC 04] Smrt Compre mens nd do sttisticl mx/min in select cses Consider neglected correltion between signl inputs Better ccurcy of output distribution. Bsed on Nderjh et l. [IEEE TVLSI 08] Prlleliztion Two tems employed pthreds Multi-threded netlist prsing Multi-threded wire pre-processing Circuit leveliztion Multi-threded forwrd propgtion Multi-threded bckwrd propgtion 15

Benchmrks Benchmrks ver 1.0 20 of 36 benchmrks used for finl evlutions ( > 250 gtes) Lrgest test-cse ~88.4K gtes Benchmrks ver 2.0 6 lrge benchmrks not relesed to contestnts erlier All benchmrks now vilble on webpge 16

Evlution metrics Score model for ech benchmrk: A * { 0.5 + 0.3/T + 0.2/M } Accurcy score (A): Eqully weighted sum of verge-rrivl time ccurcy, verge-slew ccurcy, verge-slck ccurcy, worst ccurcy, ccurcy of worst design slck Bsed on WORST-CASE projected mode results only Scoring different for pessimistic versus optimistic result 120 100 Score for pessimistic nswer Score for optimistic nswer 80 Score 60 40 20 0 1 3 5 7 9 11 13 15 17 19 21 1 3 6 10 15 15 20 20 21 Score for pessimistic nswer 100 95 90 80 70 70 50 0 0 Score for optimistic nswer 100 90 80 70 50 0 0 0 0 % ccurcy (reltive to cycle time) 17

Evlution metrics (contd.) Score model for ech benchmrk: A * { 0.5 + 0.3/T + 0.2/M } Run-time score (T): Tool run-time (seconds) per 1K gtes Averge run-time for single threded sttisticl timer found ~ 1sec/1K gte benchmrk Memory score (M): Tool pek memory (in 100Mbs) per 1K gtes Averge memory for single threded sttisticl timer found ~ 100Mb/1K gte benchmrk Finl evlution nunces Evlution on 2.33Ghz Qud core mchine, 24Gb Rm, up to 8 threds Accurcy prmount Monte Crlo bsed timer to generte golden Bised towrds tools with better run-time/memory for similr ccurcy Finl score is sum of ll (20) benchmrk scores 18

Tool comprison for top 3 tems Tem Id Tem T13_11 IIT Mdrs, Indi Tem T13_13 Ntionl Tsing Hu University, Tiwn Tem T13_14 Ntionl Chio Tung University, Tiwn Timer Sttisticl multi threded Sttisticl single threded Sttisticl single threded Num. benchmrks tht crsh tool 2 0 3 Missing pin slcks Few None Few ~Averge ccurcy 0.72 1.0 0.45 ~Averge run-time 0.3 1.0 [~ 1sec/1K gtes] 17.0 (vried between 0.3 200) ~Averge memory 3.2 1.0 [~ 100Mb/1K gtes] 0.3 Accurcy scores indicted within 3% ccurcy of sttisticl timing (for given benchmrks, librry) for rrivl times nd slews Slck ccurcy nd worst timing ccurcy using sttisticl timer usully within 10% of Monte Crlo results 19

Summry Vrition wre timing nlysis contest Increse wreness of vrition wre timing nlysis, provide insight into some chllenging spects Prmetric timing propgtion Model fitting/finite-difference concept, inccurcies Prmetric mximum/minimum opertions Projection techniques Pessimism relief Encourge novel prlleliztion techniques Multi-threded timers Fcilitte cretion of publicly vilble vrition wre timing nlysis frmework nd benchmrks for reserch/future contests Frmework for potentil concepts on prmetric pth trcing Vribility wre timing mcro-modeling, coupling, etc. Reference: Sinh et l., TAU 2013 Vrition wre timing nlysis contest, pp. 171-178, ISPD 2013, Stteline, NV 20

Finl results Plques nd csh wrds for the top three tems Tem Id Tem T13_11 IIT Mdrs, Indi Tem T13_13 Ntionl Tsing Hu University, Tiwn Tem T13_14 Ntionl Chio Tung University, Tiwn Timer Sttisticl multi threded Sttisticl single threded Sttisticl single threded Num. benchmrks tht crsh tool 2 0 3 Missing pin slcks Few None Few ~Averge ccurcy 0.72 1.0 0.45 ~Averge run-time 0.3 1.0 [~ 1sec/1K gtes] 17.0 (vried between 0.3 200) ~Averge memory 3.2 1.0 [~ 100Mb/1K gtes] 0.3 Finl score verged over ll benchmrks 119 70 31 POSITION 1 2 3 21

TAU 2013 Vrition Awre Timing Contest Third Plce Awrd Presented to Yu-Ming Yng, Yu-Wei Chng, Shih-Heng Hung nd Iris Hui-Ru Jing Ntionl Chio Tung University, Tiwn For itimer Jinjun Xiong Chiryu Amin Debjit Sinh Generl Chir Technicl Chir Contest Chir 22

TAU 2013 Vrition Awre Timing Contest Second Plce Awrd Presented to Po-Yi Hsu, Sheng-Ki Wu, Yung-Shun Lin nd Wi-Kei Mk Ntionl Tsing Hu University, Tiwn For HWL Timer Jinjun Xiong Chiryu Amin Debjit Sinh Generl Chir Technicl Chir Contest Chir 23

TAU 2013 Vrition Awre Timing Contest First Plce Awrd Presented to Jobin Jcob Kvlm, Sudhrshn V, Nitin Chndrchoodn nd Shnkr Blchndrn IIT Mdrs, Indi For IITimer Jinjun Xiong Chiryu Amin Debjit Sinh Generl Chir Technicl Chir Contest Chir 24

Bckup: Detiled scores Finl results: Accurcy score: 25