VLSI Digital Signal Processing

Similar documents
EECS150 - Digital Design Lecture 2 - CMOS

Integrated Circuit Design ELCT 701 (Winter 2017) Lecture 1: Introduction

Digital Signal Processing

Digital Integrated Circuits EECS 312

L12: Reconfigurable Logic Architectures

Digital Integrated Circuits EECS 312. Review. Remember the ENIAC? IC ENIAC. Trend for one company. First microprocessor

Digital Integrated Circuits EECS 312. People. Exams. Purpose of Course and Course Objectives I. Grading philosophy. Grading and written feedback

Introduction to Digital Signal Processing (DSP)

Sharif University of Technology. SoC: Introduction

Chapter 1. Introduction to Digital Signal Processing

LUT OPTIMIZATION USING COMBINED APC-OMS TECHNIQUE

EEC 116 Fall 2011 Lab #5: Pipelined 32b Adder

High Performance Microprocessor Design and Automation: Overview, Challenges and Opportunities IBM Corporation

Introduction to Data Conversion and Processing

EE 330 Fall 2014 Integrated Electronics

EE 330 Fall 2013 Integrated Electronics

Low Power VLSI Circuits and Systems Prof. Ajit Pal Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur

L11/12: Reconfigurable Logic Architectures

Digitally Assisted Analog Circuits. Boris Murmann Stanford University Department of Electrical Engineering

IE1204 Digital Design L1 : Course Overview. Introduction to Digital Technology. Binary Numbers

EE 330 Spring 2018 Integrated Electronics

Lecture 1: Introduction to Digital Logic Design. CK Cheng CSE Dept. UC San Diego

ELEC 310 Digital Signal Processing

SEMICONDUCTOR TECHNOLOGY -CMOS-

An FPGA Implementation of Shift Register Using Pulsed Latches

A video signal processor for motioncompensated field-rate upconversion in consumer television

Software Engineering 2DA4. Slides 3: Optimized Implementation of Logic Functions

Understanding Compression Technologies for HD and Megapixel Surveillance

Timing Error Detection: An Adaptive Scheme To Combat Variability EE241 Final Report Nathan Narevsky and Richard Ott {nnarevsky,

SEMICONDUCTOR TECHNOLOGY -CMOS-

Introduction to Signal Processing D R. T A R E K T U T U N J I P H I L A D E L P H I A U N I V E R S I T Y

1.1 Digital Signal Processing Hands-on Lab Courses

CS/EE 6710 Digital VLSI Design CAD Assignment #3 Due Thursday September 21 st, 5:00pm

Design of Fault Coverage Test Pattern Generator Using LFSR

THE USE OF forward error correction (FEC) in optical networks

Research Article Design and Implementation of High Speed and Low Power Modified Square Root Carry Select Adder (MSQRTCSLA)

COE328 Course Outline. Fall 2007

Testing Digital Systems II

WELCOME. ECE 2030: Introduction to Computer Engineering* Richard M. Dansereau Copyright by R.M. Dansereau,

UNIT V 8051 Microcontroller based Systems Design

Keywords Xilinx ISE, LUT, FIR System, SDR, Spectrum- Sensing, FPGA, Memory- optimization, A-OMS LUT.

Low Power Approach of Clock Gating in Synchronous System like FIFO: A Novel Clock Gating Approach and Comparative Analysis

POWER AND AREA EFFICIENT LFSR WITH PULSED LATCHES

nmos transistor Basics of VLSI Design and Test Solution: CMOS pmos transistor CMOS Inverter First-Order DC Analysis CMOS Inverter: Transient Response

EECS150 - Digital Design Lecture 15 Finite State Machines. Announcements

VLSI Chip Design Project TSEK06

DSP in Communications and Signal Processing

Why FPGAs? FPGA Overview. Why FPGAs?

Adding Analog and Mixed Signal Concerns to a Digital VLSI Course

Performance Driven Reliable Link Design for Network on Chips

Low-Power and Area-Efficient Shift Register Using Pulsed Latches

Future of Analog Design and Upcoming Challenges in Nanometer CMOS

VLSI Digital Signal Processing Systems: Design And Implementation PDF

data and is used in digital networks and storage devices. CRC s are easy to implement in binary

Combining Dual-Supply, Dual-Threshold and Transistor Sizing for Power Reduction

IC Layout Design of Decoders Using DSCH and Microwind Shaik Fazia Kausar MTech, Dr.K.V.Subba Reddy Institute of Technology.

Syed Muhammad Yasser Sherazi CURRICULUM VITAE

ECE Circuits Curriculum

Final Exam CPSC/ECEN 680 May 2, Name: UIN:

Digital Logic Design: An Overview & Number Systems

Bubble Razor An Architecture-Independent Approach to Timing-Error Detection and Correction

An MFA Binary Counter for Low Power Application

Reconfigurable Neural Net Chip with 32K Connections

Frame Processing Time Deviations in Video Processors

Electrical and Telecommunications Engineering Technology_TCET3122/TC520. NEW YORK CITY COLLEGE OF TECHNOLOGY The City University of New York

Embedded System Design

Lab 6: Edge Detection in Image and Video

Experiment 2: Sampling and Quantization

P.Akila 1. P a g e 60

FPGA Development for Radar, Radio-Astronomy and Communications

UNIVERSITY OF MASSACHUSSETS LOWELL Department of Electrical & Computer Engineering Course Syllabus for Logic Design Fall 2013

Boolean, 1s and 0s stuff: synthesis, verification, representation This is what happens in the front end of the ASIC design process

Design and Analysis of Modified Fast Compressors for MAC Unit

24. Scaling, Economics, SOI Technology

Using on-chip Test Pattern Compression for Full Scan SoC Designs

An Efficient Reduction of Area in Multistandard Transform Core

EE241 - Spring 2001 Advanced Digital Integrated Circuits. References

Peak Dynamic Power Estimation of FPGA-mapped Digital Designs

ECE 4/517 MIXED SIGNAL IC DESIGN LECTURE 1 SLIDES. Vishal Saxena (vsaxena AT uidaho DOT edu) AMPIC Laboratory University of Idaho

Abstract 1. INTRODUCTION. Cheekati Sirisha, IJECS Volume 05 Issue 10 Oct., 2016 Page No Page 18532

TKK S ASIC-PIIRIEN SUUNNITTELU

Memory efficient Distributed architecture LUT Design using Unified Architecture

ISSCC 2006 / SESSION 14 / BASEBAND AND CHANNEL PROCESSING / 14.6

A High-Performance Parallel CAVLC Encoder on a Fine-Grained Many-core System

Area Efficient Pulsed Clock Generator Using Pulsed Latch Shift Register

LFSR Counter Implementation in CMOS VLSI

Based on slides/material by. Topic 14. Testing. Testing. Logic Verification. Recommended Reading:

Optimization of Multi-Channel BCH Error Decoding for Common Cases. Russell Dill Master's Thesis Defense April 20, 2015

Music 4 - Exploring Music Fall 2016

UNIT IV CMOS TESTING. EC2354_Unit IV 1

Lecture 26: Multipliers. Final presentations May 8, 1-5pm, BWRC Final reports due May 7 Final exam, Monday, May :30pm, 241 Cory

SIC Vector Generation Using Test per Clock and Test per Scan

Further Details Contact: A. Vinay , , #301, 303 & 304,3rdFloor, AVR Buildings, Opp to SV Music College, Balaji

PHASE-LOCKED loops (PLLs) are widely used in many

REDUCING DYNAMIC POWER BY PULSED LATCH AND MULTIPLE PULSE GENERATOR IN CLOCKTREE

Design of Memory Based Implementation Using LUT Multiplier

SoC IC Basics. COE838: Systems on Chip Design

PICOSECOND TIMING USING FAST ANALOG SAMPLING

International Journal of Engineering Research-Online A Peer Reviewed International Journal

Amon: Advanced Mesh-Like Optical NoC

Transcription:

VLSI Digital Signal Processing EEC 28 Lecture Bevan M. Baas Tuesday, January 8, 29

Today Administrative items Syllabus and course overview My background Digital signal processing overview Read Programmable DSP Architectures, Part I by E. A. Lee 3

Course Communication Email Urgent announcements Web page http://www.ece.ucdavis.edu/~bbaas/28/ Office hours After lecture Tuesday After lecture Thursday Tentatively Friday :3 2:3 4

Course Workload 4 unit graduate course This course requires significant effort and time Multi-disciplinary field coverage DSP algorithms Digital processor architectures Arithmetic Utilizes robust industry-standard CAD tools (but we will make use of only the core essential features) Verilog Synthesis tool Matlab EEC 28, 5

Course Readings No required textbook Over 8 slides posted as handouts on the course web page You should fully understand all material in these handouts A few required papers Several posted tutorials with example code You should fully understand these Several optional textbook references EEC 28, 6

Main Course Material The main body of material is presented in the lectures, readings, and handouts Generally speaking, the hwk/projets complement the main material They go into a much greater depth on specific topics They give design experience They give significant practical application of theory The Quizzes generally focus on the main body of material EEC 8B, 7

Breadth and Depth Breadth and Depth LECTURES HANDOUTS READINGS H W K P R O J P R O J EEC 8B, 8

Course Overview EEC 28 web page contents Reading materials and references Hwk/Project descriptions Handouts http://www.ece.ucdavis.edu/~bbaas/28/ EEC 28, 9

Course Overview Canvas Grades posted here Let me know if you ever see a score different than you expect Upload electronic portions of hwk/projects here Syllabus Posted on course web page EEC 28,

Lectures Ask questions at any time Please hold conversations outside of class Please silence phones I will be out on travel Thursday, January 7 Guest lecture or possibly a special make-up lecture Integrated Solid-State Circuits Conference (ISSCC) February 8 2 Quiz and guest lecture on Tue, Feb 9; or possibly a special make-up lecture EEC 28,

Letter Grade Assignments I assign a letter grade only for the final course grade I look at the final exams and course record of the class and assign two key dividing points: the A/A+ and (probably B/B+) boundaries, and assign course grades from there using equallysized intervals No required numbers of any particular letter grades Absolute scores are not important; the boundaries shift according to the difficulty of the exams in any quarter Ignore any letter grades you might see on smartsite A/A+ Example with hypothetical data: B/B+ B B+ A A+ 2

Working With Others Collaboration Asking questions and explaining principles produces better work and dramatically increases learning Working with others Do homework and prelabs with classmates nearby Ask each other questions, help each other regarding principles, and approaches to solving only See Course Collaboration Policy on web page Dishonesty Copying produces similar work, stunts learning, is not fair to honest students, and is not allowed in this course Students engaged in dishonest work will be referred to Student Judicial Affairs I will try to keep in-class exams honest Steps will be taken to keep out of class work honest EEC 8B, 3

Penalties for Violating the Policy on Student Conduct and Discipline Penalties Minimum penalty: meetings with SJA officer, zero grade on work, record with SJA One to three quarter suspension from the university Permanent dismissal from all ten campuses of the University of California. Permanent notation on your transcript. The purpose of the penalties and me mentioning them is so that no one will get a penalty!!! Don t do anything that violates the Policy on Student Conduct! EEC 8B, 4

Penalties for Violating the Policy on Student Conduct and Discipline Typical scenario: Someone shares code/design with another They get caught The Copier feels terrible guilt for causing a friend to get a zero as well as getting their own work marked zero The Sharer deeply regrets sharing resulting in a zero when he/she should have had a full score EEC 8B, 5

Exam and Quiz Regrades Some number of exams and quizzes will be scanned before being returned Key take-away messages: Do not change anything on your work if you request a regrade One student did last quarter and got in big BIG trouble!!! EEC 8B, 6

Cheating Websites chegg, coursehero, etc. Key take-away messages: Do not post assignments Of course do not use any unpermitted outside material in work you submit Of course do not post solutions Two students did last quarter and got caught!!! EEC 8B, 7

Critical Challenges Facing Industry Energy Efficiency Performance Software development cost and time Hardware development cost and time Opportunity: Critical workloads sometimes/frequently have relatively simple tasks as critical kernels (e.g., machine learning, digital signal processing, multimedia, data record processing, pattern matching, etc.) Embedded (e.g., IoT) Mobile Datacenter 27

Number of Processors on a Single Die vs. Year Academic Industry Note: Each processor capable of independent program execution 3

Processor Eras Transistor Era: the Intel 44 was the first commercial single-chip microprocessor and it contained 23 hand-drawn transistors Single/Multi-Processor Era: focus on components of single processors and multi-processors, which generally scale well to only small numbers of processors -Processor Era: focus on making systems scalable and working with processors as building blocks. The 32 nm -processor KiloCore chip would contain approximately 23-37 processors if its area were the same as a 32 nm Intel Core i7 processor, or, processors if its area were the same as an Nvidia GP 32

Future Fabrication Technologies Basic trends Number of available devices: continually increasing Energy dissipation per operation: decreasing too slowly VDD2 VDD3 VDD There are a lot of ways to place and connect a billion transistors The most efficient implementations (throughput, energy, area) will have: Processor sizes that capture computational kernels with few excess circuits Optimized clock frequencies and supply voltages matched to dynamic workloads 33

Optimal Computational Tile Size The most efficient implementations (energy, throughput, chip area) have: Processor sizes that capture computational kernels with few excess circuits ~ ~~ ~ ~~ Interprocessor interconnect Unused or low benefit-per-cost circuits Energy Effic. Clock rate Area Effic. Tile Size 34

Advancing CMOS Technologies Moore s Law (Observation) was made in 965 and notes that transistor density ~doubles every year (every.5 years now) "Cramming more components onto integrated circuits," Gordon Moore, Electronics, April 9, 965. 39

Transistors (thousands) Number of Logical Cores Original data up to the year 2 collected and plotted by M. Horowitz, F. Labonte, O. Shacham, K. Olukotun, L. Hammond, and C. Batten New plot and data collected for 2-25 by K. Rupp New data added by 4

Digital Signal Processing Digital Discrete time Discrete valued Signal, 2, 3, dimensional Processing Analysis Synthesis Enhancement 4

DSP Workloads Often real-time Data producer and consumer can not be paused or held up Examples: antenna, controller, camera, video monitor, Very strict minimum performance levels Performance above that minimum is often of little value Data producer DSP system Data consumer 42

DSP Workloads Analysis. Ex: anti-lock brakes Data producer Maybe MSamples/sec DSP system Maybe Sample/sec Synthesis. Ex: music keyboard DSP system Data consumer 43

DSP Workloads Data stream can be considered infinite duration Length of data stream >> any buffering Ex: high-pass filter, automotive collision-detection radar distance measurement system DSP system 44

DSP Workloads Digital signal processing Typically very numerically intensive Lots of +, -, x DSP system 45

DSP Compared with Analog Processing Digital signal processing Compare with analog signal processing If possible in analog domain (at required precision), analog processing will likely require far fewer devices If possible in analog domain, either domain may produce the most energy-efficient solution Many algorithms are possible only with DSP (arbitrarily high precision, non-causal, ) DSP arithmetic is completely stable over process, temperature, and voltage variations Ex: 2. + 3. = 5. will always be true as long as the circuit is functioning correctly 46

Digital signal processing DSP Compared with Analog Processing Compare with analog signal processing DSP energy-efficiencies are rapidly increasing Once a DSP processor has been designed in a portable format (gate netlist, HDL, software), very little effort is required to port (re-target) the design to a different processing technology. Analog circuits typically require a nearly-complete re-design. DSP capabilities are rapidly increasing Analog A/D speed x resolution product doubles every 5 years Digital processing performance doubles every 8-24 months (6x to x every 5 years) 47

Common DSP Applications Early applications Instrumentation Radar Communication Imaging Current applications Consumer audio, video Networking Telecommunications Machine learning Imaging Many many more 48

Consumer Products Trends Analog based Digital based Music records, tapes CDs, MP3s Video VHS, 8mm DVD, Blu-ray, H.264, H.265 Telephony analog mobile (G) digital (4G, LTE, ) Television NTSC/PAL digital (DVB, ATSC, ISDB, ) Many products use digital data and speak digital: computers, networks, digital appliances Impacts Processing Transmission Storage etc. EEC 28, 49

Consumer Products Trends Analog based vs. Digital based iphone apps??? EEC 28, 5

Future Applications Very limited power budgets Require significant digital signal processing 5

Key Design Metrics ) Performance a) Throughput (high); e.g., 25 MSamples/sec b) Latency (low); e.g., 2.7 µsec from first sample in -> first out c) Numerical precision 2) Chip area (cost); e.g., mm 2 die area, area of standard cell netlist 3) Energy dissipation per workload, e.g., Joules per JPEG image 4) Design complexity Design time = lower performance Software more important as systems become more complex 5) Suitability for future fabrication technologies Many transistors Faulty devices i) During manufacturing process ii) device wear out due to effects such as NBTI EEC 28, 52