Power Efficient Architectures to Accelerate Deep Convolutional Neural Networks for edge computing and IoT

Size: px
Start display at page:

Download "Power Efficient Architectures to Accelerate Deep Convolutional Neural Networks for edge computing and IoT"

Transcription

1 Power Efficient Architectures to Accelerate Deep Convolutional Neural Networks for edge computing and IoT Giuseppe Desoli ST Central Labs STMicroelectronics

2 Artificial Intelligence is Everywhere 2 Analysis, i.e.: Where am I? Scene classification (audio, video, environmental sensors) Which objects are in the scene, where are they? Video object detection/classification What am I doing? Activity recognition (audio, video, inertial sensors) What s happening? Event recognition (audio, video, intertial sensors, environmental sensors). User Interaction: Command detection (audio) Speech Recognition (audio) Gesture Recognition (inertial sensors, video) User identification and mood detection (audio, video) Continuous Learning, i.e.. How can I detect unpredictable, unclassified events in dynamic environments? Recurrent networks (audio, video, intertial sens, environm sens) And many more..

3 Comm Sensors Comm Sensors Comm Sensors Comm Sensors Comm Sensors Comm Sensors Artificial neural network placing intelligence where required Scalability Service Distributed Intelligence: why? Big data Service enablement Global optimization Centralized Distributed 3 Responsiveness Smart units Collected data analytic Service enablement sensors 100 Mb-1 Tb / sec 10 TOPS TOPS sensors ~10 Mb / sec ~10 GOPS concentrator concentrator al analytic Real time 100 sensors 1 Mb-10 Gb / sec 100 MOPS-100 TOPS 100 sensors ~100 Kb / sec ~100 MOPS Unit Unit Unit Unit Unit Unit 1 sensor 10 KOPS-100 MOPS 1 Intelligent sensor 100 MOPS-1 TOPS 1 sensor 10 Kb-100 Mb / sec

4 Why Artificial Neural Networks? 4 The power and usefulness of ANN have been demonstrated in several applications A specific kind of neural networks, the Deep Convolutional Neural Networks (DCNN), have proven very effective Achieving human-like performance in selected cases In the last decade breakthroughs made neural networks practical Better training algorithms, Moore s law and Big data availability For IoT the current challenge is to achieve low power and adequate cost with sufficient performance for edge computing

5 A Typical CDNN Structure 5 The Artificial Neuron is a processing unit with a close connection to neurobiology DCNN s are composed of multiple layers of neurons Each layer performs feature extraction with learned filters, reduction of input resolution and non-linear operations Multiple layers compress each image into denser information Depth indicates the number of layers of the specific CDNN network. Up to millions of parameters for each layer of the network can be involved Parameters can be defined thanks to supervised or unsupervised training algorithms processing large training sets Images Source: Stanford

6 Deep Learning on images 6 Image Classification Object alization Object Detection Image Segmentation Action Recognition Images source: ImgeNet Image Generation

7 Beyond recognition: semantic captioning 7 a giraffe has it's head up to a small tree. a giraffe in a pen standing under a tree. giraffe standing next to a wooden treelike structure. a tall giraffe standing next to a tree a giraffe in an enclosure standing next to a tree. Courtesy of COCO: Common Object in Context, Microsoft

8 Deep Learning for Speech 8 Speech Recognition Natural Language Processing Speech Translation Audio Generation

9 Deep Leraning for autonomous driving 9 Simultaneous objects (cars pedestrian, signals) detection and identification Semantic segmentation Multiple sensory input (visual, radar,lidar, proximity, etc.) End to end processing and actuation

10 Convolutional NN Complexity Evolution Operations (GOPS) Parameters (Millions) ANNs ( ) 3 layers AlexNET (2012) 8 layers GoogleLeNet (2014) 22 layers VGG19 (2014) 19 layers 11.3 ResNet (2015) 152 layers

11 HW for Deep Learning: few examples 11 Intel Xeon Phi 7285 Freq 1.4GHz, 68 cores, TDP 250W, 14nm, Perf > 3.4TOPS (TBA) Price $2036 (sept 2017) instructions for deep learning (AVX512-4VNNIW, AVX512-4FMAPS) NVIDIA Xavier 8-core CPU, 512-core Volta GPU 30 TOPS TDP 30W, 16nm Mobileye (now Intel) EyeQ5 7nm 12 TOPS peak 2.4 DL TOPS Movidius Myriad X (now Intel) 16 vector 128bit VLIW, Neural Compute Engine 4 TOPS TDP 2W, 16nm

12 Exploiting parallelism (we need special HW) 12 Temporal Architectures (SIMD/SIMT) Spatial (Data-flow) Architectures (SIMD) Two broad classes of architectures can be identified Both have pros and cons Specialized HW is needed to achieve power consumption compatible with IoT applications and cost Memory access the key aspect Courtesy of MIT Eyeriss project Energy/power x word access al SRAM On-chip SRAM LPDDR 1x 10x 100x

13 An Ultra low power example: Orlando SoC HW co-processors 8x Dual Cluster (16 cores) Image Stream Processor Image Stream Processor Video out (DVI) Interface DM IM I$ SM shared DM IM I$ DM IM I$ SM shared DM IM I$ DM IM I$ SM shared DM IM I$ DM IM I$ SM shared DM IM I$ HW co-processors Color conv Crop Stream Switch Conv Conv DM IM I$ shared SM DM IM I$ DM IM I$ shared SM 8 ports DM IM I$ DM IM I$ shared SM DM IM I$ DM IM I$ shared SM STBUS T3 full (64bits) DM IM I$ Scale Integral Str. Str. Eng 0 Eng 1... Bus access arbiter & IF controller Str. Eng n Conv Conv STBUS slave Interface (type1) MEM MEM MEM MEM 4 x 1MB Global Ram T3-AXI AXI AXI-T3 HOST subsystem (e.g. ARM, Peripherals, mem, IFs, etc). Int controller T3-APB Shared Mailbox/ timer Debug controller n - 1 buf x Feature strip buffer kernel registers m * n M M Mn x M A C n (m/h) x col MACs A D D buf buf Convolution accelerators Presentation Title 3/2/2018 JTAG

14 Reconfigurable Accelerator Framework 14 Color convert Cropper H264 Ctrl Regs.... MJPEG COMP. IMAGE E15 Image Sensor IF & ISP E14 Stream Switch RGB IMAGE... Image Sensor IF & ISP E4 BATCH -1 BATCH FEATURE E3 Bus Arbiter & System Bus Interface Display out (DVI) Interface E2 E1 E0 CA 0 CA 1 CA 2 CA 3 KERNEL CA 7... Virtual stream links Ferry data to/from accelerators, interfaces and engines Flow control mechanism is provided Streams can be multicast to multiple destinations More flexible than hardware data paths More power efficient than a bus

15 um HW ACCELLER. SUBSYSTEM Prototype Chip FD-SOI 28nm 15 OTP High Speed Camera IF PLL CHIP TO CHIP M um (DSP) CORES AND LOCAL MEMS GLOBAL MEMORY SUBSYSTEM Technology Package Frequency Supply voltages Power 0.575V, 8 CAs On-chip RAM FD-SOI 28nm FBGA 15x15x MHz 1.175GHz V digital 1.8V I/O FPS 4x1MB 8x192KB 128KB Host ARM Cortex -M4 No of DSPs 16 Peak DSP perf 75 GOPS (2x16bMAC (*) ) No of CAs 8 CAs perf (1.175GHz, 1.1V) 676 GOPS (*) peak (*) 1 MAC defined as 2 OPS (ADD + MUL) (**) HW Acc avg power for AlexNet

16 Ultra-Wide DVFS Range 16 LVT design with heterogeneous Poly-Bias levels => perf vs leakage GALS and low insertion delay clock networks to minimize on chip variation margins; Mono Supply memories with fine grained power switches and sleep mode; DVFS energy efficiency improvements via body bias. Wide DVFS Range Frequency GOPS/W

17 Application Example: AlexNet 17 Input image SENSOR I/F RGB->YUV KER. MEM JPEG MEM CA CROP 227x227 IN FMAP CA HOST + SPI + To PC DSPs OUT FMAP MHz, 0.6V 10 FPS (38 ms DSPs, 62 ms CAs) 2 chained CAs Dynamic: 10 mw CAs + 17 mw system Static: 0.6 mw CAs mw system

18 FPS Power [mw] Orlando CNN inference engine performance 18 VGG16 performance vs power scaling at Vdd range CAs with 8bpp MACs 16 Kernels in parallel 1 CAs 2 CAs 4 CAs 8 CAs 16 CAs / / / /650 1/ /1175 Vdd/Freq range Compared (unfairly) to NVIDIA Tegra 1200 MHz FP32 83 FPS Cost: $ TDP: > 200W

19 Orlando at work 19 Left Orlando running Pico Yolo CNN for object detection and classification Top Orlando running a CNN trained to drive a simulated car Bottom: Orlando identifying faces and classifying expressions

RedEye Analog ConvNet Image Sensor Architecture for Continuous Mobile Vision

RedEye Analog ConvNet Image Sensor Architecture for Continuous Mobile Vision Analog ConvNet Image Sensor Architecture for Continuous Mobile Vision Robert LiKamWa Yunhui Hou Yuan Gao Mia Polansky Lin Zhong roblkw@rice.edu houyh@rice.edu yg18@rice.edu mia.polansky@rice.edu lzhong@rice.edu

More information

Lossless Compression Algorithms for Direct- Write Lithography Systems

Lossless Compression Algorithms for Direct- Write Lithography Systems Lossless Compression Algorithms for Direct- Write Lithography Systems Hsin-I Liu Video and Image Processing Lab Department of Electrical Engineering and Computer Science University of California at Berkeley

More information

IEEE Santa Clara ComSoc/CAS Weekend Workshop Event-based analog sensing

IEEE Santa Clara ComSoc/CAS Weekend Workshop Event-based analog sensing IEEE Santa Clara ComSoc/CAS Weekend Workshop Event-based analog sensing Theodore Yu theodore.yu@ti.com Texas Instruments Kilby Labs, Silicon Valley Labs September 29, 2012 1 Living in an analog world The

More information

RFSOI and FDSOI enabling smarter and IoT applications. Kirk Ouellette Digital Products Group STMicroelectronics

RFSOI and FDSOI enabling smarter and IoT applications. Kirk Ouellette Digital Products Group STMicroelectronics RFSOI and FDSOI enabling smarter and IoT applications Kirk Ouellette Digital Products Group STMicroelectronics ST in the IoT already Today 2 Kirk Ouellette More then Moore Workshop - Shanghai - March 17,

More information

TODAY computer vision technologies are used with great

TODAY computer vision technologies are used with great ARXIV PREPRINT 1 Origami: A 803 GOp/s/W Convolutional Network Accelerator Lukas Cavigelli, Student Member, IEEE, and Luca Benini, Fellow, IEEE arxiv:1512.04295v2 [cs.cv] 19 Jan 2016 Abstract An ever increasing

More information

MMI: A General Narrow Interface for Memory Devices

MMI: A General Narrow Interface for Memory Devices MMI: A General Narrow Interface for Devices Judy Chen Eric Linstadt Rambus Inc. Session 106 August 12, 2009 August 2009 1 What is MMI? WLAN BT GPS NOR S/M Baseband Processor Apps/Media Processor NAND M

More information

Alain Legault Hardent. Create Higher Resolution Displays With VESA Display Stream Compression

Alain Legault Hardent. Create Higher Resolution Displays With VESA Display Stream Compression Alain Legault Hardent Create Higher Resolution Displays With VESA Display Stream Compression What Is VESA? 2 Why Is VESA Needed? Video In Processor TX Port RX Port Display Module To Display Mobile application

More information

Pivoting Object Tracking System

Pivoting Object Tracking System Pivoting Object Tracking System [CSEE 4840 Project Design - March 2009] Damian Ancukiewicz Applied Physics and Applied Mathematics Department da2260@columbia.edu Jinglin Shen Electrical Engineering Department

More information

Design and Implementation of an AHB VGA Peripheral

Design and Implementation of an AHB VGA Peripheral Design and Implementation of an AHB VGA Peripheral 1 Module Overview Learn about VGA interface; Design and implement an AHB VGA peripheral; Program the peripheral using assembly; Lab Demonstration. System

More information

Film Grain Technology

Film Grain Technology Film Grain Technology Hollywood Post Alliance February 2006 Jeff Cooper jeff.cooper@thomson.net What is Film Grain? Film grain results from the physical granularity of the photographic emulsion Film grain

More information

AN-ENG-001. Using the AVR32 SoC for real-time video applications. Written by Matteo Vit, Approved by Andrea Marson, VERSION: 1.0.0

AN-ENG-001. Using the AVR32 SoC for real-time video applications. Written by Matteo Vit, Approved by Andrea Marson, VERSION: 1.0.0 Written by Matteo Vit, R&D Engineer Dave S.r.l. Approved by Andrea Marson, CTO Dave S.r.l. DAVE S.r.l. www.dave.eu VERSION: 1.0.0 DOCUMENT CODE: AN-ENG-001 NO. OF PAGES: 8 AN-ENG-001 Using the AVR32 SoC

More information

A Low-Power 0.7-V H p Video Decoder

A Low-Power 0.7-V H p Video Decoder A Low-Power 0.7-V H.264 720p Video Decoder D. Finchelstein, V. Sze, M.E. Sinangil, Y. Koken, A.P. Chandrakasan A-SSCC 2008 Outline Motivation for low-power video decoders Low-power techniques pipelining

More information

Hi3518A Professional HD IP Camera SoC. Brief Data Sheet. Issue 03. Date Baseline Date

Hi3518A Professional HD IP Camera SoC. Brief Data Sheet. Issue 03. Date Baseline Date Professional HD IP Camera SoC Brief Data Sheet Issue 03 Date 2012-08-15 Baseline Date 2012-06-06 . 2012. All rights reserved. No part of this document may be reproduced or transmitted in any form or by

More information

ISELED - A Bright Future for Automotive Interior Lighting

ISELED - A Bright Future for Automotive Interior Lighting ISELED - A Bright Future for Automotive Interior Lighting Rev 1.1, October 2017 White Paper Authors: Roland Neumann (Inova), Robert Isele (BMW), Manuel Alves (NXP) Contents More than interior lighting...

More information

OL_H264e HDTV H.264/AVC Baseline Video Encoder Rev 1.0. General Description. Applications. Features

OL_H264e HDTV H.264/AVC Baseline Video Encoder Rev 1.0. General Description. Applications. Features OL_H264e HDTV H.264/AVC Baseline Video Encoder Rev 1.0 General Description Applications Features The OL_H264e core is a hardware implementation of the H.264 baseline video compression algorithm. The core

More information

Yong Cao, Debprakash Patnaik, Sean Ponce, Jeremy Archuleta, Patrick Butler, Wu-chun Feng, and Naren Ramakrishnan

Yong Cao, Debprakash Patnaik, Sean Ponce, Jeremy Archuleta, Patrick Butler, Wu-chun Feng, and Naren Ramakrishnan Yong Cao, Debprakash Patnaik, Sean Ponce, Jeremy Archuleta, Patrick Butler, Wu-chun Feng, and Naren Ramakrishnan Virginia Polytechnic Institute and State University Reverse-engineer the brain National

More information

Epiphan Frame Grabber User Guide

Epiphan Frame Grabber User Guide Epiphan Frame Grabber User Guide VGA2USB VGA2USB LR DVI2USB VGA2USB HR DVI2USB Solo VGA2USB Pro DVI2USB Duo KVM2USB www.epiphan.com 1 February 2009 Version 3.20.2 (Windows) 3.16.14 (Mac OS X) Thank you

More information

EECS150 - Digital Design Lecture 12 - Video Interfacing. Recap and Outline

EECS150 - Digital Design Lecture 12 - Video Interfacing. Recap and Outline EECS150 - Digital Design Lecture 12 - Video Interfacing Oct. 8, 2013 Prof. Ronald Fearing Electrical Engineering and Computer Sciences University of California, Berkeley (slides courtesy of Prof. John

More information

Sensor Development for the imote2 Smart Sensor Platform

Sensor Development for the imote2 Smart Sensor Platform Sensor Development for the imote2 Smart Sensor Platform March 7, 2008 2008 Introduction Aging infrastructure requires cost effective and timely inspection and maintenance practices The condition of a structure

More information

Reconfigurable Neural Net Chip with 32K Connections

Reconfigurable Neural Net Chip with 32K Connections Reconfigurable Neural Net Chip with 32K Connections H.P. Graf, R. Janow, D. Henderson, and R. Lee AT&T Bell Laboratories, Room 4G320, Holmdel, NJ 07733 Abstract We describe a CMOS neural net chip with

More information

Implementation of an MPEG Codec on the Tilera TM 64 Processor

Implementation of an MPEG Codec on the Tilera TM 64 Processor 1 Implementation of an MPEG Codec on the Tilera TM 64 Processor Whitney Flohr Supervisor: Mark Franklin, Ed Richter Department of Electrical and Systems Engineering Washington University in St. Louis Fall

More information

OL_H264MCLD Multi-Channel HDTV H.264/AVC Limited Baseline Video Decoder V1.0. General Description. Applications. Features

OL_H264MCLD Multi-Channel HDTV H.264/AVC Limited Baseline Video Decoder V1.0. General Description. Applications. Features OL_H264MCLD Multi-Channel HDTV H.264/AVC Limited Baseline Video Decoder V1.0 General Description Applications Features The OL_H264MCLD core is a hardware implementation of the H.264 baseline video compression

More information

Outline. 1 Reiteration. 2 Dynamic scheduling - Tomasulo. 3 Superscalar, VLIW. 4 Speculation. 5 ILP limitations. 6 What we have done so far.

Outline. 1 Reiteration. 2 Dynamic scheduling - Tomasulo. 3 Superscalar, VLIW. 4 Speculation. 5 ILP limitations. 6 What we have done so far. Outline 1 Reiteration Lecture 5: EIT090 Computer Architecture 2 Dynamic scheduling - Tomasulo Anders Ardö 3 Superscalar, VLIW EIT Electrical and Information Technology, Lund University Sept. 30, 2009 4

More information

A High-Performance Parallel CAVLC Encoder on a Fine-Grained Many-core System

A High-Performance Parallel CAVLC Encoder on a Fine-Grained Many-core System A High-Performance Parallel CAVLC Encoder on a Fine-Grained Many-core System Zhibin Xiao and Bevan M. Baas VLSI Computation Lab, ECE Department University of California, Davis Outline Introduction to H.264

More information

Getting Started with Launchpad and Grove Starter Kit. Franklin Cooper University Marketing Manager

Getting Started with Launchpad and Grove Starter Kit. Franklin Cooper University Marketing Manager Getting Started with Launchpad and Grove Starter Kit Franklin Cooper University Marketing Manager Prelab Work Lab Documentation: https://goo.gl/vzi53y Create a free my.ti.com account Install Drivers for

More information

1ms Column Parallel Vision System and It's Application of High Speed Target Tracking

1ms Column Parallel Vision System and It's Application of High Speed Target Tracking Proceedings of the 2(X)0 IEEE International Conference on Robotics & Automation San Francisco, CA April 2000 1ms Column Parallel Vision System and It's Application of High Speed Target Tracking Y. Nakabo,

More information

NDIA Army Science and Technology Conference EWA Government Systems, Inc.

NDIA Army Science and Technology Conference EWA Government Systems, Inc. NDIA Army Science and Technology Conference EWA Government Systems, Inc. PITCH DECK Biologically-Inspired Processor for Ultra-Low Power Audio and Video Surveillance Applications Presented by Lester Foster

More information

Hi3520A H.264 Codec Processor. Brief Data Sheet. Issue 01. Date

Hi3520A H.264 Codec Processor. Brief Data Sheet. Issue 01. Date Brief Data Sheet Issue 01 Date 2012-05-04 . 2012. All rights reserved. No part of this document may be reproduced or transmitted in any form or by any means without prior written consent of HiSilicon Technologies

More information

An Introduction to Deep Image Aesthetics

An Introduction to Deep Image Aesthetics Seminar in Laboratory of Visual Intelligence and Pattern Analysis (VIPA) An Introduction to Deep Image Aesthetics Yongcheng Jing College of Computer Science and Technology Zhejiang University Zhenchuan

More information

Display Interfaces. Display solutions from Inforce. MIPI-DSI to Parallel RGB format

Display Interfaces. Display solutions from Inforce. MIPI-DSI to Parallel RGB format Display Interfaces Snapdragon processors natively support a few popular graphical displays like MIPI-DSI/LVDS and HDMI or a combination of these. HDMI displays that output any of the standard resolutions

More information

PROF. TAJANA SIMUNIC ROSING. Midterm. Problem Max. Points Points Total 150 INSTRUCTIONS:

PROF. TAJANA SIMUNIC ROSING. Midterm. Problem Max. Points Points Total 150 INSTRUCTIONS: CSE 237A FALL 2006 PROF. TAJANA SIMUNIC ROSING Midterm NAME: ID: Solutions Problem Max. Points Points 1 20 2 20 3 30 4 25 5 25 6 30 Total 150 INSTRUCTIONS: 1. There are 6 problems on 11 pages worth a total

More information

Low Power Design of the Next-Generation High Efficiency Video Coding

Low Power Design of the Next-Generation High Efficiency Video Coding Low Power Design of the Next-Generation High Efficiency Video Coding Authors: Muhammad Shafique, Jörg Henkel CES Chair for Embedded Systems Outline Introduction to the High Efficiency Video Coding (HEVC)

More information

Intelligent Monitoring Software IMZ-RS300. Series IMZ-RS301 IMZ-RS304 IMZ-RS309 IMZ-RS316 IMZ-RS332 IMZ-RS300C

Intelligent Monitoring Software IMZ-RS300. Series IMZ-RS301 IMZ-RS304 IMZ-RS309 IMZ-RS316 IMZ-RS332 IMZ-RS300C Intelligent Monitoring Software IMZ-RS300 Series IMZ-RS301 IMZ-RS304 IMZ-RS309 IMZ-RS316 IMZ-RS332 IMZ-RS300C Flexible IP Video Monitoring With the Added Functionality of Intelligent Motion Detection With

More information

Computer and Machine Vision

Computer and Machine Vision Computer and Machine Vision Lecture Week 3 Part-1 January 27, 2014 Sam Siewert Outline of Week 3 Processing Images and Moving Pictures High Level View and Computer Architecture for it Linux Platforms for

More information

Comp 410/510. Computer Graphics Spring Introduction to Graphics Systems

Comp 410/510. Computer Graphics Spring Introduction to Graphics Systems Comp 410/510 Computer Graphics Spring 2018 Introduction to Graphics Systems Computer Graphics Computer graphics deals with all aspects of 'creating images with a computer - Hardware (PC with graphics card)

More information

microenable 5 marathon ACL Product Profile of microenable 5 marathon ACL Datasheet microenable 5 marathon ACL

microenable 5 marathon ACL Product Profile of microenable 5 marathon ACL   Datasheet microenable 5 marathon ACL i Product Profile of Scalable, intelligent high performance frame grabber for highest requirements on image acquisition and preprocessing by robust industrial MV standards All formats of Camera Link standard

More information

Efficient FPGA-based Video Systems. Aaron Behman Xilinx

Efficient FPGA-based Video Systems. Aaron Behman Xilinx Efficient FPGA-based Video Systems Aaron Behman Xilinx Agenda About Xilinx Trends in Professional Video FPGAs in the Video Value Chain Video System Architecture FPGA Video System Software Defined Video

More information

USING FUSION SYSTEM ARCHITECTURE FOR BROADCAST VIDEO. Edward Callway AMD

USING FUSION SYSTEM ARCHITECTURE FOR BROADCAST VIDEO. Edward Callway AMD USING FUSION SYSTEM ARCHITECTURE FOR BROADCAST VIDEO Edward Callway AMD USING PC COMPONENTS FOR BROADCAST VIDEO Video processing from pure analog to digital compute PC Design for video Parallel GPU computing

More information

Tools to Debug Dead Boards

Tools to Debug Dead Boards Tools to Debug Dead Boards Hardware Prototype Bring-up Ryan Jones Senior Application Engineer Corelis 1 Boundary-Scan Without Boundaries click to start the show Webinar Outline What is a Dead Board? Prototype

More information

microenable IV AD1-PoCL Product Profile of microenable IV AD1-PoCL Datasheet microenable IV AD1-PoCL

microenable IV AD1-PoCL Product Profile of microenable IV AD1-PoCL Datasheet microenable IV AD1-PoCL i Product Profile of Scalable, intelligent frame grabber for image acquisition and OEM projects Base to Medium incl DualBase format frame grabber PoCL SafePower Broad camera support No camera file needed

More information

1. INTRODUCTION. Index Terms Video Transcoding, Video Streaming, Frame skipping, Interpolation frame, Decoder, Encoder.

1. INTRODUCTION. Index Terms Video Transcoding, Video Streaming, Frame skipping, Interpolation frame, Decoder, Encoder. Video Streaming Based on Frame Skipping and Interpolation Techniques Fadlallah Ali Fadlallah Department of Computer Science Sudan University of Science and Technology Khartoum-SUDAN fadali@sustech.edu

More information

4K Video, Real-Time Analytics, and AI Applications Drive 24G SAS

4K Video, Real-Time Analytics, and AI Applications Drive 24G SAS 4K Video, Real-Time Analytics, and AI Applications Drive 24G SAS Dennis Martin Santa Clara, CA http://www.demartek.com/demartek_presenting_flashmemorysummit_2017-08.html 1 About Demartek Industry Analysis

More information

Day & Night 1080P HD Vari-Focal Dome IR IP Camera

Day & Night 1080P HD Vari-Focal Dome IR IP Camera Day & Night 1080P HD Vari-Focal Dome IR IP Camera Features Full HD 2 megapixel CMOS image sensor True H.264 AVC/MPEG-4 part 10 real-time video compression H.264 and Motion JPEG multi-profile video streaming

More information

EN2911X: Reconfigurable Computing Topic 01: Programmable Logic. Prof. Sherief Reda School of Engineering, Brown University Fall 2014

EN2911X: Reconfigurable Computing Topic 01: Programmable Logic. Prof. Sherief Reda School of Engineering, Brown University Fall 2014 EN2911X: Reconfigurable Computing Topic 01: Programmable Logic Prof. Sherief Reda School of Engineering, Brown University Fall 2014 1 Contents 1. Architecture of modern FPGAs Programmable interconnect

More information

MEMS Mirror: A8L AU-TINY48.4

MEMS Mirror: A8L AU-TINY48.4 MEMS Mirror: A8L2.2-4600AU-TINY48.4 Description: The new A8L2 actuator is based on an established robust two-axis MEMS design which supports various bonded mirror sizes in largeangle beam steering. Previous

More information

MPEG decoder Case. K.A. Vissers UC Berkeley Chamleon Systems Inc. and Pieter van der Wolf. Philips Research Eindhoven, The Netherlands

MPEG decoder Case. K.A. Vissers UC Berkeley Chamleon Systems Inc. and Pieter van der Wolf. Philips Research Eindhoven, The Netherlands MPEG decoder Case K.A. Vissers UC Berkeley Chamleon Systems Inc. and Pieter van der Wolf Philips Research Eindhoven, The Netherlands 1 Outline Introduction Consumer Electronics Kahn Process Networks Revisited

More information

Design and analysis of microcontroller system using AMBA- Lite bus

Design and analysis of microcontroller system using AMBA- Lite bus Design and analysis of microcontroller system using AMBA- Lite bus Wang Hang Suan 1,*, and Asral Bahari Jambek 1 1 School of Microelectronic Engineering, Universiti Malaysia Perlis, Perlis, Malaysia Abstract.

More information

DSP in Communications and Signal Processing

DSP in Communications and Signal Processing Overview DSP in Communications and Signal Processing Dr. Kandeepan Sithamparanathan Wireless Signal Processing Group, National ICT Australia Introduction to digital signal processing Introduction to digital

More information

Hardware Implementation of Block GC3 Lossless Compression Algorithm for Direct-Write Lithography Systems

Hardware Implementation of Block GC3 Lossless Compression Algorithm for Direct-Write Lithography Systems Hardware Implementation of Block GC3 Lossless Compression Algorithm for Direct-Write Lithography Systems Hsin-I Liu, Brian Richards, Avideh Zakhor, and Borivoje Nikolic Dept. of Electrical Engineering

More information

Highly Parallel HEVC Decoding for Heterogeneous Systems with CPU and GPU

Highly Parallel HEVC Decoding for Heterogeneous Systems with CPU and GPU 2017. This manuscript version (accecpted manuscript) is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/. Highly Parallel HEVC Decoding for Heterogeneous

More information

Multiband Noise Reduction Component for PurePath Studio Portable Audio Devices

Multiband Noise Reduction Component for PurePath Studio Portable Audio Devices Multiband Noise Reduction Component for PurePath Studio Portable Audio Devices Audio Converters ABSTRACT This application note describes the features, operating procedures and control capabilities of a

More information

A low-power portable H.264/AVC decoder using elastic pipeline

A low-power portable H.264/AVC decoder using elastic pipeline Chapter 3 A low-power portable H.64/AVC decoder using elastic pipeline Yoshinori Sakata, Kentaro Kawakami, Hiroshi Kawaguchi, Masahiko Graduate School, Kobe University, Kobe, Hyogo, 657-8507 Japan Email:

More information

EIE: Efficient Inference Engine on Compressed Deep Neural Network

EIE: Efficient Inference Engine on Compressed Deep Neural Network EIE: Efficient Inference Engine on Compressed Deep Neural Network Song Han*, Xingyu Liu, Huizi Mao, Jing Pu, Ardavan Pedram, Mark Horowitz, Bill Dally Stanford University June 20, 2016 Deep Learning on

More information

Major Differences Between the DT9847 Series Modules

Major Differences Between the DT9847 Series Modules DT9847 Series Dynamic Signal Analyzer for USB With Low THD and Wide Dynamic Range The DT9847 Series are high-accuracy, dynamic signal acquisition modules designed for sound and vibration applications.

More information

Technical Note PowerPC Embedded Processors Video Security with PowerPC

Technical Note PowerPC Embedded Processors Video Security with PowerPC Introduction For many reasons, digital platforms are becoming increasingly popular for video security applications. In comparison to traditional analog support, a digital solution can more effectively

More information

A NOVEL DESIGN OF COUNTER USING TSPC D FLIP-FLOP FOR HIGH PERFORMANCE AND LOW POWER VLSI DESIGN APPLICATIONS USING 45NM CMOS TECHNOLOGY

A NOVEL DESIGN OF COUNTER USING TSPC D FLIP-FLOP FOR HIGH PERFORMANCE AND LOW POWER VLSI DESIGN APPLICATIONS USING 45NM CMOS TECHNOLOGY A NOVEL DESIGN OF COUNTER USING TSPC D FLIP-FLOP FOR HIGH PERFORMANCE AND LOW POWER VLSI DESIGN APPLICATIONS USING 45NM CMOS TECHNOLOGY Ms. Chaitali V. Matey 1, Ms. Shraddha K. Mendhe 2, Mr. Sandip A.

More information

3. Configuration and Testing

3. Configuration and Testing 3. Configuration and Testing C51003-1.4 IEEE Std. 1149.1 (JTAG) Boundary Scan Support All Cyclone devices provide JTAG BST circuitry that complies with the IEEE Std. 1149.1a-1990 specification. JTAG boundary-scan

More information

8 DIGITAL SIGNAL PROCESSOR IN OPTICAL TOMOGRAPHY SYSTEM

8 DIGITAL SIGNAL PROCESSOR IN OPTICAL TOMOGRAPHY SYSTEM Recent Development in Instrumentation System 99 8 DIGITAL SIGNAL PROCESSOR IN OPTICAL TOMOGRAPHY SYSTEM Siti Zarina Mohd Muji Ruzairi Abdul Rahim Chiam Kok Thiam 8.1 INTRODUCTION Optical tomography involves

More information

Xetal-Pro: An Ultra-Low Energy and High Throughput SIMD Processor

Xetal-Pro: An Ultra-Low Energy and High Throughput SIMD Processor Xetal-Pro: An Ultra-Low Energy and High Throughput SIMD Processor Yifan He, Yu Pu Eindhoven University of {y.he, y.pu}@tue.nl Richard Kleihorst VITO, Belgium richard.kleihorst@vito.be Zhenyu Ye Eindhoven

More information

SoC IC Basics. COE838: Systems on Chip Design

SoC IC Basics. COE838: Systems on Chip Design SoC IC Basics COE838: Systems on Chip Design http://www.ee.ryerson.ca/~courses/coe838/ Dr. Gul N. Khan http://www.ee.ryerson.ca/~gnkhan Electrical and Computer Engineering Ryerson University Overview SoC

More information

EnVinci Endoscopy with PC Comfort

EnVinci Endoscopy with PC Comfort FP GA DSP ARM EnVinci Endoscopy with PC Comfort EnVinci integrates fast FPGA- and DSP based video image processing PC-like graphic user interfaces and applications directly integrated data interfaces Fast

More information

Day & Night 1080P HD IR IP Camera

Day & Night 1080P HD IR IP Camera Day & Night 1080P HD IR IP Camera Features Full HD 2 megapixel CMOS image sensor True H.264 AVC/MPEG-4 part 10 real-time video compression H.264 and Motion JPEG multi-profile video streaming 3D noise reduction

More information

IEEE802.11a Based Wireless AV Module(WAVM) with Digital AV Interface. Outline

IEEE802.11a Based Wireless AV Module(WAVM) with Digital AV Interface. Outline IEEE802.11a Based Wireless AV Module() with Digital AV Interface TOSHIBA Corp. T.Wakutsu, N.Shibuya, E.Kamagata, T.Matsumoto, Y.Nagahori, T.Sakamoto, Y.Unekawa, K.Tagami, M.Serizawa Outline Background

More information

PRODUCT GUIDE CEL5500 LIGHT ENGINE. World Leader in DLP Light Exploration. A TyRex Technology Family Company

PRODUCT GUIDE CEL5500 LIGHT ENGINE. World Leader in DLP Light Exploration. A TyRex Technology Family Company A TyRex Technology Family Company CEL5500 LIGHT ENGINE PRODUCT GUIDE World Leader in DLP Light Exploration Digital Light Innovations (512) 617-4700 dlinnovations.com CEL5500 Light Engine The CEL5500 Compact

More information

microenable IV AD4-LVDS Product Profile of microenable IV AD4-LVDS Datasheet microenable IV AD4-LVDS

microenable IV AD4-LVDS Product Profile of microenable IV AD4-LVDS Datasheet microenable IV AD4-LVDS i Product Profile of Scalable, intelligent frame grabber for highest requirements on image acquisition and preprocessing High LVDS compliance combined with modern technology PCIe technology Modern software

More information

Certus TM Silicon Debug: Don t Prototype Without It by Doug Amos, Mentor Graphics

Certus TM Silicon Debug: Don t Prototype Without It by Doug Amos, Mentor Graphics Certus TM Silicon Debug: Don t Prototype Without It by Doug Amos, Mentor Graphics FPGA PROTOTYPE RUNNING NOW WHAT? Well done team; we ve managed to get 100 s of millions of gates of FPGA-hostile RTL running

More information

DT3130 Series for Machine Vision

DT3130 Series for Machine Vision Compatible Windows Software DT Vision Foundry GLOBAL LAB /2 DT3130 Series for Machine Vision Simultaneous Frame Grabber Boards for the Key Features Contains the functionality of up to three frame grabbers

More information

microenable IV AS1-PoCL Product Profile of microenable IV AS1-PoCL Datasheet microenable IV AS1-PoCL

microenable IV AS1-PoCL Product Profile of microenable IV AS1-PoCL   Datasheet microenable IV AS1-PoCL i Product Profile of Scalable, intelligent frame grabber for image acquisition and OEM projects Single channel, Base format frame grabber PoCL SafePower Broad camera support No camera file needed Image

More information

LEARNING AUDIO SHEET MUSIC CORRESPONDENCES. Matthias Dorfer Department of Computational Perception

LEARNING AUDIO SHEET MUSIC CORRESPONDENCES. Matthias Dorfer Department of Computational Perception LEARNING AUDIO SHEET MUSIC CORRESPONDENCES Matthias Dorfer Department of Computational Perception Short Introduction... I am a PhD Candidate in the Department of Computational Perception at Johannes Kepler

More information

IP Video driving more Users & Uses

IP Video driving more Users & Uses EMEA Distributor Conference 2010 IP Video driving more Users & Uses E. Anthony Incorvati Business Development - Transportation North America Axis milestones Channel sales business model Network access

More information

Linux+Zephyr: IoT made easy

Linux+Zephyr: IoT made easy Linux+Zephyr: IoT made easy IoT Explodes Everywhere Sensors and actuators embedded in physical objects and linked through wired and wireless networks, often using the same Internet Protocol (IP) that connects

More information

HD Network Video Recorder Workstation

HD Network Video Recorder Workstation HD Network Video Recorder Workstation Avigilon s end-to-end surveillance solutions deliver image detail no other system can match. Avigilon Control Center software, featuring High Deinition Stream Management

More information

Design Challenge of a QuadHDTV Video Decoder

Design Challenge of a QuadHDTV Video Decoder Design Challenge of a QuadHDTV Video Decoder Youn-Long Lin Department of Computer Science National Tsing Hua University MPSOC27, Japan More Pixels YLLIN NTHU-CS 2 NHK Proposes UHD TV Broadcast Super HiVision

More information

Multicore Design Considerations

Multicore Design Considerations Multicore Design Considerations Multicore: The Forefront of Computing Technology We re not going to have faster processors. Instead, making software run faster in the future will mean using parallel programming

More information

Transparent low-overhead checkpoint for GPU-accelerated clusters

Transparent low-overhead checkpoint for GPU-accelerated clusters Transparent low-overhead checkpoint for GPU-accelerated clusters Leonardo BAUTISTA GOMEZ 1,3, Akira NUKADA 1, Naoya MARUYAMA 1, Franck CAPPELLO 3,4, Satoshi MATSUOKA 1,2 1 Tokyo Institute of Technology,

More information

SEMICONDUCTOR TECHNOLOGY -CMOS-

SEMICONDUCTOR TECHNOLOGY -CMOS- SEMICONDUCTOR TECHNOLOGY -CMOS- Fire Tom Wada 2011/12/19 1 What is semiconductor and LSIs Huge number of transistors can be integrated in a small Si chip. The size of the chip is roughly the size of nails.

More information

ni.com Digital Signal Processing for Every Application

ni.com Digital Signal Processing for Every Application Digital Signal Processing for Every Application Digital Signal Processing is Everywhere High-Volume Image Processing Production Test Structural Sound Health and Vibration Monitoring RF WiMAX, and Microwave

More information

Frame Processing Time Deviations in Video Processors

Frame Processing Time Deviations in Video Processors Tensilica White Paper Frame Processing Time Deviations in Video Processors May, 2008 1 Executive Summary Chips are increasingly made with processor designs licensed as semiconductor IP (intellectual property).

More information

On the Rules of Low-Power Design

On the Rules of Low-Power Design On the Rules of Low-Power Design (and How to Break Them) Prof. Todd Austin Advanced Computer Architecture Lab University of Michigan austin@umich.edu Once upon a time 1 Rules of Low-Power Design P = acv

More information

Future of Analog Design and Upcoming Challenges in Nanometer CMOS

Future of Analog Design and Upcoming Challenges in Nanometer CMOS Future of Analog Design and Upcoming Challenges in Nanometer CMOS Greg Taylor VLSI Design 2010 Outline Introduction Logic processing trends Analog design trends Analog design challenge Approaches Conclusion

More information

Data Converters and DSPs Getting Closer to Sensors

Data Converters and DSPs Getting Closer to Sensors Data Converters and DSPs Getting Closer to Sensors As the data converters used in military applications must operate faster and at greater resolution, the digital domain is moving closer to the antenna/sensor

More information

New Technologies: 4G/LTE, IOTs & OTTS WORKSHOP

New Technologies: 4G/LTE, IOTs & OTTS WORKSHOP New Technologies: 4G/LTE, IOTs & OTTS WORKSHOP EACO Title: LTE, IOTs & OTTS Date: 13 th -17 th May 2019 Duration: 5 days Location: Kampala, Uganda Course Description: This Course is designed to: Give an

More information

Powerful Software Tools and Methods to Accelerate Test Program Development A Test Systems Strategies, Inc. (TSSI) White Paper.

Powerful Software Tools and Methods to Accelerate Test Program Development A Test Systems Strategies, Inc. (TSSI) White Paper. Powerful Software Tools and Methods to Accelerate Test Program Development A Test Systems Strategies, Inc. (TSSI) White Paper Abstract Test costs have now risen to as much as 50 percent of the total manufacturing

More information

Performance Driven Reliable Link Design for Network on Chips

Performance Driven Reliable Link Design for Network on Chips Performance Driven Reliable Link Design for Network on Chips Rutuparna Tamhankar Srinivasan Murali Prof. Giovanni De Micheli Stanford University Outline Introduction Objective Logic design and implementation

More information

ArcticLink III VX6 Solution Platform Data Sheet

ArcticLink III VX6 Solution Platform Data Sheet ArcticLink III VX6 Solution Platform Data Sheet Dual Output High Definition Visual Enhancement Engine (VEE HD+) and Display Power Optimizer (DPO HD+) Solution Platform Highlights High Definition Visual

More information

Embedded System Design

Embedded System Design Embedded System Design Stephen A. Edwards Columbia University Spring 2013 Spot the Computer Embedded Systems: Ubiquitous Computers iphone Laser Keyboard Nikon D300 Video Watch GPS Playstation 3 PC Keyboard

More information

AE16 DIGITAL AUDIO WORKSTATIONS

AE16 DIGITAL AUDIO WORKSTATIONS AE16 DIGITAL AUDIO WORKSTATIONS 1. Storage Requirements In a conventional linear PCM system without data compression the data rate (bits/sec) from one channel of digital audio will depend on the sampling

More information

LOW POWER DIGITAL EQUALIZATION FOR HIGH SPEED SERDES. Masum Hossain University of Alberta

LOW POWER DIGITAL EQUALIZATION FOR HIGH SPEED SERDES. Masum Hossain University of Alberta LOW POWER DIGITAL EQUALIZATION FOR HIGH SPEED SERDES Masum Hossain University of Alberta 0 Outline Why ADC-Based receiver? Challenges in ADC-based receiver ADC-DSP based Receiver Reducing impact of Quantization

More information

Sundance Multiprocessor Technology Limited. Capture Demo For Intech Unit / Module Number: C Hong. EVP6472 Intech Demo. Abstract

Sundance Multiprocessor Technology Limited. Capture Demo For Intech Unit / Module Number: C Hong. EVP6472 Intech Demo. Abstract Sundance Multiprocessor Technology Limited EVP6472 Intech Demo Unit / Module Description: Capture Demo For Intech Unit / Module Number: EVP6472-SMT909 Document Issue Number 1.1 Issue Data: 25th Augest

More information

L12: Reconfigurable Logic Architectures

L12: Reconfigurable Logic Architectures L12: Reconfigurable Logic Architectures Acknowledgements: Materials in this lecture are courtesy of the following sources and are used with permission. Frank Honore Prof. Randy Katz (Unified Microelectronics

More information

Understanding Compression Technologies for HD and Megapixel Surveillance

Understanding Compression Technologies for HD and Megapixel Surveillance When the security industry began the transition from using VHS tapes to hard disks for video surveillance storage, the question of how to compress and store video became a top consideration for video surveillance

More information

Sequential Logic. Introduction to Computer Yung-Yu Chuang

Sequential Logic. Introduction to Computer Yung-Yu Chuang Sequential Logic Introduction to Computer Yung-Yu Chuang with slides by Sedgewick & Wayne (introcs.cs.princeton.edu), Nisan & Schocken (www.nand2tetris.org) and Harris & Harris (DDCA) Review of Combinational

More information

Set-Top Box Video Quality Test Solution

Set-Top Box Video Quality Test Solution Specification Set-Top Box Video Quality Test Solution An Integrated Test Solution for IPTV Set-Top Boxes (over DSL) In the highly competitive telecom market, providing a high-quality video service is crucial

More information

Sundance Multiprocessor Technology Limited. Capture Demo For Intech Unit / Module Number: C Hong. EVP6472 Intech Demo. Abstract

Sundance Multiprocessor Technology Limited. Capture Demo For Intech Unit / Module Number: C Hong. EVP6472 Intech Demo. Abstract Sundance Multiprocessor Technology Limited EVP6472 Intech Demo Unit / Module Description: Capture Demo For Intech Unit / Module Number: EVP6472-SMT949 Document Issue Number 1.1 Issue Data: 27th April 2012

More information

SEMICONDUCTOR TECHNOLOGY -CMOS-

SEMICONDUCTOR TECHNOLOGY -CMOS- SEMICONDUCTOR TECHNOLOGY -CMOS- Fire Tom Wada What is semiconductor and LSIs Huge number of transistors can be integrated in a small Si chip. The size of the chip is roughly the size of nails. Currently,

More information

Based on slides/material by. Topic 14. Testing. Testing. Logic Verification. Recommended Reading:

Based on slides/material by. Topic 14. Testing. Testing. Logic Verification. Recommended Reading: Based on slides/material by Topic 4 Testing Peter Y. K. Cheung Department of Electrical & Electronic Engineering Imperial College London!! K. Masselos http://cas.ee.ic.ac.uk/~kostas!! J. Rabaey http://bwrc.eecs.berkeley.edu/classes/icbook/instructors.html

More information

Altera's 28-nm FPGAs Optimized for Broadcast Video Applications

Altera's 28-nm FPGAs Optimized for Broadcast Video Applications Altera's 28-nm FPGAs Optimized for Broadcast Video Applications WP-01163-1.0 White Paper This paper describes how Altera s 40-nm and 28-nm FPGAs are tailored to help deliver highly-integrated, HD studio

More information

1 Terasic Inc. D8M-GPIO User Manual

1  Terasic Inc. D8M-GPIO User Manual 1 Chapter 1 D8M Development Kit... 4 1.1 Package Contents... 4 1.2 D8M System CD... 5 1.3 Assemble the Camera... 5 1.4 Getting Help... 6 Chapter 2 Introduction of the D8M Board... 7 2.1 Features... 7 2.2

More information

32 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 45, NO. 1, JANUARY 2010

32 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 45, NO. 1, JANUARY 2010 32 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 45, NO. 1, JANUARY 2010 A 201.4 GOPS 496 mw Real-Time Multi-Object Recognition Processor With Bio-Inspired Neural Perception Engine Joo-Young Kim, Student

More information

VRT Radio Transport for SDR Architectures

VRT Radio Transport for SDR Architectures VRT Radio Transport for SDR Architectures Robert Normoyle, DRS Signal Solutions Paul Mesibov, Pentek Inc. Agenda VITA Radio Transport (VRT) standard for digitized IF DRS-SS VRT implementation in SDR RF

More information