Workload Prediction and Dynamic Voltage Scaling for MPEG Decoding

Similar documents
On Complexity Modeling of H.264/AVC Video Decoding and Its Application for Energy Efficient Decoding

Chapter 10 Basic Video Compression Techniques

COMP 249 Advanced Distributed Systems Multimedia Networking. Video Compression Standards

The H.26L Video Coding Project

The H.263+ Video Coding Standard: Complexity and Performance

A low-power portable H.264/AVC decoder using elastic pipeline

17 October About H.265/HEVC. Things you should know about the new encoding.

Video compression principles. Color Space Conversion. Sub-sampling of Chrominance Information. Video: moving pictures and the terms frame and

Motion Video Compression

AUDIOVISUAL COMMUNICATION

Joint Optimization of Source-Channel Video Coding Using the H.264/AVC encoder and FEC Codes. Digital Signal and Image Processing Lab

Frame-Based Dynamic Voltage and Frequency Scaling for a MPEG Decoder

Chapter 2 Introduction to

A Low-Power 0.7-V H p Video Decoder

Key Techniques of Bit Rate Reduction for H.264 Streams

Analysis of MPEG-2 Video Streams

Implementation of an MPEG Codec on the Tilera TM 64 Processor

Multicore Design Considerations

Frame Processing Time Deviations in Video Processors

A HIGH THROUGHPUT CABAC ALGORITHM USING SYNTAX ELEMENT PARTITIONING. Vivienne Sze Anantha P. Chandrakasan 2009 ICIP Cairo, Egypt

Contents. xv xxi xxiii xxiv. 1 Introduction 1 References 4

Introduction to Video Compression Techniques. Slides courtesy of Tay Vaughan Making Multimedia Work

FLEXIBLE SWITCHING AND EDITING OF MPEG-2 VIDEO BITSTREAMS

MPEG-2. ISO/IEC (or ITU-T H.262)

Digital Image Processing

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

MPEGTool: An X Window Based MPEG Encoder and Statistics Tool 1

Video Compression. Representations. Multimedia Systems and Applications. Analog Video Representations. Digitizing. Digital Video Block Structure

Video coding standards

Power Reduction via Macroblock Prioritization for Power Aware H.264 Video Applications

Understanding Compression Technologies for HD and Megapixel Surveillance

Multimedia Communications. Video compression

Improvement of MPEG-2 Compression by Position-Dependent Encoding

A High-Performance Parallel CAVLC Encoder on a Fine-Grained Many-core System

Multimedia Communications. Image and Video compression

Performance Evaluation of Error Resilience Techniques in H.264/AVC Standard

Video Processing Applications Image and Video Processing Dr. Anil Kokaram

Low Power MPEG Video Player Using Dynamic Voltage Scaling

Design and Implementation of a Simulator for Measuring the Quality of Service for Distributed Multimedia Applications

Constant Bit Rate for Video Streaming Over Packet Switching Networks

How to Manage Video Frame- Processing Time Deviations in ASIC and SOC Video Processors

SUMMIT LAW GROUP PLLC 315 FIFTH AVENUE SOUTH, SUITE 1000 SEATTLE, WASHINGTON Telephone: (206) Fax: (206)

Frame-Based Dynamic Voltage and Frequency Scaling for a MPEG Decoder

The Multistandard Full Hd Video-Codec Engine On Low Power Devices

COMP 9519: Tutorial 1

A look at the MPEG video coding standard for variable bit rate video transmission 1

SVC Uncovered W H I T E P A P E R. A short primer on the basics of Scalable Video Coding and its benefits

THE CAPABILITY of real-time transmission of video over

Visual Communication at Limited Colour Display Capability

Bit Rate Control for Video Transmission Over Wireless Networks

Implementation of MPEG-2 Trick Modes

Minimax Disappointment Video Broadcasting

Video 1 Video October 16, 2001

Relative frequency. I Frames P Frames B Frames No. of cells

1022 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 19, NO. 4, APRIL 2010

Principles of Video Compression

Selective Intra Prediction Mode Decision for H.264/AVC Encoders

Digital Video Telemetry System

Modeling and Evaluating Feedback-Based Error Control for Video Transfer

Analysis of a Two Step MPEG Video System

Part1 박찬솔. Audio overview Video overview Video encoding 2/47

Overview: Video Coding Standards

OL_H264e HDTV H.264/AVC Baseline Video Encoder Rev 1.0. General Description. Applications. Features

H.261: A Standard for VideoConferencing Applications. Nimrod Peleg Update: Nov. 2003

H.264/AVC Baseline Profile Decoder Complexity Analysis

MPEG decoder Case. K.A. Vissers UC Berkeley Chamleon Systems Inc. and Pieter van der Wolf. Philips Research Eindhoven, The Netherlands

RATE-REDUCTION TRANSCODING DESIGN FOR WIRELESS VIDEO STREAMING

An Overview of Video Coding Algorithms

06 Video. Multimedia Systems. Video Standards, Compression, Post Production

AN IMPROVED ERROR CONCEALMENT STRATEGY DRIVEN BY SCENE MOTION PROPERTIES FOR H.264/AVC DECODERS

VVD: VCR operations for Video on Demand

ITU-T Video Coding Standards

Reduced complexity MPEG2 video post-processing for HD display

Multiview Video Coding

PACKET-SWITCHED networks have become ubiquitous

ATSC vs NTSC Spectrum. ATSC 8VSB Data Framing

SCALABLE video coding (SVC) is currently being developed

Real-Time Parallel MPEG-2 Decoding in Software

Low-Power Techniques for Video Decoding. Daniel Frederic Finchelstein

Lossless Compression Algorithms for Direct- Write Lithography Systems

In MPEG, two-dimensional spatial frequency analysis is performed using the Discrete Cosine Transform

A Real-Time MPEG Software Decoder

Fast MBAFF/PAFF Motion Estimation and Mode Decision Scheme for H.264

PAL uncompressed. 768x576 pixels per frame. 31 MB per second 1.85 GB per minute. x 3 bytes per pixel (24 bit colour) x 25 frames per second

Dual Frame Video Encoding with Feedback

1. INTRODUCTION. Index Terms Video Transcoding, Video Streaming, Frame skipping, Interpolation frame, Decoder, Encoder.

Video Transmission. Thomas Wiegand: Digital Image Communication Video Transmission 1. Transmission of Hybrid Coded Video. Channel Encoder.

Buffering strategies and Bandwidth renegotiation for MPEG video streams

ABSTRACT ERROR CONCEALMENT TECHNIQUES IN H.264/AVC, FOR VIDEO TRANSMISSION OVER WIRELESS NETWORK. Vineeth Shetty Kolkeri, M.S.

A Novel Macroblock-Level Filtering Upsampling Architecture for H.264/AVC Scalable Extension

UC San Diego UC San Diego Previously Published Works

Research Topic. Error Concealment Techniques in H.264/AVC for Wireless Video Transmission in Mobile Networks

Analysis of Video Transmission over Lossy Channels

Fast Mode Decision Algorithm for Intra prediction in H.264/AVC Video Coding

Midterm Review. Yao Wang Polytechnic University, Brooklyn, NY11201

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video

PERCEPTUAL QUALITY OF H.264/AVC DEBLOCKING FILTER

MPEG-2. Lecture Special Topics in Signal Processing. Multimedia Communications: Coding, Systems, and Networking

Monitoring video quality inside a network

Transcription:

Workload Prediction and Dynamic Voltage Scaling for MPEG Decoding Ying Tan, Parth Malani, Qinru Qiu, Qing Wu Dept. of Electrical & Computer Engineering State University of New York at Binghamton

Outline Introduction Background on MPEG decoding Proposed workload prediction and DVFS techniques for software MPEG decoding Experimental results Conclusions

Dynamic Voltage/Frequency Scaling Using DVFS with buffer reduces the energy even more Borrow or steal processing time from adjacent tasks But latency and hardware complexity also increases Input Buffer processor Output Buffer V dd v Deadline V dd Deadline V dd Deadline v 0.75v v/2 T 2T t 1.5T Without DVFS E 1 = C L *V 2* f*(1.5t) T 2T t Without Buffer E = C L *(f*v 2* T+f/2*V 2 /4*T) = 0.75E 1 1.33T 2T With Buffer E = C L *0.75f*(0.75) 2 V*2T 0.56E 1 t

MPEG-Frame Types Video stream: a sequence of still images (frames) I-frames (intra-coded frames) do not depend on any other frame P-frames (predictive coded frames) are encoded using past I or P frame as a reference B-frames (bi-directionally predictive coded frames) use both past and future I or P frames as references I B B P B B P

MPEG-Layered Structure Sequence Group of Pictures (GOP) Block Block Picture (Frame) Slice Macroblock Block Luminance Block Block Block Chrominance A GOP is an independently decodable unit that begins with an I-frame A macroblock is a 1616 pixel area image A block is a 88 pixel area of image which carries only luminance or chrominance information Macroblocks can be divided into four types frame I P B MB I P B Bi

Workload in MPEG Decoding The number of instructions to perform one IDCT or motion compensation is almost a constant for a given processor Only need to count the number of IDCT and motion compensation IDCT only IDCT+FW FW only IDCT+BW BW only IDCT+Bi Bi only Skipped IDCT and motion estimation is done at block level Blocks are divided into 8 different types Decoding time of each type of block is assumed to be a constant I P B Bi

Workload Prediction Our workload predictor is a linear model Variables M1~M8 represent the number of 8 different types of blocks The information could be obtained from the macroblock header Variable M9 represents the frame size Coefficients are obtained using linear regression analysis frame _ decode _ time i w i M i = w + 0 1 9

Comparison with Existing Predictor Berkeley MPEG decoder 2 x 1 0-3 1. 8 running on Pentium IV 2.6GHz 1. 6 1. 4 processor 1. 2 1 Frame_Type_Len: moving 0. 8 0. 6 average of previous decoding 0. 4 0. 2 0. 2 0. 4 0. 6 0. 8 1 1. 2 1. 4 1. 6 1. 8 time combined with frame size A c t u a l d e c o d e t i m e ( s e c ) Frame_Type_Len P r e d i c te d d e c o d e ti m e b y fr a m e a v g a p p r o a c h ( s e c ) x 1 0-3 90 80 70 60 50 40 30 20 10 0 bobo flower hakinnen red % Absolute Error al_smash canyon hubble airwolf2 ski frame_type_len Our Approach Blazer lion lion-1 Average A c t u a l D e c o d e T i m e ( s e c ) 1 6 x 1 0-4 1 4 1 2 1 0 8 6 4 Our Approach 2 2 4 6 8 1 0 1 2 1 4 P r e d i c te d d e c o d e ti m e b y o u r m o d e l ( s e c ) x 1 0-4

Assumptions Optimal DVFS Continuous frequency/voltage scaling Negligible switching cost Input and display at a constant rate whose period is T The optimal DVFS is to decode every frame continuously without any pause in nt time at a constant frequency and voltage, where n is the total number of frames in a video stream Does not consider arrival time and display deadline These constraints can be met by adding input/output buffers and increasing the latency Must have the workload information of the entire stream Lowest energy, however, highest buffer requirement

GOP-Optimal DVFS Buffers all the frames in a GOP and decodes the entire GOP using a constant voltage On-line heuristic of Optimal DVFS Does not consider the frame incoming time and display deadline In the worst case the input buffer needs to be 2 GOP long

Global Grouping Divide the time into n intervals D 1 ~D n based on display deadline Consecutive intervals (D i,d i+1 )~(D k-1, D k ) will be grouped together if we can find a constant voltage/frequency such that the processor can decode frame i~k continuously before their deadline without pausing

Global Grouping The processor is running at a steady speed within the time slots in a group; The complexity of global grouping is O(n 2 ) The global grouping is an off-line algorithm since it requires the workload information for the entire stream More suitable for the movie clips that are played repeatedly It has minimal energy dissipation while meeting the deadline if all the frames are available at the beginning

Dynamic Grouping Buffers the input frames up to a certain window size at the beginning, applies the global_grouping within the window When a new frame with workload x comes in, (avg_load is the average workload for the last group in current window) if x < avg_load, make it an individual group if x = avg_load, merge it into the last group if x > avg_load, merge it into the last group i, and recalculate the average workload for each group The dynamic grouping is an on-line heuristic of global grouping. It gives better trade off between energy and buffer size

Characteristics of MPEG Clips MPEG Clips Name Index Frame Type # of Frames GOP Size hakkinen 1 I,P,B 799 12 bobo 2 I,P,B 679 90 ski 3 I,P,B 1513 15 blazer 4 I,P,B 2998 12 wg 5 I,P 130 6

Experimental Results Energy DVFS using feedback control A controller is used to adjusts the decoder s speed to keep a constant occupancy of the buffer between the decoder and the display 25 20 15 Perfect workload prediction. Decode time = nt 10 5 0 1 2 3 4 5 Feedback GOP Optimal Dynamic Grouping Global Grouping

100 90 80 70 60 50 40 30 20 10 0 Experimental Results - Buffers Perfect workload prediction. Decode time = nt Output Buffer 1 2 3 4 5 DVFS GOP-Opt Dyn-Group Global-Group Optimal Input Buffer Input Buffer 2GOP 1GOP Output_buffer±1 Output_buffer±1 Feedback GOP Optimal Dynamic Grouping Global Grouping Optimal

Summary The proposed workload prediction model utilizes the block level statistics of each MPEG frame and gives highly accurate prediction results Proposed DVFS techniques give good energy reduction, less buffer usage and work robustly with our predictor