OL_H264MCLD Multi-Channel HDTV H.264/AVC Limited Baseline Video Decoder V1.0. General Description. Applications. Features

Similar documents
OL_H264e HDTV H.264/AVC Baseline Video Encoder Rev 1.0. General Description. Applications. Features

Chapter 2 Introduction to

The Multistandard Full Hd Video-Codec Engine On Low Power Devices

Frame Processing Time Deviations in Video Processors

A Novel Macroblock-Level Filtering Upsampling Architecture for H.264/AVC Scalable Extension

An Overview of Video Coding Algorithms

Introduction to Video Compression Techniques. Slides courtesy of Tay Vaughan Making Multimedia Work

A Low-Power 0.7-V H p Video Decoder

Video coding standards

Motion Compensation Hardware Accelerator Architecture for H.264/AVC

Motion Video Compression

H.264/AVC Baseline Profile Decoder Complexity Analysis

Research Topic. Error Concealment Techniques in H.264/AVC for Wireless Video Transmission in Mobile Networks

The H.26L Video Coding Project

A Low Power Implementation of H.264 Adaptive Deblocking Filter Algorithm

AUDIOVISUAL COMMUNICATION

The H.263+ Video Coding Standard: Complexity and Performance

Audio and Video II. Video signal +Color systems Motion estimation Video compression standards +H.261 +MPEG-1, MPEG-2, MPEG-4, MPEG- 7, and MPEG-21

COMP 249 Advanced Distributed Systems Multimedia Networking. Video Compression Standards

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Overview: Video Coding Standards

Video Transmission. Thomas Wiegand: Digital Image Communication Video Transmission 1. Transmission of Hybrid Coded Video. Channel Encoder.

Contents. xv xxi xxiii xxiv. 1 Introduction 1 References 4

A Study on AVS-M video standard

Block Diagram. deint_mode. line_width. log2_line_width. field_polarity. mem_start_addr0. mem_start_addr1. mem_burst_size.

How to Manage Video Frame- Processing Time Deviations in ASIC and SOC Video Processors

17 October About H.265/HEVC. Things you should know about the new encoding.

Video Coding IPR Issues

P1: OTA/XYZ P2: ABC c01 JWBK457-Richardson March 22, :45 Printer Name: Yet to Come

Error concealment techniques in H.264 video transmission over wireless networks

International Journal for Research in Applied Science & Engineering Technology (IJRASET) Motion Compensation Techniques Adopted In HEVC

H.264/AVC. The emerging. standard. Ralf Schäfer, Thomas Wiegand and Heiko Schwarz Heinrich Hertz Institute, Berlin, Germany

Video Compression. Representations. Multimedia Systems and Applications. Analog Video Representations. Digitizing. Digital Video Block Structure

Video Over Mobile Networks

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Multicore Design Considerations

STUDY OF AVS CHINA PART 7 JIBEN PROFILE FOR MOBILE APPLICATIONS

Multimedia Communications. Video compression

Visual Communication at Limited Colour Display Capability

A Novel Study on Data Rate by the Video Transmission for Teleoperated Road Vehicles

Performance Evaluation of Error Resilience Techniques in H.264/AVC Standard

Implementation of an MPEG Codec on the Tilera TM 64 Processor

Study of AVS China Part 7 for Mobile Applications. By Jay Mehta EE 5359 Multimedia Processing Spring 2010

Film Grain Technology

Design Challenge of a QuadHDTV Video Decoder

Hardware Decoding Architecture for H.264/AVC Digital Video Standard

Mauricio Álvarez-Mesa ; Chi Ching Chi ; Ben Juurlink ; Valeri George ; Thomas Schierl Parallel video decoding in the emerging HEVC standard

FINAL REPORT PERFORMANCE ANALYSIS OF AVS-M AND ITS APPLICATION IN MOBILE ENVIRONMENT

A High-Performance Parallel CAVLC Encoder on a Fine-Grained Many-core System

Advanced Computer Networks

SUMMIT LAW GROUP PLLC 315 FIFTH AVENUE SOUTH, SUITE 1000 SEATTLE, WASHINGTON Telephone: (206) Fax: (206)

Principles of Video Compression

Block Diagram. 16/24/32 etc. pixin pixin_sof pixin_val. Supports 300 MHz+ operation on basic FPGA devices 2 Memory Read/Write Arbiter SYSTEM SIGNALS

A parallel HEVC encoder scheme based on Multi-core platform Shu Jun1,2,3,a, Hu Dong1,2,3,b

Multiview Video Coding

Serial FIR Filter. A Brief Study in DSP. ECE448 Spring 2011 Tuesday Section 15 points 3/8/2011 GEORGE MASON UNIVERSITY.

On Complexity Modeling of H.264/AVC Video Decoding and Its Application for Energy Efficient Decoding

A low-power portable H.264/AVC decoder using elastic pipeline

High Efficiency Video coding Master Class. Matthew Goldman Senior Vice President TV Compression Technology Ericsson

Multimedia Communications. Image and Video compression

ITU-T Video Coding Standards

Hardware Implementation for the HEVC Fractional Motion Estimation Targeting Real-Time and Low-Energy

Digital Video Telemetry System

Joint Optimization of Source-Channel Video Coding Using the H.264/AVC encoder and FEC Codes. Digital Signal and Image Processing Lab

Interframe Bus Encoding Technique and Architecture for MPEG-4 AVC/H.264 Video Compression

Video Compression - From Concepts to the H.264/AVC Standard

Low Power H.264 Deblocking Filter Hardware Implementations

FPGA Laboratory Assignment 4. Due Date: 06/11/2012

A RANDOM CONSTRAINED MOVIE VERSUS A RANDOM UNCONSTRAINED MOVIE APPLIED TO THE FUNCTIONAL VERIFICATION OF AN MPEG4 DECODER DESIGN

Block Diagram. pixin. pixin_field. pixin_vsync. pixin_hsync. pixin_val. pixin_rdy. pixels_per_line. lines_per_field. pixels_per_line [11:0]

Novel VLSI Architecture for Quantization and Variable Length Coding for H-264/AVC Video Compression Standard

MULTIMEDIA TECHNOLOGIES

A video signal processor for motioncompensated field-rate upconversion in consumer television

Commsonic. Satellite FEC Decoder CMS0077. Contact information

Verification Methodology for a Complex System-on-a-Chip

A High Performance Deblocking Filter Hardware for High Efficiency Video Coding

Video compression principles. Color Space Conversion. Sub-sampling of Chrominance Information. Video: moving pictures and the terms frame and

Using the VideoEdge IP Encoder with Intellex IP

THE new video coding standard H.264/AVC [1] significantly

Video 1 Video October 16, 2001

PERCEPTUAL QUALITY OF H.264/AVC DEBLOCKING FILTER

Bridging the Gap Between CBR and VBR for H264 Standard

MULTI-STATE VIDEO CODING WITH SIDE INFORMATION. Sila Ekmekci Flierl, Thomas Sikora

A video signal consists of a time sequence of images. Typical frame rates are 24, 25, 30, 50 and 60 images per seconds.

WITH the demand of higher video quality, lower bit

MPEG-2. ISO/IEC (or ITU-T H.262)

4 H.264 Compression: Understanding Profiles and Levels

In MPEG, two-dimensional spatial frequency analysis is performed using the Discrete Cosine Transform

Taos - A Revolutionary Zero Latency, Multi-Channel, High-Definition H.264 Video Codec Architecture

Motion Re-estimation for MPEG-2 to MPEG-4 Simple Profile Transcoding. Abstract. I. Introduction

Parallel Implementation of Sample Adaptive Offset Filtering Block for Low-Power HEVC Chip. Luis A. Fernández Lara

1. INTRODUCTION. Index Terms Video Transcoding, Video Streaming, Frame skipping, Interpolation frame, Decoder, Encoder.

H.261: A Standard for VideoConferencing Applications. Nimrod Peleg Update: Nov. 2003

A HIGH THROUGHPUT CABAC ALGORITHM USING SYNTAX ELEMENT PARTITIONING. Vivienne Sze Anantha P. Chandrakasan 2009 ICIP Cairo, Egypt

FPGA Prototyping using Behavioral Synthesis for Improving Video Processing Algorithm and FHD TV SoC Design Masaru Takahashi

complex than coding of interlaced data. This is a significant component of the reduced complexity of AVS coding.

Data Converters and DSPs Getting Closer to Sensors

Into the Depths: The Technical Details Behind AV1. Nathan Egge Mile High Video Workshop 2018 July 31, 2018

Alain Legault Hardent. Create Higher Resolution Displays With VESA Display Stream Compression

AN-ENG-001. Using the AVR32 SoC for real-time video applications. Written by Matteo Vit, Approved by Andrea Marson, VERSION: 1.0.0

Transcription:

OL_H264MCLD Multi-Channel HDTV H.264/AVC Limited Baseline Video Decoder V1.0 General Description Applications Features The OL_H264MCLD core is a hardware implementation of the H.264 baseline video compression algorithm. The core decodes a bitstream produced by the OLH264e encoder and produces a video stream up to the highest HDTV resolution. Simple, fully synchronous design with low gate count. Digital video recorders. Video wireless devices. Video surveillance systems. Hand held HDTV video cameras. Fully compatible with the output of the OL_H264MCE encoder core. Up to Profile level 4.1 can be decoded. Supports up to the highest HDTV video resolution (1920x1080 @ 30 fps progressive). Very low operational frequency : from ~1.5 MHz for QCIF @ 15 fps to ~250 MHz for 1920x1080 @ 30 fps. Direct support for both progressive and interlaced video. Single core HDTV support in FPGA : 720p (1280x720) at 30 fps in high end FPGAs (Virtex4). 4 CIF (704x576) at 30 fps in low end FPGAs. No CPU required for decoding. Very low latency decoding Motion vector up to 32.00/+31.75 pixels. Support for most of intra4x4 and all intra16x16 modes. Multiple slices support for better error resilience. Block skipping logic for lower bitrate. Deblocking filter for better quality. External memory interface tolerant of high latencies and delays, ideal in a SoC system or in a shared bus with a CPU. The memory interface can be clocked at a different frequency from the core for easier integration. Supports YUV 4:2:0 video output. Min Clock speed = 4 x the raw pixel clock speed. Simple, fully synchronous design. Available as fully functional and synthesizable VHDL or Verilog soft-core. Ocean Logic Pty Ltd 1

Functional Description The OL_H264MCLD core is a hardware implementation of the H.264 baseline video compression algorithm designed to decode a bitstream created by the OL_H264e core and output progressive video up to HDTV 1920x1080 at 30 fps. Each block of 16x16 pixels is processed in just 1024 cycles. This means that each pixel is processed in just 4 cycles. Consequently, given an target video stream of resolution X by Y, and frame rate fps, the minimum clock frequency to decode a such bitstream is: F = 4*X*Y*fps This allows the core to process the video stream at relatively low clock frequencies. For example, HDTV video of 1920x1080 @ 30 fps requires ~250 MHz, whereas VGA video of 640x480 @ 30 fps requires ~37 MHz. The table below summarizes the relationship between some possible video resolutions and frame rates and the clock frequency of the core. Resolution QCIF @ 15 fps CIF @ 30fps VGA @ 30fps 1280x720 @ 30fps 1920x1080 @ 30fps Core freq. ~1.5 MHz ~12.1 MHz ~36.8 MHz ~110.5 MHz ~250.6 MHz Table 1 Core frequency versus video resolution and frame rate. A block diagram of the core is shown in Figure 1. The NAL decoding unit parses the incoming bitstream extracting the block residual information as well as other syntax elements. The block residual information is inverse quantized and transformed before being added to the prediction data. Such prediction data is generated by either the intra or inter prediction unit, according to the decoded syntax elements. The intra prediction unit is capable of generating a prediction for all intra16x16 modes as well as the supported intra4x4 modes. The inter prediction unit can generate a prediction using a single motion vector down to ¼ pixel. The subpixels are generated according to the interpolation filters specified in the ITU-T H.264 baseline specification. Each macroblock is optionally filtered, according to the information extracted from the bitstream, before being output. A reconstructed macroblock is also stored in the external memory in order to be subsequently used by the inter prediction unit. Ocean Logic Pty Ltd 2

NAL NAL Decoding + + -1 T Q -1 Deblocking Filter Inter Prediction Intra Prediction Memory Interface External Memory Figure 1 The OL_H264MCLD block diagram. Advantages of the core Some of the key advantages of the core are discussed further below: HDTV support The core is designed to support up to the highest HDTV resolution, 1920x1080 @ 30 fps progressive. This opens a whole new range of applications from high-end video camcorders to high-resolution video surveillance at very low cost. Low gate count The core is designed very efficiently, with a low gate count. This allows 4CIF (704x576) video @ 30 fps to be decoded by low end FPGAs at the slowest speed grade as well as 720p (1280x720) video @ 30 fps in high end FPGAs. Multi-channel support The pixel processing capability of the core can be shared among multiple video channels (up to 32). Each video channel can have its own resolution and switching from one channel to the other will happen on the frame boundary. For example, at 250 MHz, up to 6 D1 (720x480) channels can be decoded simultaneously at 30 fps or up to 20 CIF (352x288) channels at 30 fps or a combination of both. This allows for a very flexible decoding environment where multiple channels with different resolutions and frame rates are decoded by the same small core. Progressive and interlaced video support The core can support both progressive and interlaced video, for maximum flexibility. No external CPU required The core can decode video independently, without the support of an additional CPU. This represents a large cost saving compared to solutions that require an external CPU. Ocean Logic Pty Ltd 3

Flexible memory interface The core requires access to an external memory via a 32-bit data bus. About 50% of the whole theoretical bandwidth of the memory is actually used by the core. This interface is designed to be independent from the memory used (i.e. DDR, SDRAM, SRAM, etc.). More importantly, the memory interface is designed to tolerate high and unpredictable latencies and delays that are typical of a shared memory (i.e. AMBA and/or SoC where the bus is shared with a CPU or other cores). In addition to this the memory interface can run at a different clock speed from the rest of the core. This simplifies the integration process and can save gates by not forcing the core to be synthetised to a much higher frequency just to be synchronous with the local bus. This allows, for example, a core running at 37 MHz (decoding VGA @ 30 fps) to be easily integrated in a SoC sharing a 200 MHz bus with a processor. Error resilience The core supports multiple slices. This is useful in environments prone to data transmission problems (i.e. mobile phone or other wireless applications) in order to limit the damage inflicted to the image by transmission errors. Low data rate features The core supports two important features for low data rates: deblocking filter and macroblock skipping. The deblocking filter especially improves the visual quality of the decoded image at low bitrates where the high quantisation noise produces unappealing blockiness in an image. Macroblock skipping greatly reduces the bitrate with minimal effect on the visual quality of the decoded image. Performances Performance figures of the OL_H264MCLD core implemented with some particular technologies are shown in the table below. All the features listed above are included in the gate count. Technology Approx Area Speed Video Throughput 0.13 u LV 70 Kgates + 79 Kbits RAM ~ 250 MHz 1920x1080 (1080p) @ 30 fps 0.9V, 125 C Optimised for speed StratixII 6949 ALUTs + 1 M512 + 45 M4K + ~113 MHz 1280x720 (720p) @ > 30 fps 1 DSPs Virtex4-12 4100 slices + 1 multipliers + 21 RAM blocks ~110 MHz 1280x720 (720p) @ 30 fps Summary Table 2 Performance of the OL_H264MCLD core. The combination of low gate count, low operating frequency, and full HDTV resolution support makes this core an application-enabling technology. The applications of this core range from low power wireless application at relatively low resolution such as mobile phones to HDTV handheld recorders and video surveillance cameras. Ocean Logic Pty Ltd 4

Deliverables Synthesizable VHDL or Verilog RTL. Bit accurate C model. Complete HDL testbench. Complete data sheet. Ocean Logic Pty Ltd PO BOX 768 - Manly NSW 1655 Australia Fax: +61-2-90120979 E-Mail: contact@ocean-logic.com URL : http://www.ocean-logic.com/ Ocean Logic Pty Ltd 5