OL_H264e HDTV H.264/AVC Baseline Video Encoder Rev 1.0

General Description

The OL_H264e core is a hardware implementation of the H.264 baseline video compression algorithm. The core accepts video input up to the highest HDTV resolution and outputs the encoded bitstream. It is a simple, fully synchronous design with a low gate count.

Applications

- Digital video recorders.
- Wireless video devices.
- Video surveillance systems.
- Handheld HDTV video cameras.

Features

- Fully compatible with the ITU-T H.264 baseline specification.
- Proven in FPGA: VGA (640x480) at 30 fps on a VirtexII-4 demo board.
- Profile level 4.1; can be decoded by a Main Profile decoder.
- Supports up to the highest HDTV video resolution (1920x1080 @ 30 fps progressive).
- Very low operating frequency: from ~1.5 MHz for QCIF @ 15 fps to ~250 MHz for 1920x1080 @ 30 fps.
- Single-core HDTV support in FPGA: 720p (1280x720) at 30 fps in high-end FPGAs (Virtex4).
- 4CIF (704x576) at 30 fps in low-end FPGAs (Spartan3-4, slowest speed grade).
- No CPU required for encoding.
- Variable Bit Rate (VBR) and Constant Bit Rate (CBR).
- Very low latency in VBR (~1.1 ms for VGA @ 30 fps).
- Motion vectors from -16.00 to +15.75 pixels (32x32 pixel search area, down to quarter-pixel accuracy).
- Support for most intra4x4 modes and all intra16x16 modes.
- Multiple-slice support for better error resilience.
- Block-skipping logic for lower bitrate.
- Deblocking filter for better quality.
- External memory interface tolerant of high latencies and delays, ideal in a SoC or on a bus shared with a CPU.
- The memory interface can be clocked at a different frequency from the core for easier integration.
- Supports YUV 4:2:0 video input.
- Minimum clock speed = 4 x the raw pixel clock speed.
- Low gate count: from 129 Kgates plus 100 Kbits of RAM for real-time VGA encoding to 178 Kgates plus 100 Kbits of RAM for real-time 1080p encoding.
- Simple, fully synchronous design.
- Available as a fully functional and synthesizable VHDL or Verilog soft core.

Ocean Logic Pty Ltd
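The feature "Minimum clock speed = 4 x the raw pixel clock speed" can be turned into a quick sizing calculation. The sketch below is illustrative only: the function name min_core_clock_hz and the pad_to_macroblocks option are hypothetical, and rounding each frame dimension up to a whole number of 16x16 macroblocks is an assumption rather than something the datasheet states. The rounding is included because it gives about 250.7 MHz for 1920x1080 @ 30 fps, close to the ~250.6 MHz listed in Table 1, whereas the unrounded product gives about 248.8 MHz.

```python
CYCLES_PER_PIXEL = 4  # from the feature list: min clock = 4 x raw pixel clock
MB_SIZE = 16          # H.264 macroblocks cover 16x16 luma pixels


def round_up_to_mb(n, mb_size=MB_SIZE):
    """Round a frame dimension up to the next multiple of the macroblock size."""
    return (n + mb_size - 1) // mb_size * mb_size


def min_core_clock_hz(width, height, fps, pad_to_macroblocks=True):
    """Estimate the minimum core clock (Hz) for a width x height @ fps stream.

    pad_to_macroblocks is an assumption: partial macroblocks at the frame
    edge are counted as full ones.
    """
    if pad_to_macroblocks:
        width = round_up_to_mb(width)
        height = round_up_to_mb(height)
    return CYCLES_PER_PIXEL * width * height * fps


if __name__ == "__main__":
    formats = [
        ("QCIF @ 15 fps", 176, 144, 15),
        ("VGA @ 30 fps", 640, 480, 30),
        ("1280x720 @ 30 fps", 1280, 720, 30),
        ("1920x1080 @ 30 fps", 1920, 1080, 30),
    ]
    for name, w, h, fps in formats:
        print(f"{name:20s} {min_core_clock_hz(w, h, fps) / 1e6:8.2f} MHz")
```

For 720p, VGA and the CIF-family formats both dimensions are already multiples of 16, so the padding only changes the 1080p result.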

Functional Description

The OL_H264e core is a hardware implementation of the H.264 baseline video compression algorithm, designed to process HDTV progressive video up to 1920x1080 at 30 fps. Each block of 16x16 pixels is processed in just 1024 cycles, which means that each pixel is processed in just 4 cycles. Consequently, given an uncompressed video stream of resolution X by Y and frame rate fps, the minimum clock frequency needed to process such a stream is:

F = 4 * X * Y * fps

This allows the core to process the video stream at relatively low clock frequencies. For example, HDTV video of 1920x1080 @ 30 fps requires ~250 MHz, whereas VGA video of 640x480 @ 30 fps requires ~37 MHz. The table below summarizes the relationship between some possible video resolutions and frame rates and the clock frequency of the core.

Resolution            Core freq.
QCIF @ 15 fps         ~1.5 MHz
CIF @ 30 fps          ~12.1 MHz
VGA @ 30 fps          ~36.8 MHz
1280x720 @ 30 fps     ~110.5 MHz
1920x1080 @ 30 fps    ~250.6 MHz

Table 1 Core frequency versus video resolution and frame rate.

A block diagram of the core is shown in Figure 1. For each block, the intra prediction unit generates a suitable prediction. The intra prediction unit supports most intra4x4 modes and all intra16x16 modes. For P-frames, the motion estimation unit generates a prediction as well. It examines an area of 32x32 pixels down to quarter-pixel accuracy (motion vectors from -16.00 to +15.75). The quarter-pixel prediction is generated using the tap filters described in the ITU-T specification. The prediction from each unit is costed using Lagrange multipliers and the best one is selected for encoding.

The residual is calculated as the difference between the current block and the prediction. It is then transformed and quantized before being encoded by the lossless encoding unit. The transformed, quantized residual is also used to reconstruct a reference frame, which will be used during the encoding of future P-frames. This is achieved by inverse quantization and inverse transform of the residual, which is then added back to the prediction. Finally, the reconstructed frame is filtered before being stored back in memory.
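The mode decision described above, in which each candidate prediction is costed using Lagrange multipliers, is commonly written as J = D + lambda * R, where D is the distortion of the prediction and R is the estimated number of bits needed to code it. The sketch below assumes that form. The datasheet does not say which distortion measure, rate estimate or lambda value the core uses, and the Candidate type, the function names and all the numbers are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    name: str          # prediction source, e.g. "intra16x16" or "inter"
    distortion: float  # mismatch between block and prediction (made-up values)
    rate_bits: float   # estimated bits for mode, vectors and residual (made-up)


def rd_cost(candidate, lam):
    """Lagrangian cost J = D + lambda * R (assumed form, see lead-in)."""
    return candidate.distortion + lam * candidate.rate_bits


def select_mode(candidates, lam):
    """Return the candidate with the lowest Lagrangian cost."""
    return min(candidates, key=lambda c: rd_cost(c, lam))


if __name__ == "__main__":
    candidates = [
        Candidate("intra16x16", distortion=900, rate_bits=40),
        Candidate("intra4x4", distortion=600, rate_bits=120),
        Candidate("inter", distortion=500, rate_bits=160),
    ]
    # A larger lambda penalises bits more heavily, favouring cheap modes.
    for lam in (1.0, 4.0):
        best = select_mode(candidates, lam)
        print(f"lambda={lam}: {best.name} (J={rd_cost(best, lam)})")
```

With these made-up numbers, lambda = 1 selects the inter prediction (lowest distortion), while lambda = 4 selects intra16x16 (fewest bits), which illustrates how the multiplier trades quality against bitrate.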

Figure 1 The OL_H264e block diagram. (Blocks shown: Video Input, T, Q, Lossless Compression, NAL, Motion Estimation/Compensation, Intra Prediction, Q^-1, T^-1, Deblocking Filter, Memory Interface, External Memory.)

Advantages of the core

Some of the key advantages of the core are discussed below.

HDTV support

The core is designed to support up to the highest HDTV resolution, 1920x1080 @ 30 fps progressive. This opens up a whole new range of applications, from high-end video camcorders to high-resolution video surveillance, at very low cost.

Low gate count

As can be seen in the Performance section below, the core is designed very efficiently, with a low gate count. This allows 4CIF (704x576) video @ 30 fps to be processed by low-end FPGAs at the slowest speed grade, and 720p (1280x720) video @ 30 fps by high-end FPGAs. Real-time HDTV 720p encoding is therefore possible in FPGAs without instantiating multiple cores.

No external CPU required

The core can encode video independently, without the support of an additional CPU. This represents a large cost saving compared to solutions that require an external CPU.

Both VBR and CBR supported

The core supports both VBR (Variable Bit Rate) and CBR (Constant Bit Rate). This allows maximum flexibility for the designer.

Flexible memory interface

The core requires access to an external memory via a 32-bit data bus. The core actually uses about 50% of the theoretical bandwidth of the memory. This interface is designed to be independent of the type of memory used (e.g. DDR, SDRAM, SRAM). More importantly, the memory interface is designed to tolerate the high and unpredictable latencies and delays that are typical of a shared memory (e.g. AMBA and/or a SoC where the bus is shared with a CPU or other cores).

In addition, the memory interface can run at a different clock speed from the rest of the core. This simplifies the integration process and can save gates, because the core does not have to be synthesized for a much higher frequency just to be synchronous with the local bus. This allows, for example, a core running at 37 MHz (encoding VGA @ 30 fps) to be easily integrated in a SoC sharing a 200 MHz bus with a processor.

Error resilience

The core supports multiple slices. This is useful in environments prone to data transmission problems (e.g. mobile phones or other wireless applications), because it limits the damage that transmission errors inflict on the image.

Low data rate features

The core supports two important features for low data rates: the deblocking filter and macroblock skipping. The deblocking filter especially improves the visual quality of the decoded image at low bitrates, where high quantisation noise produces unappealing blockiness. Macroblock skipping greatly reduces the bitrate with minimal effect on the visual quality of the decoded image.

Performance

Performance figures of the OL_H264e core implemented in some particular technologies are shown in the table below.
Technology                                     Approx. Area                                 Speed     Video Throughput
0.13 um LV, 0.9 V, 125 C, optimised for speed  178 Kgates, 106 Kbits RAM                    ~250 MHz  1920x1080 (1080p) @ 30 fps
0.18 um slow process, optimised for area       129 Kgates, 106 Kbits RAM                    ~50 MHz   4CIF (704x576) @ 30 fps
StratixII                                      17511 ALUTs, 5 M512, 51 M4K, 3 DSPs          ~118 MHz  1280x720 (720p) @ 32 fps
Virtex4-12                                     10,500 slices, 3 multipliers, 33 RAM blocks  ~110 MHz  1280x720 (720p) @ 30 fps
Spartan3-4                                     10,500 slices, 3 multipliers, 33 RAM blocks  ~50 MHz   4CIF (704x576) @ 30 fps

Table 2 Performance of the OL_H264e core.

The gate counts above do not include the CBR logic and the deblocking filter. Final results will be published shortly; the gate count is not expected to increase by more than 10 Kgates.
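The throughput figures in Table 2 can be cross-checked against the 1024 cycles per 16x16 macroblock quoted in the Functional Description. The sketch below estimates the highest frame rate that a given core clock can sustain. The function names are hypothetical, and counting partial macroblocks at the frame edge as whole ones is an assumption, not something the datasheet states.

```python
CYCLES_PER_MACROBLOCK = 1024  # from the Functional Description
MB_SIZE = 16                  # macroblock width and height in pixels


def macroblocks_per_frame(width, height, mb_size=MB_SIZE):
    """Number of macroblocks in a frame, rounding partial ones up (assumption)."""
    cols = (width + mb_size - 1) // mb_size
    rows = (height + mb_size - 1) // mb_size
    return cols * rows


def max_frame_rate(clock_hz, width, height):
    """Highest frame rate (fps) the core could sustain at clock_hz."""
    cycles_per_frame = macroblocks_per_frame(width, height) * CYCLES_PER_MACROBLOCK
    return clock_hz / cycles_per_frame


if __name__ == "__main__":
    rows = [
        ("StratixII, ~118 MHz, 720p", 118e6, 1280, 720),
        ("Spartan3-4, ~50 MHz, 4CIF", 50e6, 704, 576),
    ]
    for label, clock, w, h in rows:
        print(f"{label}: {max_frame_rate(clock, w, h):.2f} fps")
```

Under these assumptions, ~118 MHz gives roughly 32 fps at 1280x720, in line with the StratixII row, and ~50 MHz gives a little under 31 fps at 4CIF.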

Summary

The combination of low gate count, low operating frequency, and full HDTV resolution support makes this core an application-enabling technology. Its applications range from low-power wireless applications at relatively low resolution, such as mobile phones, to HDTV handheld recorders and video surveillance cameras. The very small area of the core also allows novel applications such as direct integration onto a CMOS sensor. This would create an extremely compact intelligent sensor that accepts light directly at its input and outputs an H.264 bitstream.

Deliverables

- Synthesizable VHDL or Verilog RTL.
- Bit-accurate C model.
- Complete HDL testbench.
- Complete data sheet.

Ocean Logic Pty Ltd
PO BOX 768 - Manly NSW 1655 Australia
Fax: 61-2-90120979
E-Mail: contact@ocean-logic.com
URL: http://www.ocean-logic.com/