AN MPEG-4 BASED HIGH DEFINITION VTR

Similar documents
HD Portable Digital Video Recorder SRW-1. HD Video Processor SRPC-1

Motion Video Compression

INTERNATIONAL STANDARD

The following references and the references contained therein are normative.

Serial Digital Interface

06 Video. Multimedia Systems. Video Standards, Compression, Post Production

HEVC/H.265 CODEC SYSTEM AND TRANSMISSION EXPERIMENTS AIMED AT 8K BROADCASTING

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Introduction to image compression

PAL uncompressed. 768x576 pixels per frame. 31 MB per second 1.85 GB per minute. x 3 bytes per pixel (24 bit colour) x 25 frames per second

MULTIMEDIA TECHNOLOGIES

Technical requirements for the reception of TV programs, with the exception of news and public affairs programs Effective as of 1 st January, 2018

Chapter 3 Fundamental Concepts in Video. 3.1 Types of Video Signals 3.2 Analog Video 3.3 Digital Video

EXTENDED RECORDING CAPABILITIES IN THE EOS C300 MARK II

Table of contents. 2. Mechanical Specifications. 2-1 Tape Cassette Helical Recordings Video Signal Processing...6

Audio and Video II. Video signal +Color systems Motion estimation Video compression standards +H.261 +MPEG-1, MPEG-2, MPEG-4, MPEG- 7, and MPEG-21

Implementation of an MPEG Codec on the Tilera TM 64 Processor

Alpha channel A channel in an image or movie clip that controls the opacity regions of the image.

TECHNICAL SPECIFICATIONS FOR THE DELIVERY OF CLOSE TO TRANSMISSION TELEVISION PROGRAMMES TO THE

Digital Media. Daniel Fuller ITEC 2110

Video Compression. Representations. Multimedia Systems and Applications. Analog Video Representations. Digitizing. Digital Video Block Structure

By David Acker, Broadcast Pix Hardware Engineering Vice President, and SMPTE Fellow Bob Lamm, Broadcast Pix Product Specialist

Information Transmission Chapter 3, image and video

To discuss. Types of video signals Analog Video Digital Video. Multimedia Computing (CSIT 410) 2

TECHNICAL MEDIA SPECIFICATION ON THE FILE BASED SUBMISSION OF MATERIALS TO BE AIRED

INTERNATIONAL STANDARD

SRW-5000/5500/5800 HD Digital Videocassette Recorder

OVE EDFORS ELECTRICAL AND INFORMATION TECHNOLOGY

Lecture 2 Video Formation and Representation

Content storage architectures

4K UHDTV: What s Real for 2014 and Where Will We Be by 2016? Matthew Goldman Senior Vice President TV Compression Technology Ericsson

Digital Video Editing

MPEG-2. ISO/IEC (or ITU-T H.262)

Part1 박찬솔. Audio overview Video overview Video encoding 2/47

New Technologies for Premium Events Contribution over High-capacity IP Networks. By Gunnar Nessa, Appear TV December 13, 2017

Ch. 1: Audio/Image/Video Fundamentals Multimedia Systems. School of Electrical Engineering and Computer Science Oregon State University

CHAPTER 1 High Definition A Multi-Format Video

PROPOSED SMPTE STANDARD

New forms of video compression

CUFPOS402A. Information Technology for Production. Week Three: Video and Film Production Format (SD, HD, 2k/4k, 16mm, 35mm and Stereoscopic 3D)

Film Grain Technology

Universal Format Converter Implementation

Digital Video Cassette Recorder DNW-75

Will Widescreen (16:9) Work Over Cable? Ralph W. Brown

Video Basics. Video Resolution

Video Disk Recorder DSR-DR1000

Video compression principles. Color Space Conversion. Sub-sampling of Chrominance Information. Video: moving pictures and the terms frame and

Media Delivery Technical Specifications for VMN US Network Operations

for Television Data Structure for DV-Based Audio, Data and Compressed Video 25 and 50 Mb/s

ELEC 691X/498X Broadcast Signal Transmission Fall 2015

An Overview of Video Coding Algorithms

Cisco D9894 HD/SD AVC Low Delay Contribution Decoder

Images and Formats. Dave Bancroft. Philips Broadcast Film Imaging

RECOMMENDATION ITU-R BT.1203 *

Digital Video Cassette Player DNW-65

Transitioning from NTSC (analog) to HD Digital Video

1 Scope. 2 Introduction. 3 References MISB STD STANDARD. 9 June Inserting Time Stamps and Metadata in High Definition Uncompressed Video

Implementation of MPEG-2 Trick Modes

DVS Whitepaper. DVS and JPEG2000: Hydra OEM board

Verifying Digitized Files

MPEGTool: An X Window Based MPEG Encoder and Statistics Tool 1

High Efficiency Video coding Master Class. Matthew Goldman Senior Vice President TV Compression Technology Ericsson

REAL-WORLD LIVE 4K ULTRA HD BROADCASTING WITH HIGH DYNAMIC RANGE

NTSC/PAL. Network Interface Board for MPEG IMX TM. VTRs BKMW-E2000 TM

A White Paper on High Frame Rates from the EDCF Technical Support Group

Standard Definition. Commercial File Delivery. Technical Specifications

for File Format for Digital Moving- Picture Exchange (DPX)

DRAFT. Sign Language Video Encoding for Digital Cinema

So far. Chapter 4 Color spaces Chapter 3 image representations. Bitmap grayscale. 1/21/09 CSE 40373/60373: Multimedia Systems

Transmission System for ISDB-S

TECHNICAL STANDARDS FOR DELIVERY OF TELEVISION PROGRAMMES TO

DVB-UHD in TS

Graduate Institute of Electronics Engineering, NTU Digital Video Recorder

EBU R The use of DV compression with a sampling raster of 4:2:0 for professional acquisition. Status: Technical Recommendation

Digital HD Videocassette Recorder HVR-M10U

Signum BlackHive. Generation II. Broadcast Production System and video server. The new system generation signum.blackhive

HD Digital Videocassette Recorder SRW-5000/5500

White Paper Lower Costs in Broadcasting Applications With Integration Using FPGAs

APTN TECHNICAL PROGRAM DELIVERY STANDARDS

MISB ST STANDARD. Time Stamping and Metadata Transport in High Definition Uncompressed Motion Imagery. 27 February Scope.

HD Digital Videocassette Recorder SRW-5000

RECOMMENDATION ITU-R BT * Video coding for digital terrestrial television broadcasting

Satellite Digital Broadcasting Systems

AWS-750. Anycast Touch portable live content producer. Overview

Rounding Considerations SDTV-HDTV YCbCr Transforms 4:4:4 to 4:2:2 YCbCr Conversion

TR 038 SUBJECTIVE EVALUATION OF HYBRID LOG GAMMA (HLG) FOR HDR AND SDR DISTRIBUTION

SUMMIT LAW GROUP PLLC 315 FIFTH AVENUE SOUTH, SUITE 1000 SEATTLE, WASHINGTON Telephone: (206) Fax: (206)

HVR-M35E HVR-M25AE HVR-M15AE

DELIVERY SPECIFICATIONS. TAPE and FILE DELIVERY

This document is a preview generated by EVS

Understanding Multimedia - Basics

Microwave PSU Broadcast DvB Streaming Network

Delivery of Spots to

Advanced Computer Networks

How to deliver TV commercials to French TV channels digitally with IMD

TECHNOLOGY. Workflow Guidelines. Recommended Practice for Shooting with the ARRIFLEX D-21

Analog and Digital Video Basics

Jupiter PixelNet. The distributed display wall system. infocus.com

Technical specifications Television - Montreal

MSW-M2000/M2000P MSW-A2000/A2000P

Transcription:

AN MPEG-4 BASED HIGH DEFINITION VTR R. Lewis Sony Professional Solutions Europe, UK ABSTRACT The subject of this paper is an advanced tape format designed especially for Digital Cinema production and post production, ultra high quality shooting, blue and green screen shooting. It is also suitable for final on air programme delivery and interchange of HDTV. This paper will concentrate on the coding techniques used, the multi format capability demanded by users, and the implementation of a format converter to allow easy integration into a multi standard world. A key development is the implementation of an MPEG 4 Studio Profile chipset. With 10 bit recording at HD resolutions in either 4:2:2 (YCbCr) or full chroma bandwidth 4:4:4 (RGB) at up to 60 frames per second, the VTR is also backwards compatible with the two most widely used tape formats in the HD and SD world. The packing density on tape is well over 300 Mbits per square inch nearly 5 times that of the Type D6 format, and a portable version, which can record at double the net video data rate, that is 880 Mbps allowing a full 444 (RGB) recording at only 2:1 compression. This format has been proposed to SMPTE for standardisation. INTRODUCTION Two types of machine have been developed using MPEG-4 coding, a studio machine with full editing capabilities including pre-read, individual editing of the 12 x 24-bit audio channels and so on. Then a portable (battery operated) machine is available, and uniquely can support double tape speed operation allowing extended recording capability. MULTI FORMAT The internal structure of the VTR has been designed to allow recording and playback at the frame rates specified in SMPTE-274M as indicated in table 1. Normal tape speed The first thing to notice is that both 4:2:2 and 4:4:4 sampling are possible. When the tape speed is normal, 4:2:2 sampling up to 30PsF is compressed at 2.7:1, using a combination of MPEG-4 Studio Profile DCT and DPCM encoding. When recording 4:4:4 images up to 30PsF, the 4:4:4 SQ (Standard Quality) mode is used, at a compression rate of 4:1. Also 720 / 50 / 59.94P according to SMPTE-296M can be recorded in this mode.

Sampling 4:2:2 Resolution Compression Ratio 1920 x 1080 1/2.7 Tape Speed Scan Mode Interlace 50, 59.94, 60 Picture Rate PsF 23.98, 24, 25, 29.97, 30 1280 x 720 1/2.4 Normal Progressive 50, 59.94 4:4:4 SQ 1920 x 1080 1/4 4:4:4 HQ 1920 x 1080 1/2 4:2:2 3D 1920 x 1080 1/2.7 4:2:2 1920 x 1080 1/2.7 Double tape speed Double Interlace 50, 59.94, 60 PsF 23.98, 24, 25, 29.97, 30 Interlace 50, 59.94, 60 PsF 23.98, 24, 25, 29.97, 30 Interlace 50, 59.94, 60 PsF 23.98, 24, 25, 29.97, 30 Progressive 59.94, 60 Table 1: The Current and Future Multiformat Capabilities of the new VTR A second mode of operation is possible requiring double the normal tape speed. This functionality is incorporated only on the portable recorder, and allows several variations of recording use. Firstly 4:4:4 recording can be made in HQ (High Quality) mode up to 30PsF, yielding a compression ratio of 2:1. Then, 4:2:2 can be recorded at 60P with a compression of 2.7:1. Finally a so-called 3D mode is possible, where two entirely separate 4:2:2 sources at up to 30PsF can be recorded on one tape, ensuring perfect synchronism. Both channels can be played back separately by the studio machine, by employing a 200% variable speed mode. FORMAT CONVERTER Because of the enormous variety of signal formats that are allowed within the various SMPTE documents, including standard definition, a format converter has been developed to allow easy interchange among them. The basic VTR itself can handle all of the 9 different frame and field rates by changing the linear tape speed, thus avoiding the need for frame based standards conversion. In addition a 4:2:2 60P recorded tape can be played back at half speed (30PsF mode) providing 50% slow down. In all cases the tape footprint is the same. This naturally changes the programme length, but is a common feature with telecine s and so is well understood. The basic requirements are as follows 2:3 pull down for conversion from 23.98PsF to 59.94i (NTSC) 1080i and PsF conversion to 720P and vice versa 1080i and PsF conversion to 525i or 625i and vice versa 4:2:2 conversion to 4:4:4 and vice versa A combination of the above. A combination could be an original 4:4:4 tape at 23.98PsF converted to 4:2:2 then adding 2:3 pull down and finally converting to 525i standard definition for example for transmission or off

line editing. Playback Tape Format HD-SDI OUT SD OUT Format Converter OUT ----- 1080/444/23.98PsF MPEG-4 VTR or SMPTE Type D11 1080/422 23.98PsF 23.98PsF? 525/59.94i 1080/422/59.94i 1080/444/59.94i 24PsF 24PsF ----- ----- 1080/444/24PsF 1080/422/60i 1080/444/60i 25PsF 1080/422 25PsF 625/50i 1080/444/25PsF 29.97PsF 29.97PsF 525/59.94i 1080/444/29.97PsF 30PsF 30PsF ----- 1080/444/30PsF 50i 50i 625/50i 1080/444/50i 59.94i 59.94i 525/59.94i 1080/444/59.94i 60i 60i ----- 1080/444/60i ----- 1080/422/23.98PsF 23.98PsF 23.98PsF? 525/59.94i 1080/422/59.94i ----- 1080/422/24PsF 24PsF 24PsF ----- 1080/422/60i 25PsF 25PsF 625/50i 1080/422/25PsF MPEG-4 VTR 1080/444 29.97PsF 1080/444 29.97PsF 525/59.94i 1080/422/29.97PsF 30PsF 30PsF ----- 1080/422/30PsF 50i 50i 625/50i 1080/422/50i 59.94i 59.94i 525/59.94i 1080/422/59.94i 60i 60i ----- 1080/422/60i 1080/422/59.94i MPEG-4 VTR 720/422 59.94P 720/422 59.94P 525/59.94i 1080/444/59.94i IEC Format: 525 59.94i 59.94i 525/59.94i 1080/422 Digital L 625 50i 50i 625/50i ----- Table 2: The output Capabilities of the Studio Machine Note: The SD output marked? requires the format converter to be installed Note: IEC Format: Digital L requires a processor board to be installed

Table 2 shows the capabilities of the studio machine. Note that other popular ½ inch HD (SMPTE type D-11) and SD (IEC Format: Digital L) tape format playback is also supported. CODEC The codec is the key to the flexibility of the many different formats supported by this development. It has the following attributes. Compliant with the MPEG-4 Simple Studio Profile coding tools o DCT, DPCM and VLC are compliant Shuffling and Rate control are unique for this VTR Multi chip combination is possible in 1929 x 1080 as follows o 1 chip can do 4:2:2 30PsF o 1 chip can do 1280 x 720 / 50 / 59,94P o 2 chips can do 4:4:4 30PsF, 4:2:2 30PsF 3D mode and 4:2:2 60P o 4 chips can do 4:4:4 60P The sync block structure is compatible between 4:4:4 compression at 2.7:1 and 4:1 3 lines per field are assigned to uncompressed 10 bit data words (meta data) Playback compatibility: 4:2:2 60P tape can play back at 4:2:2 24, 25 and 30PsF PHYSICAL LAYOUT AND SEGMENTS Figure 1 shows the basic footprint as it is laid down on tape. Each segment is composed of 6 track pairs, so a frame at 4:2:2 30PsF equates to 24 tracks. Picture segmentation Each 4:2:2 PsF 1920 x 1080 picture is first reconstituted into a progressive 1920 x 1080 frame, then each frame is divided into 8160 16x16 shuffle blocks for luminance and two cosited 8160 8x16 blocks for chrominance. In the case of 4:4:4 PsF, there are three 8160 16x16 blocks for each of RGB. In the case of interlace signals, each field is treated as an independent 1920 x 540 field, and is divided into 4080 16x16 blocks for luminance and two 4080 8x16 blocks for chrominance. An example for 4:2:2 PsF is shown in figure 2.

Segment=0 Segment=1 Segment=2 Segment=3 Vd1 Sector UL= 1 ( Upper sector ) A8 2 A8 2 A8 2 A8 2 Tape Direction 1 0 1 0 1 0 1 0 Audio Sector 2 A8 2 A8 2 A8 2 A8 1 1 1 1 0 0 0 0 UL= 0 ( Lower sector ) ST Sector Vd0 Sector Head Scan Direction TR: 0 1 2 3 4 5 0 1 2 3 4 5 12 Tracks Record Unit ( Frame or Frame Pair ) 2 Record Unit Figure 1: Record unit, Segment, Channel and Track Pair Counts Each 120 x 68 shuffle block is then divided into 4 shuffle sets, each set containing 2040 shuffle blocks. Finally each shuffle block in a shuffle set are allocated to a unique macro block in one of 40 macro block units. For a 1920 x 1080 picture there are 204 macro blocks within a macro block unit (204 macro blocks x 40 macro block units = 8160 original sync blocks), as shown in figure 3. The actual data assigned to a macro block and then to a macro block unit is assigned by a pseudo-random equation depending on the block number and allocation size.

1920 samples 960 samples 960 samples 1080 lines 1920*1080 Y samples 960*1080 C B samples 960*1080 C R samples 1088 lines 8 Y Extension C B Extension C R Extension 120 blocks 120 blocks 120 blocks 68 blocks 8160 16*16 Y blocks 8160 8*16 C B blocks 8160 8*16 C R blocks Figure 2: 1920 1080/PsF 4:2:2 YC B C R Shuffle Blocks Half 1920x1088 Extended Picture: 1st Coded Sequence Data Half 1920x1088 Extended Picture: 2nd Coded Sequence Data MBU number in 1st Coding Channel 0 1 2 19 20 21 22 39 Macro Blocks 0 1 2 3 203 4:2:2 YC B C R samples 16*16 Y samples 8*16 C B samples 8*16 C R samples Figure 3: 1920 1080 4:2:2 YCbCr Macro Block Unit Number Allocation Thus we now have 40 macro block units of shuffled picture data. These now pass to the DCT and DPCM processes for MPEG-4 encoding. SEGMENTS Segments are used to define how the compressed data stream and twelve AES3 audio data streams are mapped to the helical tracks.

Frame field V0 V1 V2 A0 Segment Audio 12ch V3 Figure 4: 4:2:2 60i or 4:4:4 SQ 60i Segment Mapping Figure 4 shows the segment mapping for normal 4:2:2 or 4:4:4 SQ recordings, where 2 segments comprise a frame. Figure 5 shows the mapping for a 4:4:4 HQ mode recording, where the frame is extended to 4 segments. The overall frame timing remains the same (1/30 th second) but the segment duration is reduced. In the same way, it can be seen that in 4:2:2 60P mode the segment timing would be reduced to 1/120 th of a second, so that 2 segments equal 1/60 th second. In this way, each segment can process one 4:2:2 or 4:4:4 SQ signal at 440 Mbps as a net video data rate. Each segment also has it s own error correction. So various formats in the future can be achieved simply by parallel processing of this one segment unit. field Frame V0 A0 Segment A0 V1 Figure 5: 4:4:4 HQ 2:1 Compression Segment Mapping Lch-Frame Rch-Frame field L-V0 L/R-0 Segment L-V1 L/R-1 R-V0 L/R-0 R-V1 L/R-1 Figure 6: 4:2:2 3D Mode Segment Mapping Figure 6 shows the unique 4:2:2 3D mode segment mapping, the left and right channels are recorded alternately on the tape. The portable machine can playback both the Left and Right

channels simultaneously, while the studio machine can playback either channel separately by employing a 200% variable speed mode. The left and right channels both record the same audio data to ensure normal speed audio playback even if the tape is played back using the 200% variable mode on the studio machine. As previously mentioned, each segment has its own error correction, the outer correction being extremely powerful with 12 bytes of correction for every 114 byte of data, see figure 7 below. In addition 3 outer tables are recorded on each physical track, and these outer tables are shuffled amongst the 12 tracks comprising a record unit. The net result is that 1 whole track can be missed, and the data can be perfectly recreated. Inner Column Rank 2 2 226 16 ID SYNC 114 DATA INNER 36 tables / Segment 12 OUTER Figure 7: Video Segment Error Correction Table The audio data is corrected in exactly the same manner except that there are 8 bytes of outer correction for every 8 bytes of data, in addition to the 24 audio tables recorded per field (2 tables per channel), which offers further physical 100% redundancy. Figure 5 also shows that the audio is written twice that is A0 is written twice, once associated with V0 and once with V1. This means that in any condition that the video is skipped, for example single channel playback of a tape recorded in 3D mode or a 60P tape played back at 24, 25 or 30PsF, there will still be continuous audio. SUMMARY The development of this equipment has required many new developments, however one of the key ideas is the use of segments, as this allows the flexibility to multiply up simple processing units to accommodate both current and future derivatives of existing formats. In particular this will allow the portable unit, in the very near future, to record a variable frame rate from 1 FPS to 60 FPS with no external processing required to recover the data by deleting frames. The quality of the full bandwidth 10-bit interface combined with the efficiency of MPEG-4 coding, realise a machine suitable for intensive post production techniques, and tasks such as matting, that have previously only been possible with non-portable hard disk based recorders.

Acknowledgments: Thanks to my development colleagues in Sony Atsugi, Japan and the Sony UK Research Laboratories who developed the Codec Chipset, for all their help.