Version 0.5 (3/6/2012 4:08:00 a3/p3 :: application _r010.doc) Warning

Similar documents
Version 0.5 (9/7/2011 4:18:00 a9/p9 :: application v2.doc) Warning

35PM-FCD-ST app-2e Sony Pictures Notes doc. Warning

ISO INTERNATIONAL STANDARD. Digital cinema (D-cinema) packaging Part 4: MXF JPEG 2000 application

Proposed SMPTE Standard SMPTE 425M-2005 SMPTE STANDARD- 3Gb/s Signal/Data Serial Interface Source Image Format Mapping.

TWD SPECIFICATION Interoperable Master Format Broadcast & Online IMF Application Constraints - ProRes

Interoperable Master Format Application DPP (ProRes)

SMPTE STANDARD Gb/s Signal/Data Serial Interface. Proposed SMPTE Standard for Television SMPTE 424M Date: < > TP Rev 0

AMWA Draft Document. AS-07 MXF Archive and Preservation Format. DRAFT FOR COMMENT September 4, Disclaimer

Video System Characteristics of AVC in the ATSC Digital Television System

PROPOSED SMPTE STANDARD

Copyright 2016 AMWA. Licensed under a Creative Commons Attribution-Share Alike 4.0 International License. (CC BY-SA 4.0)

for File Format for Digital Moving- Picture Exchange (DPX)

Implementation of 24P, 25P and 30P Segmented Frames for Production Format

Request for Comments: 5119 Category: Informational February 2008

The following references and the references contained therein are normative.

SDTV 1 DigitalSignal/Data - Serial Digital Interface

D-BOX in SMPTE/DCI DCP

MISB ST STANDARD. Time Stamping and Metadata Transport in High Definition Uncompressed Motion Imagery. 27 February Scope.

ATSC Candidate Standard: Captions and Subtitles (A/343)

Progressive Image Sample Structure Analog and Digital Representation and Analog Interface

Erratum Spec 1.0 Page Sections Affected Description. Trusted Environment. Reel n+1... Encryption. (Reel n) [optional] Encryption (Reel n) [optional]

TECHNICAL MEDIA SPECIFICATION ON THE FILE BASED SUBMISSION OF MATERIALS TO BE AIRED

AN MPEG-4 BASED HIGH DEFINITION VTR

Subtitle Safe Crop Area SCA

High-Definition, Standard-Definition Compatible Color Bar Signal

Delivery of Spots to

DRAFT. Sign Language Video Encoding for Digital Cinema

NOTICE. (Formulated under the cognizance of the CTA R4 Video Systems Committee.)

Images and Formats. Dave Bancroft. Philips Broadcast Film Imaging

CEA Standard. Standard Definition TV Analog Component Video Interface CEA D R-2012

for Television ---- Formatting AES/EBU Audio and Auxiliary Data into Digital Video Ancillary Data Space

ANSI/SCTE

SERIES T: TERMINALS FOR TELEMATIC SERVICES Still-image compression JPEG 2000

NOTICE. (Formulated under the cognizance of the CTA R4.8 DTV Interface Subcommittee.)

ENGINEERING COMMITTEE Digital Video Subcommittee AMERICAN NATIONAL STANDARD ANSI/SCTE

RECOMMENDATION ITU-R BT (Questions ITU-R 25/11, ITU-R 60/11 and ITU-R 61/11)

ATSC Standard: Video Watermark Emission (A/335)

Specification of colour bar test pattern for high dynamic range television systems

ATSC Standard: 3D-TV Terrestrial Broadcasting, Part 1

ATSC Standard: A/342 Part 1, Audio Common Elements

Real-time serial digital interfaces for UHDTV signals

ATSC Candidate Standard: Video Watermark Emission (A/335)

ATSC Digital Television Standard: Part 6 Enhanced AC-3 Audio System Characteristics

RECOMMENDATION ITU-R BT * Video coding for digital terrestrial television broadcasting

DVB-UHD in TS

SERIES H: AUDIOVISUAL AND MULTIMEDIA SYSTEMS Infrastructure of audiovisual services Coding of moving video

DCI Memorandum Regarding Direct View Displays

Real-time serial digital interfaces for UHDTV signals

ITU-T Y.4552/Y.2078 (02/2016) Application support models of the Internet of things

Proposed Standard Revision of ATSC Digital Television Standard Part 5 AC-3 Audio System Characteristics (A/53, Part 5:2007)

General viewing conditions for subjective assessment of quality of SDTV and HDTV television pictures on flat panel displays

Media Delivery Technical Specifications for VMN US Network Operations

RECOMMENDATION ITU-R BT Studio encoding parameters of digital television for standard 4:3 and wide-screen 16:9 aspect ratios

INTERNATIONAL STANDARD

Rec. ITU-R BT RECOMMENDATION ITU-R BT PARAMETER VALUES FOR THE HDTV STANDARDS FOR PRODUCTION AND INTERNATIONAL PROGRAMME EXCHANGE

ISO INTERNATIONAL STANDARD. Digital cinema (D-cinema) quality Part 1: Screen luminance level, chromaticity and uniformity

ATSC Digital Television Standard Part 4 MPEG-2 Video System Characteristics (A/53, Part 4:2007)

Standards Manager Web Standards List SMPTE-Society of motion picture & television Engineers

Rec. ITU-R BT RECOMMENDATION ITU-R BT * WIDE-SCREEN SIGNALLING FOR BROADCASTING

Verifying Digitized Files

Digital Cinema System Specification

Serial Digital Interface Checkfield for 10-Bit 4:2:2 Component and 4fsc Composite Digital Signals

INTERNATIONAL STANDARD

Digital interfaces for studio signals with image formats

TA Document Enhancements to the AV/C Tape Recorder/Player Subunit Specification Version 2.1

Configuration. -- Rrs Right rear surround. 13 Motion Data Synchronous signal (currently used. 14 Sync Signal Used for external sync (e.g.

Standard Definition. Commercial File Delivery. Technical Specifications

PROPOSED SMPTE STANDARD

RECOMMENDATION ITU-R BT Digital interfaces for HDTV studio signals

ATSC Standard: 3D-TV Terrestrial Broadcasting, Part 5 Service Compatible 3D-TV using Main and Mobile Hybrid Delivery

CONSOLIDATED VERSION IEC Digital audio interface Part 3: Consumer applications. colour inside. Edition

TECHNICAL REQUIREMENTS Commercial Spots

FREE TV AUSTRALIA OPERATIONAL PRACTICE OP- 59 Measurement and Management of Loudness in Soundtracks for Television Broadcasting

Uncompressed high quality video over IP. Ladan Gharai USC/ISI

R 95 SAFE AREAS FOR 16:9 TELEVISION PRODUCTION VERSION 1.1 SOURCE: VIDEO SYSTEMS

Technical requirements for the reception of TV programs, with the exception of news and public affairs programs Effective as of 1 st January, 2018

ENGINEERING COMMITTEE Digital Video Subcommittee AMERICAN NATIONAL STANDARD. HEVC Video Constraints for Cable Television Part 2- Transport

INTERNATIONAL STANDARD

TR 040 EVALUATION OF IMF FOR BROADCASTERS

INTERNATIONAL STANDARD

Metadata for Enhanced Electronic Program Guides

RECOMMENDATION ITU-R BR.716-2* (Question ITU-R 113/11)

TECHNICAL STANDARDS FOR DELIVERY OF TELEVISION PROGRAMMES TO

TR 038 SUBJECTIVE EVALUATION OF HYBRID LOG GAMMA (HLG) FOR HDR AND SDR DISTRIBUTION

Serial Digital Interface

Interface Practices Subcommittee SCTE STANDARD SCTE Composite Distortion Measurements (CSO & CTB)

for Television Data Structure for DV-Based Audio, Data and Compressed Video 25 and 50 Mb/s

4 H.264 Compression: Understanding Profiles and Levels

Digital Video Subcommittee SCTE STANDARD SCTE HEVC Video Constraints for Cable Television Part 2- Transport

APEX SPECIFICATION 0403 DIGITAL CONTENT DELIVERY METHODOLOGY FOR AIRLINE IN-FLIGHT ENTERTAINMENT SYSTEMS VERSION 1.34

ATSC Candidate Standard: A/341 Amendment SL-HDR1

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Standards Update. Alan Lambshead, SVP Standards Howard Lukk, Director of Engineering and Standards November 2, 2017

High Definition Television. Commercial File Delivery. Technical Specifications

TEN.02_TECHNICAL DELIVERY - INTERNATIONAL

IPTV delivery of media over networks managed end-to-end, usually with quality of service comparable to Broadcast TV

SERIES J: CABLE NETWORKS AND TRANSMISSION OF TELEVISION, SOUND PROGRAMME AND OTHER MULTIMEDIA SIGNALS Measurement of the quality of service

ENGINEERING COMMITTEE Digital Video Subcommittee SCTE STANDARD SCTE

Chapter 3 Fundamental Concepts in Video. 3.1 Types of Video Signals 3.2 Analog Video 3.3 Digital Video

445 Hamilton, White Plains, NY Telephone: Fax:

Transcription:

WD SMPTE STANDARD SMPTE 2067-30-201x Interoperable Master Format Application #3 Page 1 of 20 pages Version 0.5 (3/6/2012 4:08:00 a3/p3 :: application-3-2012-03-06_r010.doc) Warning This document is not a SMPTE Standard. It is distributed for review and comment. It is subject to change without notice and shall not be referred to as a SMPTE Standard. Recipients of this document are invited to submit, with their comments, notification of any relevant patent rights of which they are aware and to provide supporting documentation. Distribution does not constitute publication. Table of Contents 1 Introduction 2 2 Scope 2 3 Conformance Notation 2 4 Normative References 3 5 Overall 4 6 Essence 5 7 Track Files 9 8 Composition 17 9 IMF Master Package (IMP) 19 Annex A Sequence Elements 19 Annex B StereoImageTrackFileResourceType 19 Foreword SMPTE (the Society of Motion Picture and Television Engineers) is an internationally-recognized standards developing organization. Headquartered and incorporated in the United States of America, SMPTE has members in over 80 countries on six continents. SMPTE s Engineering Documents, including Standards, Recommended Practices, and Engineering Guidelines, are prepared by SMPTE s Technology Committees. Participation in these Committees is open to all with a bona fide interest in their work. SMPTE cooperates closely with other standards-developing organizations, including ISO, IEC and ITU. Copyright 2007 by THE SOCIETY OF MOTION PICTURE AND TELEVISION ENGINEERS 3 Barker Avenue., White Plains, NY 10601 (914) 761-1100 Approved <TBD>

SMPTE Engineering Documents are drafted in accordance with the rules given in Part XIII of its Administrative Practices. This SMPTE Engineering Document was prepared by Technology Committee 35PM-50. EDITOR'S NOTE: URI values defined herein, e.g. http://www.smpte-ra.org/schemas/2065-20/xxxx, are temporary and will be replaced by their final values prior to publication, at which point this note will be removed. 1 Introduction A single TV or movie title is transformed into multiple content versions (airline edits, special edition, languages ) These versions, which share common assets sourced from high-quality source masters, are ultimately made available to multiple distribution channels (Internet, optical media, broadcast...) across multiple territories and over the span of many months to over a year. The IMF is a file-based framework that allows these high-quality versions, called Compositions, to be efficiently represented, managed and processed on file-based systems. For example, it facilitates the generation of multiple outputs of the same Composition (through instructions contained in an Output Profile List define in other documents) to accommodate the specific needs of distribution channels. Since management and processing of Compositions are performed across multiple devices and service providers, interoperability is desirable. Figure 1. The IMF framework allows the management and processing of multiple high-quality versions, called Compositions, of a finished content. 2 Scope This document defines Compositions for IMF Application #3, i.e. the parameters of Image/Audio/Data Essences, Track Files, and the constraints of Composition Playlist. It is based on Image Essence coded as a MPEG-4 Visual Simple Studio Profile codestream and Audio Essence coded as linear PCM. Compliant implementations need not implement all formats and format values, manufacturers are encouraged to indicate supported formats. 3 Conformance Notation Normative text is text that describes elements of the design that are indispensable or contains the Page 2 of 20 pages

conformance language keywords: "shall", "should", or "may". Informative text is text that is potentially helpful to the user, but not indispensable, and can be removed, changed, or added editorially without affecting interoperability. Informative text does not contain any conformance keywords. All text in this document is, by default, normative, except: the Introduction, any section explicitly labeled as "Informative" or individual paragraphs that start with "Note: The keywords "shall" and "shall not" indicate requirements strictly to be followed in order to conform to the document and from which no deviation is permitted. The keywords, "should" and "should not" indicate that, among several possibilities, one is recommended as particularly suitable, without mentioning or excluding others; or that a certain course of action is preferred but not necessarily required; or that (in the negative form) a certain possibility or course of action is deprecated but not prohibited. The keywords "may" and "need not" indicate courses of action permissible within the limits of the document. The keyword reserved indicates a provision that is not defined at this time, shall not be used, and may be defined in the future. The keyword forbidden indicates reserved and in addition indicates that the provision will never be defined in the future. A conformant implementation according to this document is one that includes all mandatory provisions ("shall") and, if implemented, all recommended provisions ("should") as described. A conformant implementation need not implement optional provisions ("may") and need not implement them as described. Unless otherwise specified, the order of precedence of the types of normative information in this document shall be as follows: Normative prose shall be the authoritative definition; Tables shall be next; followed by formal languages; then figures; and then any other language forms. 4 Normative References The following standards contain provisions which, through reference in this text, constitute provisions of this standard. At the time of publication, the editions indicated were valid. All standards are subject to revision, and parties to agreements based on this standard are encouraged to investigate the possibility of applying the most recent edition of the standards indicated below. SMPTE ST 381-2:2011 Material Exchange Format (MXF) - Mapping MPEG Streams into the MXF Constrained Generic Container SMPTE ST 382:2007 Material Exchange Format - Mapping AES3 and Broadcast Wave Audio into the MXF Generic Container SMPTE ST 274:2008 Television 1920 x 1080 Image Sample Structure, Digital Representation and Digital Timing Reference Sequences for Multiple Picture Rates SMPTE ST 296:2001 Television 1280 x 720 Progressive Image Sample Structure Analog and Digital Representation and Analog Interface (R2006) Page 3 of 20 pages

SMPTE ST 2048-1:2011 2048 x 1080 and 4096 x 2160 Digital Cinematography Production Image Formats FS/709 SMPTE ST2067-3:2012, Interoperable Master Format Composition Playlist SMPTE ST2067-5:2012, Interoperable Master Format Essence Component ISO/IEC 14496-2 (2004-06), Information Technology Coding of Audio Visual Objects Part 2: Visual ISO/IEC 14496-2:2004/Amd 5:2009 Simple studio profile levels 5 and 6 ITU-R BT.709-5:2002, Parameter values for the HDTV standards for production and international programme exchange ITU-R BR.1352-2:2002, Broadcast Wave Format(BWF), Annex 1, Annex 1 Appendix 1 and 2, and Annex 3 World Wide Web Consortium (W3C) (2004, October 28). XML Schema Part 1: Structures (Second Edition) World Wide Web Consortium (W3C) (2004, February 4). Extensible Markup Language (XML) 1.0 (Third Edition) SMPTE ST 428-1:2006 D-Cinema Distribution Master - Image Characteristics SMPTE ST 429-8:2007 D-Cinema Packaging - Packing List SMPTE ST 429-5:2009 D-Cinema Packaging - Timed Text Track File SMPTE ST 330:2011 Unique Material Identifier (UMID) SMPTE ST 377-1:2011 Material Exchange Format (MXF) - File Format Specification SMPTE RP 224 SMPTE Labels Register Internet Engineering Task Force (IETF) (2005, July). RFC 4122 - A Universally Unique Identifier (UUID) URN Namespace 5 Overall 5.1 XML Schema and Namespace This Application defines XML elements using the XML Schema language specified in W3C XML Schema Part 1. Table 1 specifies the XML schema root element and target namespace. Table 1. XML Schema root element definition. <xs:schema targetnamespace="http://www.smpte-ra.org/schemas/2065-20/xxxx" xmlns:cpl="http://www.smpte-ra.org/schemas/2065-3/xxxx" xmlns:atwo="http://www.smpte-ra.org/schemas/2065-20/xxxx" elementformdefault="qualified" attributeformdefault="unqualified"> <xs:import namespace=" http://www.smpte-ra.org/schemas/2065-3/xxxx" /> <!-- schema definitions found in this document excluding this one --> </xs:schema> Page 4 of 20 pages

5.2 UUID Generation UUID values used by this Application shall be generated as specified in IETF RFC 4122. UUID values that identify assets or cryptographic information shall be generated using a truly-random or pseudo-random number source, and shall have a Version field value of 4 (or 0100b). NOTE: The b suffix on this value indicates a binary encoding, most significant bit (MSB) first. 5.3 XML Character Encoding XML documents shall be encoded using UTF-8, as specified in W3C Extensible Markup Language (XML) 1.0. 6 Essence 6.1 Image Essence 6.1.1 Characteristics An Image Lattice and the Production Aperture shall be as defined in SMPTE ST 2046-1. 6.1.1.1 Safe Action and Safe Title Areas Safe Action and Safe Title Areas shall be as defined in SMPTE ST 2046-1. 6.1.1.2 Image Sample Structure and Frame Rates The 1920 x 1080 image sample structure and frame rates shall be as defined in SMPTE ST 274. Table 2. 1920 x 1080 image sample structure and frame rates System No. System nomenclature Frame Rate(Hz) 1 1920x1080/60/P 60 2 1920x1080/59.94/P 60/1.001 3 1920x1080/50/P 50 4 1920x1080/60/I 30 5 1920x1080/59.94/I 30/1.001 6 1920x1080/50/I 25 7 1920x1080/30/P 30 8 1920x1080/29.97/P 30/1.001 9 1920x1080/25/P 25 10 1920x1080/24/P 24 11 1920x1080/23.98/P 24/1.001 The 1280 x 720 progressive image sample structure and frame rates shall be as defined in SMPTE ST 296. Page 5 of 20 pages

Table 3. 1270 x 720 progressive image sample structure and frame rates System No. System nomenclature Frame Rate(Hz) 1 1280x720/60/P 60 2 1280x720/59.94/P 60/1.001 3 1280x720/50/P 50 4 1280x720/30/P 30 5 1280x720/29.97/P 30/1.001 6 1280x720/25/P 25 7 1280x720/24/P 24 8 1280x720/23.98/P 24/1.001 The 2048 x 1080 progressive image sample structure and frame rates shall be as defined in SMPTE ST 2048-1. Table 4. 2048 x 1080 progressive image sample structure and frame rates System No. System nomenclature Frame Rate(Hz) 1 2048x1080/60/P 60 2 2048x1080/59.94/P 60/1.001 3 2048x1080/50/P 50 4 2048x1080/48/P 48 5 2048x1080/47.95/P 48/1.001 6 2048x1080/30/P 30 7 2048x1080/29.97/P 30/1.001 8 2048x1080/25/P 25 9 2048x1080/24/P 24 10 2048x1080/23.98/P 24/1.001 The 4096 x 2160 progressive image sample structure and frame rates shall be as defined in SMPTE ST 2048-1. Table 5. 4096 x 2160 progressive image sample structure and frame rates Page 6 of 20 pages

System No. System nomenclature Frame Rate(Hz) 1 4096x2160/60/P 60 2 4096x2160/59.94/P 60/1.001 3 4096x2160/50/P 50 4 4096x2160/48/P 48 5 4096x2160/47.95/P 48/1.001 6 4096x2160/30/P 30 7 4096x2160/29.97/P 30/1.001 8 4096x2160/25/P 25 9 4096x2160/24/P 24 10 4096x2160/23.98/P 24/1.001 6.1.1.3 Color Components Each image frame shall be sampled using either R G B, X Y Z or Y C BC R color component triplets as specified below. 6.1.1.3.1 Bit Depth Each component shall be represented by an integer in the set {0..2 n 1}, with n being one of the values of Table 6 Table 6. Component Bit Depth. 10 12 6.1.1.3.2 Colorimetry Values of the R G B, X Y Z or Y C BC R components shall be mapped to red, green and blue tristimulus values according to one of the colorimetry systems of Table 7. Table 7. Colorimetry. Colorimetry System BT709 Applicable Component Triplets R G B Y C BC R Description Mapped as specified in ITU BT.709 DCDM X Y Z Mapped as specified in SMPTE ST 428-1 NOTE: In ITU BT.709, the signals R', G', B', Y, C B and C R are referred to as signals E' R, E' G, E' B, E' Y, E' CB and E' CR respectively, i.e. they correspond to gamma pre-corrected signals. Page 7 of 20 pages

6.1.1.3.3 Sampling In 4:4:4 sampling, each component shall be sampled once at each image frame pixel. In 4:2:2 Y C BC R sampling, the Y' component shall be sampled at each pixel, but the C R and C B components shall be horizontally subsampled by a factor of two with respect to the Y component, co-sited with evennumbered Y samples. Progressive and interlaced frame structures shall be sampled according to Table 8. Table 8. Component Sampling. 4:4:4 R'G'B' Progressive Structure X'Y'Z' Interlaced Structure R'G'B' 4:2:2 Y C BC R Y C BC R 6.1.1.4 Stereoscopic and Monoscopic Image Essence Monoscopic Essence consists of a single sequence of image frames. Stereoscopic Essence consists of a sequence of pairs of image frames, a left eye frame and a right eye frame, for stereoscopic viewing. The two images of a pair shall be coincident in time. Stereoscopic Essence shall be used only with progressive frame structure. 6.1.2 Encoding 6.1.2.1 Profile The Image Essence shall be encoded using one of the Profile And Level of ISO/IEC 14996 listed in Table 9. Profile And Level Table 9. MPEG-4 Video Profile And Level Typical Visual Session formats 1 Max. bitrate (Mbit/s) MPEG-4 Visual Simple Studio Profile/Level 2 ITU - R709.60I:422 600 MPEG-4 Visual Simple Studio Profile/Level 3 MPEG-4 Visual Simple Studio Profile/Level 4 MPEG-4 Visual Simple Studio Profile/Level 5 ITU - R709.60I:444 ITU - R709.60I:4224 ITU - R709.60P:444 ITU - R709.60I:4444 4Kx2Kx24P:444 4Kx2Kx30P:444 900 1350 1800 MPEG-4 Visual Simple Studio Profile/Level 6 4Kx2Kx60P:444 3600 1 This column is for informative use only. It provides an example configuration of max. no.of pixels/session Page 8 of 20 pages

6.1.2.2 Component Ordering In a codestream, color components shall be ordered as specified in Table 10. Table 10. MPEG-4 Color Component Ordering. Component Index R'G'B' Y C BC R X'Y'Z' 0 G' Y Y' 1 B' C B Z' 2 R' C R X' 6.2 Audio Essence The Audio Essence characteristics shall conform to ITU-R BR.1352-2, with the following constraints applied. 6.2.1 Bits per Sample The number of bits per Audio Essence sample shall be one of the values listed in Table 11. Table 11. Audio Essence Bits Per Samples. 24 6.2.2 Sampling Rates The Audio Essence shall be sampled at one of the rates listed in Table 11. Table 12. Audio Sampling Rates. 48,000 96,000 6.3 Data Essence T.B.D. 7 Track Files 7.1 Overall 7.1.1 Format Track Files shall conform to SMPTE ST2067-5. 7.1.2 Identification The Package UID of a given Track File shall be a basic UMID per SMPTE ST 330, having a UUID value in the material number part and a material number generation method of UUID/UL. The Package UID value shall be further constrained as follows: Byte 11 of the UL portion of the UMID shall be 0Fh (unidentified material type). Page 9 of 20 pages

Byte 12 of the UL portion of the UMID shall be 20h (UUID/UL material number generation method and undefined instance number generation method). The three bytes of the instance number shall be 0 (zero). Package UID values generated in accordance with the normative provisions of this subsection will thus have the following contents in the first 16 bytes: 060a2b34h 01010105h 01010f20h 13000000h. 7.1.3 MIME Type Track File shall use the MIME type application/mxf. T.B.D. 7.1.4 Encryption 7.2 Image Track Files 7.2.1 Essence Image Track Files shall contain Image Essence conforming to Section 6. 7.2.2 Track Format and Mapping Image Track Files shall comply with constrained parameters defined in SMPTE ST 2067-5, i.e. OP1a single essence, Frame-Wrapping, and so on. The mapping method conforms to SMPTE ST 381-2. In the case of progressive source image, each frame-coded Access Unit (AU) shall be wrapped in a single KLV the Edit Rate shall be equal to the frame rate. In the case of interlaced source image, each of two field-coded AUs shall be wrapped in a single KLV where the first field AU precedes the second field AU the Edit Rate shall be equal to the frame rate. The Top-Level File Package of Image Track File shall reference a CDCI Picture Essence Descriptor in SMPTE ST 377-1 if the Image Essence uses Y C BC R color components; or an RGBA Picture Essence Descriptor in SMPTE S T377-1 if the Image Essence uses R'G'B' or X'Y'Z' color components. 7.2.2.1 Generic Picture Essence Descriptor The Generic Picture Essence Descriptor items shall be constrained as specified in Table 13. Table 13. Generic Picture Essence Descriptor Items. Generic Picture Essence Descriptor Item Constraints Page 10 of 20 pages

Signal Standard Shall be present. See SMPTE ST377-1 G.2.3 HD=SMPTE ST274(04h), 2K/4K XYZ=SMPTE ST428-1(07h), 2K/4K RGB=(00h) Frame Layout Shall be present. See Section 7.2.2.1.3. progressive(0), interlace(1) Stored Width See Section 7.2.2.1.1. HD=1920, 2K=2048, 4K=4096 Stored Height See Section 7.2.2.1.1. HD p=1088, i=544 2K p=1088, 4K p=2176 StoredF2Offset Sampled Width Sampled Height Shall be present if the image is Interlaced and the default value is 0. Shall be present and shall be equal to the image active area width HD=1920, 2K=2048, 4K=4096 Shall be present and shall be equal to the image active area height HD p=1080, i=540 2K p=1080, 4K p=2160 SampledXOffset Shall be present and the default value is 0 SampledYOffset Shall be present and the default value is 0 DisplayHeight DisplayWidth Shall be present and shall be equal to the image active area height HD p=1080, i=540 2K p=1080, 4K p=2160 Shall be present and shall be equal to the image active area width HD=1920, 2K=2048, 4K=4096 DisplayXOffset Shall be present and the default value is 0 DisplayYOffset Shall be present and the default value is 0 DisplayF2Offset Aspect Ratio Video Line Map Shall be present if the image is Interlaced and the default value is 0. Shall be present. See ST377-1 G.2.4 HD=16:9, 2K/4K=256:135 Shall be present. See ST377-1 G.2.12 HD p={42,0}, i={21,584} 2K p={42,0}, 4K p={84,0} Transfer Characteristic Shall be present. See Section 7.2.2.1.4. YUV/RGB = ITU-R BT.709 Page 11 of 20 pages

(06 0e 2b 34 04 01 01 01 04 01 01 01 01 02 00 00) XYZ = ST428-1 (06 0e 2b 34 04 01 01 08 04 01 01 01 01 07 00 00) Image Alignment Offset Shall be present and the default value is 0 Image Start Offset Shall be present and the default value is 0 Image End Offset Shall be present and the default value is 0 FieldDominance Shall be present if the image is Interlaced and the default value is 1.. Picture Essence Coding Shall be present. See Section 7.2.2.1.7. MP4 SStP L2 (06 0e 2b 34 04 01 01 03 04 01 02 02 01 20 10 02). [422] 1920x1080/422/30P SQ/LQ 2 1920x1080/422/60P LQ 2 1920x1080/422/60P SQ 3 [444] 1920x1080/444/30P SQ 2 1920x1080/444/30P HQ 3 2048x1080/444/30P HQ 3 1920x1080/444/60P SQ 3 2048x1080/444/60P SQ 3 1920x1080/444/60P HQ 4 2048x1080/444/60P HQ 4 4096x2160/444/30P SQ 5 4096x2160/444/60P SQ 6 7.2.2.1.1 Signal Standard The value of the Signal Standard item shall be equal to 04h (SMPTE ST 274) if 1920x1080, YUV 10bit, REC709 04h (SMPTE ST 274) if 1920x1080, RGB 10bit/12bit, REC709 00h (No specific underlying standard) if 2048x1080/4096x2160, RGB 10bit, REC709 07h (SMPTE ST 428-1) if 2048x1080/4096x2160, XYZ 12bit, DCDM 7.2.2.1.2 Stored Width and Stored Height The values of the Stored Width and Stored Height items shall be set according to the image frame structure, as specified in Table 14. Table 14. Stored Width and Stored Height values. Page 12 of 20 pages

Frame Structure Stored Width Stored Height Progressiv e Image Frame Width Image Frame Height Interlaced Image Frame Width Image Frame Height / 2 7.2.2.1.3 Frame Layout The value of the Frame Layout item shall be equal to 00h (FULL_FRAME) if the image structure is progressive 01h (SEPARATE_FIELDS) if the image structure is interlaced. 7.2.2.1.4 Transfer Characteristic The value of the Transfer Characteristic item shall be equal to 06.0E.2B.34.04.01.01.01.04.01.01.01.01.02.00.00 ["Identifies ITU-R BT.709 transfer characteristic" in RP 224] if the color components are R G B or Y C BC R 06.0E.2B.34.04.01.01.08.04.01.01.01.01.07.00.00 ["Identifies DCDM ST428-1 transfer characteristic" in RP 224] if the color components are X'Y'Z' 7.2.2.1.5 Coding Equations The value of the Coding Equations item shall be equal to 06.0E.2B.34.04.01.01.01.04.01.01.01.02.02.00.00 ["Identifies ITU BT-709 Coding Equations" in RP 224] if the BT709 colorimetry systems are used, as defined in Section 6.1.1.3.2. 7.2.2.1.6 Color Primaries The value of the Color Primaries item shall be equal to 06.0E.2B.34.04.01.01.06.04.01.01.01.03.03.00.00 ["Identifies ITU-R BT.709 color primaries and white point" in RP 224] if the BT709 colorimetry systems are used, as defined in Section 6.1.1.3.2. 7.2.2.1.7 Picture Essence Coding The value of Picture Essence Coding item shall reflect Mpeg4 simple studio profile used to encode the Image Essence (see Section 6) and shall be set to one of the Picture Essence Compression Uls [SMPTE RP224] of Table 15. Table 15. Specification of the Picture Essence Coding UL. Page 13 of 20 pages

Byte No. Description Value (hex) Meaning 1-15 06.0E.2B.34.04.01.01.03.04.01.02.02.01.20.10 16 Level 02h MPEG-4 Visual Simple Studio Profile Level 2 per ISO/IEC 14496-2 03h 04h 05h 06h MPEG-4 Visual Simple Studio Profile Level 3 per ISO/IEC 14496-2 MPEG-4 Visual Simple Studio Profile Level 4 per ISO/IEC 14496-2 MPEG-4 Visual Simple Studio Profile Level 5 per ISO/IEC 14496-2 MPEG-4 Visual Simple Studio Profile Level 6 per ISO/IEC 14496-2 7.2.2.2 RGBA Picture Essence Descriptor The RGBA Picture Essence Descriptor items shall be constrained as specified in Table 16. Table 16. RGBA Essence Descriptor items RGBA Picture Essence Descriptor Item Component Max Ref Component Min Ref PixelLayout Constraints Shall be present. See Section 7.2.2.2.2. RGB 10bit =940 RGB 12bit = 3760 XYZ 12bit = 4095 Shall be present. See Section 7.2.2.2.2. RGB 10bit = 64 RGB 12bit = 256 XYZ 12bit = 0 Shall be present. See Section 7.2.2.2.1. 7.2.2.2.1 PixelLayout The value of the PixelLayout item shall be equal to { G, x, B, x, R, x, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0 } or { Y, 12, Z, 12, X, 12, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0 } where x is equal to 10 or 12 if 10-bit or 12-bit color components are used per Section 6.1.1.3.1, respectively. 7.2.2.2.2 Component Max Ref and Component Min Ref Component Max Ref is a Property, whose unsigned 32-bit integer value specifies the RGB/Y(of XYZ) sample value for reference white level. Similarly, Component Min Ref is a Property, whose unsigned 32-bit integer value specifies the RGB/Y(of XYZ) sample value for reference black level. Example: For 10-bit ITU-R BT.709, Component Max Ref is 940 and Component Min Ref is 64; For DCDM, Page 14 of 20 pages

Component Max Ref is 3960 and Component Min Ref is 47 (2000:1) or 36 (4000:1); For 10-bit full scale linear, Component Max Ref is 1023 and Component Min Ref is 0 as specified in Table 17. Table 17. Component Max Ref and Component Min Ref values. Scale BT709 RGB DCDM XYZ Full scale RGB/Y Componen t Bit Depth Componen t Min Ref Componen t Max Ref 10 12 12 10 12 64 256 47 (2000:1) or 36 (4000:1) 0 0 940 3760 3960 1023 4095 7.2.2.3 CDCI Picture Essence Descriptor The CDCI Picture Essence Descriptor items shall be constrained as specified in Table 18. Table 18. CDCI Essence Descriptor items CDCI Picture Essence Descriptor Item Component Depth Constraints Shall be present and shall be equal to the Component Bit Depth used (see Section 6.1.1.3.1) Horizontal Subsampling Shall be present. See Section 7.2.2.3.1. Shall be present. See ST377-1 G.2.29 (shall be 2) Vertical Subsampling Color Siting Shall be present and shall be 01h Shall be 00h ReversedByteOrder Shall be present and the default value is 0 PaddingBits Shall be present and the default value is 0 Black Ref Level Shall be present. See Section 7.2.2.3.2. White Ref level Shall be present. See Section 7.2.2.3.2. Color Range Shall be present. See Section 7.2.2.3.2. 7.2.2.3.1 Horizontal Subsampling The value of Horizontal Subsampling item shall be equal to 01h if 4:4:4 sampling is used per Section 6.1.1.3.3. 02h if 4:2:2 sampling is used per Section 6.1.1.3.3. 7.2.2.3.2 Black Ref Level, White Ref level and Color Range Example: For 10-bit, Black Ref Level is 64 and White Ref level is 940; For 12-bit, Black Ref Level is 256 and White Ref level is 3760 as specified in Table 19. Page 15 of 20 pages

Table 19. Black Ref Level, White Ref level and Color Range values. Componen t Bit Depth Black Ref Level White Ref level Color Range 10 12 64 256 940 3760 897 NOTE: The White Ref level item applies only to the Y' component and the Color Range item to the C B and C R components. 7.2.2.4 MPEG-4 Visual Sub Descriptor The Top-Level File Package of Image Track File shall reference a MPEG-4 Visual Sub Descriptor defined in SMPTE ST 381-2. 7.2.3 Reference The material number creation method of UMID for the top-level file package shall be UUID method. 7.3 Audio Track Files 7.3.1 Essence An Audio Track File shall contain Audio Essence conforming to Section 6. 7.3.2 Track Format and Mapping Audio Track Files shall comply with constrained parameters defined in SMPTE ST 2067-5, i.e. OP1a single essence, Clip-Wrapping, and so on. The mapping method conforms to SMPTE ST 382. 7.3.2.1 Wave Audio Essence Descriptor The Top-Level File Package shall reference a Wave Audio Essence Descriptor defined in SMPTE ST 382. The ChannelCount, Quantization Bits and Audio Sampling Rate items shall be present. 7.3.2.2 Extended Audio Essence Subdescriptor T.B.D. 7.3.2.3 Audio Labeling T.B.D. 7.3.3 Reference The material number creation method of UMID for the top-level file package shall be UUID method. Page 16 of 20 pages

7.3.4 Audio Channels and Soundfield Groups T.B.D. 7.4 Data Essence Track Files 7.4.1 Essence A Data Essence Track File shall contain Data Essence conforming to Section 6. 7.4.2 Track Format and Mapping Data Essence Track Files shall comply with constrained parameters defined in SMPTE ST 2067-5, i.e. OP1a single essence, and so on. [T.B.D. for Mapping]. 7.4.3 Reference The material number creation method of UMID for the top-level file package shall be UUID method. 8 Composition 8.1 Homogenous Essence 8.1.1 Image Within a given a composition, the following shall remain constant: all Image Essence characteristics specified in Section 6.1.1. the codestream profile specified in Section 6.1.2. 8.1.2 Audio Within a given a composition, the Audio Essence characteristics listed in Table 20 shall remain constant. Table 20. Homogeneous Image Essence Characteristics. Characteristic Definition Bits per Sample See Section 6 Sampling Rate See Section 6 8.1.3 Data Essence Within a given a composition, the characteristics of the Data Essence listed in Table 21 shall remain constant. Table 21. Homogeneous Data Essence Characteristics. Characteristic Definition Standard See Section 6 Sub-type See Section 6 Page 17 of 20 pages

8.2 Virtual Tracks The following specifies constraints on virtual tracks (see SMPTE ST2067-3) present in a Composition. 8.2.1 Main Image Virtual Track A Composition shall contain exactly one Main Image Virtual Track. The Main Image Virtual Track shall consist of one or more MainImageSequence elements, as specified in Section A.1. If the underlying Image Essence is stereoscopic, as defined in Section 6.1.1.4, each MainImageSequence element shall contain Resource elements of type StereoImageTrackFileResourceType, as defined in Annex B, that reference two Track Files that conform to Section 7.2.. If the underlying Image Essence is monoscopic, as defined in Section 6.1.1.4, all MainImageSequence elements shall contain Resource elements of type TrackFileResourceType that reference Track File that conform to Section 7.2. The Edit Rate of the Resource elements shall be equal to the image frame rate of the underlying Essence. 8.2.2 Audio Virtual Tracks A Composition shall contain one or more Audio Virtual Tracks. Each Audio Virtual Track shall consist of one or more MainAudioSequence elements, as specified in Section A.2. Each MainAudioSequence element shall contain Resource elements of type TrackFileResourceType that reference Track File conforming to Section 7.3. The Edit Rate of the Resource elements shall be equal to the image frame rate of the underlying Essence. T.B.D. 8.2.3 Data Essence Virtual Tracks 8.3 Composition Edit Rate The Composition Edit Rate shall be equal to the edit rate of the Image Essence frame sample rate referenced by the Main Image Virtual Track. NOTE: The Composition Edit Rate does not constrain the rate at which implementations can reproduce a Composition. For instance, a Composition with an Edit Rate of 24/1 can be reproduced at 24,000/1,001 frames per second. 8.4 Track File Identification The TrackFileId element of TrackFileResourceType shall be equal to the Package UID value of the Top-level File Package. Page 18 of 20 pages

8.5 Segment Duration The duration of a Segment shall be greater than or equal to the duration of one Image Essence frame referenced by the Main Image Virtual Track. If the average number of audio samples per Composition Edit Unit is not an integer, the duration of each Segment shall be an integer multiples of 5/Composition Edit Rate. 8.6 Content Version A Composition Playlist instance shall contain at least one ContentVersion element. The Id element within each ContentVersion element may contain a valid URI. A scheme is shown in Table 22 for illustration. Table 22. Illustrative Content Version Id URI schemes (informative). UUID [RFC 4122] 8.7 EssenceDescriptor The File Descriptor structure (including Sub Descriptors) referenced by the top-level File Package of the Track Files referenced by the Composition Playlist shall be mapped to an EssenceDescriptor element using SMPTE ST 434:2006 [or https://www.smpte.org/apps/org/workgroup/31fs-ahg-reg_xml/]. 8.8 Digital Certificates T.B.D. 9 IMF Master Package (IMP) T.B.D. Annex A Sequence Elements A.1 Image <xs:element name="mainimagesequence" type="cpl:sequencetype"/> A.2 Audio <xs:element name="mainaudiosequence" type="cpl:sequencetype"/> A.3 Data Essence T.B.D. Annex B StereoImageTrackFileResourceType The StereoImageTrackFileResourceType shall represent stereoscopic content consisting of two Track Files, Page 19 of 20 pages

each consisting of monoscopic Essence. The Track Files referenced by the Left Eye and Right Eye elements shall contain views associated with the left and right eyes of the viewer. <xs:complextype name="stereoimagetrackfileresourcetype"> <xs:complexcontent> <xs:extension base="cpl:baseresourcetype"> <xs:sequence> <xs:element name="lefteye" type="cpl:trackfileresourcetype" /> <xs:element name="righteye" type="cpl:trackfileresourcetype" /> </xs:sequence> </xs:extension> </xs:complexcontent> </xs:complextype> The SourceDuration of an instance of StereoImageTrackFileResourceType shall be equal to the corresponding parameters contained in its LeftEye and RightEye elements. Page 20 of 20 pages