MPEG 145

MPEG 145 took place as an online conference from 2024-01-22 until 2024-01-26.

Press Release

Latest Edition of the High Efficiency Image Format Standard Unveils Cutting-Edge Features for Enhanced Image Decoding and Annotation

At the 145th MPEG meeting, MPEG Systems (WG 3) ratified the third edition of its High Efficiency Image Format (HEIF; ISO/IEC 23008-12: Image file format). HEIF has solidified its position as one of the most rapidly and widely adopted standards in the imaging industry. The newest edition represents a significant leap forward, introducing progressive decoding capabilities that elevate image quality through a sequential, single-decoder instance process. This enhancement empowers users to decode a bitstream in successive steps, with each phase delivering perceptible improvements in image quality compared to the preceding step.

Additionally, this edition introduces a sophisticated data structure that meticulously describes the spatial configuration of the camera and outlines the distinctive characteristics of the camera responsible for generating the image content.

Furthermore, the updated HEIF specification encompasses innovative tools for annotating specific areas in diverse shapes, enhancing the versatility of image content manipulation. The inclusion of these annotation features adds a layer of creativity and customization, catering to the diverse needs of users across various industries.

Not stopping at these remarkable upgrades, the HEIF standard is actively advancing its technology portfolio. The ongoing development efforts promise to introduce support for renderable text items, providing a comprehensive solution for incorporating textual elements seamlessly into images. Additionally, the inclusion of slim versions of image files addresses the demand for efficient use cases, particularly for smaller image sizes such as icons.

MPEG Systems finalizes Standards supporting Interoperability Testing

At the 145th MPEG meeting, MPEG Systems (WG 3) finalized two standards comprising conformance and reference software by promoting them to Final Draft International Standard (FDIS), the final stage of standards development. This pivotal milestone represents the culmination of rigorous standards development and underscores MPEG Systems’ commitment to innovation and excellence in the field.

The finalized standards, ISO/IEC 23090-24 and ISO/IEC 23090-25, showcase the pinnacle of conformance and reference software for scene description and visual volumetric video-based coding data, respectively. These standards offer not only reference implementations but also essential bitstreams for conformance testing, ensuring robustness and reliability in real-world applications.

ISO/IEC 23090-24 focuses on conformance and reference software for scene description, providing a comprehensive reference implementation and bitstream tailored for conformance testing related to ISO/IEC 23090-14, scene description. This standard opens new avenues for advancements in scene depiction technologies, setting a new standard for conformance and software reference in this domain.

Similarly, ISO/IEC 23090-25 targets conformance and reference software for the carriage of visual volumetric video-based coding data. With a dedicated reference implementation and bitstream, this standard is poised to elevate the conformance testing standards for ISO/IEC 23090-10, the carriage of visual volumetric video-based coding data. The introduction of this standard is expected to have a transformative impact on the visualization of volumetric video data.

Both standards, ISO/IEC 23090-24 and ISO/IEC 23090-25, will be made freely accessible for download on the official ISO website, ensuring widespread availability for industry professionals, researchers, and enthusiasts alike. This commitment to openness and accessibility aligns with MPEG Systems’ mission to contribute to the broader technological community and foster collaboration.

MPEG finalizes the Third Edition of MPEG-D Dynamic Range Control

At the 145th MPEG meeting, MPEG Audio Coding (WG 6) completed the work on the third edition of ISO/IEC 23003-4, Dynamic range control, promoting it to the Final Draft International Standard (FDIS) stage. This update incorporates two amendments into the second edition, originally published in 2020.

The third edition includes the specification of dynamic range control (DRC) side chain information and metadata-based real-time loudness leveling for live workflows. The technologies enable producers of live content, such as sports broadcasts and concerts, to seamlessly integrate MPEG-D DRC-based loudness leveling into their existing workflows. The metadata-based approach offers the highest possible quality of loudness processing and dynamic range control while maintaining full flexibility and control in playback devices. The technology can be tightly integrated with existing audio codecs such as MPEG-D USAC, MPEG-H Audio, or any other audio codec supporting MPEG-D DRC.

MPEG finalizes the Second Edition of MPEG-4 Audio Conformance

At the 145th MPEG meeting, MPEG Audio Coding (WG 6) completed the second edition of ISO/IEC 14496-26, Audio conformance, promoting it to the Final Draft International Standard (FDIS) stage. This update incorporates seven corrigenda and five amendments into the first edition, originally published in 2010.

ISO/IEC 14496-26 serves as a pivotal standard, providing a framework for designing tests to ensure the compliance of compressed data and decoders with the requirements outlined in ISO/IEC 14496-3 (MPEG-4 Audio). The second edition reflects an evolution of the original, addressing key updates and enhancements through diligent amendments and corrigenda.

This latest edition, now at the FDIS stage, marks a notable stride in MPEG Audio Coding’s commitment to refining audio conformance standards and ensuring the seamless integration of compressed data within the MPEG-4 Audio framework.

MPEG Genomic Coding extended to support Transport and File Format for Genomic Annotations

At the 145th MPEG meeting, the MPEG Genomic Coding (WG 8) working group extended its transport and file format to support the coding of any common type of annotation obtained from the analysis of DNA sequencing data.

The third edition of ISO/IEC 23092-1, Transport and file format, which supports joint coding of sequencing and annotation data, has been promoted to Final Draft International Standard (FDIS). The MPEG-G standard series (ISO/IEC 23092) can now support full application pipelines, covering data representation and compression from the output of sequencing up to the results of tertiary analysis in a single structured transport and file format. The extended structured and compressed representation provides the basis for standard APIs implementing advanced browsing and searching features. These include standard APIs for exact and approximate string matching directly in the compressed domain for sequencing data metadata and annotations. These new functionalities are fundamental for searching large databases of compressed sequencing and annotation data resulting from the massive amounts of sequencing data generated by next-generation sequencing technologies.
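To make the distinction between exact and approximate matching concrete, here is a minimal sketch. It works on plain, uncompressed strings and is not the MPEG-G API: the function name `find_matches`, its `max_mismatches` parameter, and the sample sequence are all hypothetical, and "approximate" is assumed here to mean a bounded number of substituted characters (Hamming distance), which is only one possible definition.

```python
from typing import List


def find_matches(text: str, pattern: str, max_mismatches: int = 0) -> List[int]:
    """Return start positions where pattern matches text.

    A window counts as a match when it differs from the pattern in at most
    max_mismatches positions (0 means exact matching).
    """
    positions = []
    m = len(pattern)
    # Slide a window of the pattern's length across the text.
    for start in range(len(text) - m + 1):
        window = text[start:start + m]
        # Count positions where the window and the pattern disagree.
        mismatches = sum(1 for a, b in zip(window, pattern) if a != b)
        if mismatches <= max_mismatches:
            positions.append(start)
    return positions


# Hypothetical sample sequence, for illustration only.
reference = "ACGTACGAACGT"
exact = find_matches(reference, "ACGT")
approx = find_matches(reference, "ACGT", max_mismatches=1)
print(exact)   # [0, 8]
print(approx)  # [0, 4, 8]
```

The approximate search also reports position 4, where `ACGA` differs from the pattern by one character. A search in the compressed domain, as described above, would return such results without first decompressing the whole data set; this sketch does not attempt that.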

In addition, the MPEG Genomic Coding working group also reached the first milestone for the 2nd edition of ISO/IEC 23092-5 (MPEG-G Part 5, Conformance) by promoting the text to Committee Draft (CD) status. This new edition incorporates support for the newly issued Part 6 of the MPEG Genomics family of standards: the coding of genomic annotations. The conformance standard supports detailed diagnostic assessment of decoder implementations so that conformant implementations can be certified and can provide functional guarantees as required by regulations of diagnostic devices.

MPEG White Paper

At the 145th MPEG meeting, MPEG Liaison and Communication (AG 3) approved the following MPEG white paper, which is available at https://www.mpeg.org/whitepapers/.

Neural Network Coding (NNC) – Efficient Storage and Inference of Neural Networks for Multimedia Applications

Artificial neural networks have been adopted for a broad range of tasks in almost every technical field, such as medical applications, transportation, network optimization, big data analysis, surveillance, speech, audio, image and video classification, image and video compression, and many more. An additional factor in the exponential growth of their use is the emergence of new use cases, such as federated learning with continuous communication between many devices. To effectively reduce bandwidth usage in communication and reduce the size of networks for inference, achieving an optimal compression ratio must be prioritized. Thus, a standard for neural network coding (NNC) has been defined in ISO/IEC 15938-17 (Compression of Neural Networks for Multimedia Description and Analysis), with the second edition adding new compression tools and support for coding incremental updates of neural networks.

Incremental coding, one of the main extensions in the second edition, targets neural network updates as a difference signal between a base neural network (i.e., an instance of a trained neural network for the particular use case) and an updated neural network. The updated neural network is typically the result of one of the following operations (the list is not exhaustive):

  • The base neural network is retrained with other data or hyper-parameters.
  • The base neural network and the updated neural network are compressed versions of the same network with different compression ratios.
  • The updated neural network is the result of applying transfer learning, starting from the base neural network.
  • The updated neural network uses the base neural network in its structure (possibly retrained end-to-end).
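
The idea of an update as a difference signal can be sketched as follows. This is a toy illustration, not the NNC codec: the function names `encode_update` and `apply_update`, the dictionary-of-lists weight layout, and the sample values are all assumptions, and the sketch presumes both networks share the same tensor names and shapes. The quantization and entropy coding a real codec would apply to the difference are omitted.

```python
from typing import Dict, List

# Hypothetical weight layout: tensor name -> flat list of parameters.
Weights = Dict[str, List[float]]


def encode_update(base: Weights, updated: Weights) -> Weights:
    """Return the per-parameter difference (updated - base)."""
    return {
        name: [u - b for u, b in zip(updated[name], base[name])]
        for name in updated
    }


def apply_update(base: Weights, delta: Weights) -> Weights:
    """Rebuild the updated network by adding the difference to the base."""
    return {
        name: [b + d for b, d in zip(base[name], delta[name])]
        for name in delta
    }


# Sample values chosen so the arithmetic is exact in floating point.
base = {"layer1": [0.5, -1.0, 2.0], "layer2": [1.0, 1.0]}
updated = {"layer1": [0.5, -0.75, 2.0], "layer2": [1.0, 1.25]}

delta = encode_update(base, updated)
reconstructed = apply_update(base, delta)

print(delta)                     # {'layer1': [0.0, 0.25, 0.0], 'layer2': [0.0, 0.25]}
print(reconstructed == updated)  # True
```

In this toy example most entries of the difference are zero, so a receiver that already holds the base network only needs the few changed values to obtain the updated one.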

Output documents published in MPEG 145

MPEG-I

# | Part | Title
2 | Omnidirectional Media Format | Technologies under Consideration for OMAF
2 | Omnidirectional Media Format | Draft text of ISO/IEC 23090-2 DAM 1 Server-side dynamic adaptation
3 | Versatile Video Coding | Preliminary working draft 7 of SEI processing order and processing order nesting SEI messages in VVC
4 | Immersive Audio | MPEG-I immersive audio Encoder Input Format, Version 8
6 | Immersive Media Metrics | Technologies under Consideration for ISO/IEC 23090-6 Immersive media metrics
7 | Immersive Media Metadata | Technologies under Consideration for Immersive media metadata
8 | Network based Media Processing | Technologies under Consideration for NBMP
9 | Geometry-based Point Cloud Compression | Common test conditions for G-PCC
10 | Carriage of Visual Volumetric Video-based Coding Data | Technologies under consideration on carriage of V3C data
13 | Video Decoding Interface for Immersive Media | WD of ISO/IEC 23090-13 2nd edition Video decoding interface for immersive media
14 | Scene Description for MPEG Media | Potential improvement of ISO/IEC 23090-14 DAM 2 Support for haptics, augmented reality, avatars, interactivity and lighting
14 | Scene Description for MPEG Media | Draft registration of Khronos extensions 2nd edition
14 | Scene Description for MPEG Media | Technologies under consideration for ISO/IEC 23090-14 Scene Description
14 | Scene Description for MPEG Media | Procedures for standard development for ISO/IEC 23090-14 (MPEG-I Scene Description)
14 | Scene Description for MPEG Media | Requirements Coverage of MPEG-I Scene Description
14 | Scene Description for MPEG Media | Draft Text of ISO/IEC 23090-14 2nd edition Scene description
14 | Scene Description for MPEG Media | Proposed Khronos blog post on MPEG-I Scene Description
17 | Reference Software and Conformance for OMAF | WD of Reference software and conformance for omnidirectional media format (OMAF) 2nd edition
18 | Carriage of Geometry-based Point Cloud Compression Data | Technologies under Considerations on Carriage of geometry-based point cloud compression data
18 | Carriage of Geometry-based Point Cloud Compression Data | WD of ISO/IEC 23090-18 AMD 2 Point reliability indication and other improvements
24 | Conformance and Reference Software for Scene Description for MPEG Media | Procedures for test scenarios and reference software development for MPEG-I Scene Description
24 | Conformance and Reference Software for Scene Description for MPEG Media | WD of ISO/IEC 23090-24 AMD 1 Conformance and reference software for scene description on haptics, augmented reality, avatars, interactivity and lighting
29 | Video-based dynamic mesh coding | Common Test Conditions for V-DMC
31 | Haptics coding | Definition of Haptics Media
33 | Conformance and reference software for haptics coding | Text of ISO/IEC CD 23090-33 Conformance and reference software

MPEG-DASH

# | Part | Title
1 | Media Presentation Description and Segment Formats | Technologies under Consideration for DASH
7 | Delivery of CMAF content with DASH | Exploration on alignment of ISOBMFF/DASH/CMAF terminology, concepts and solutions
9 | Encoder and packager synchronization | Draft text of ISO/IEC FDIS 23009-9 Redundant encoding and packaging for segmented live media (REAP)

MPEG-H

# | Part | Title
12 | Image File Format | Technology under Consideration on ISO/IEC 23008-12
12 | Image File Format | Exploration on low-overhead HEIF-compatible image file format
12 | Image File Format | WD of ISO/IEC 23008-12 3rd edition AMD 2 Support for tone map derivation and others

MPEG-G

# | Part | Title
- | - | Use case and requirements for adaptable private data management of genomic information representation on the large-scale cloud environment

MPEG-4

# | Part | Title
12 | ISO base Media File Format | Technologies under Consideration for ISO/IEC 14496-12
12 | ISO base Media File Format | Exploration of carriage of depth map and alpha map as a new media type in ISOBMFF
14 | MP4 File Format | Technologies under Consideration for ISO/IEC 14496-14 MP4 File format
15 | Carriage of Network Abstraction Layer (NAL) Unit Structured Video in the ISO base Media File Format | Technologies under Consideration for ISO/IEC 14496-15 Carriage of NAL unit structured video in ISOBMFF
22 | Open Font Format | WD of ISO/IEC 14496-22 5th edition Open font format
34 | Syntactic description language | Draft DoC on ISO/IEC DIS 14496-34 Syntactic description language
34 | Syntactic description language | Technology under Consideration on ISO/IEC 14496-34 Syntactic Description Language
34 | Syntactic description language | Draft text of ISO/IEC FDIS 14496-34 Syntactic description language

MPEG-2

# | Part | Title
1 | Systems | WD of ISO/IEC 13818-1 9th edition AMD 1 Codec parameter clarifications and other improvements

MPEG-C

# | Part | Title
7 | Versatile Supplemental Enhancement Information Messages for Coded Video Bitstreams | Preliminary WD: SEI messages for VSEI version 4
7 | Versatile Supplemental Enhancement Information Messages for Coded Video Bitstreams | Technologies under consideration for future extensions of VSEI (version 3)

MPEG-B

# | Part | Title
7 | Common Encryption in ISO Base Media File Format Files | Technologies under Consideration for ISO/IEC 23001-7 Common Encryption
10 | Carriage of Timed Metadata Metrics of Media in ISO Base Media File Format | Technologies under Consideration for ISO/IEC 23001-10 Carriage of timed metadata metrics of media in ISOBMFF
10 | Carriage of Timed Metadata Metrics of Media in ISO Base Media File Format | Text of ISO/IEC 23001-10 CDAM 2 Support for display attenuation map
11 | Energy-Efficient Media Consumption (green metadata) | Preliminary draft of consolidated text on carriage of green metadata
16 | Derived Visual Tracks in the ISO Base Media File Format | Technologies under Consideration for ISO/IEC 23001-16 Derived visual tracks including further visual derivations
17 | Carriage of Uncompressed Video in ISOBMFF | Text of ISO/IEC 23001-17 CDAM 2 Agnostically compressed media

MPEG-A

# | Part | Title
19 | Common Media Application Format (CMAF) for Segmented Media | WD of ISO/IEC 23000-19 AMD 2 New Structural CMAF Brand Profile
23 | Decentralized media rights application format | WD of ISO/IEC 23000-23 Decentralized media rights application format
24 | Messaging media application format | Working Draft of ISO/IEC 23000-24 Messaging media application format

MPEG-21

# | Part | Title
6 | Rights Data Dictionary | Call for Candidates for Registration Authority for ISO/IEC 21000-6 Rights Data Dictionary

MPEG-7

# | Part | Title
17 | Compression of Neural Networks for Multimedia Content Description and analysis | Application and Verification of NNC in Different Use Cases
17 | Compression of Neural Networks for Multimedia Content Description and analysis | White paper on Neural Network Compression

Explorations

# | Part | Title
7 | Immersive Video | Overview of lenslet video coding activities
7 | Immersive Video | Common test conditions of lenslet video coding
36 | Neural Network-based Video Compression | Exploration experiment on neural network-based video coding (EE1)
36 | Neural Network-based Video Compression | Description of algorithms and software in neural network-based video coding (NNVC) version 6
36 | Neural Network-based Video Compression | Call for training materials for neural network-based video coding tool development
41 | Enhanced compression beyond VVC capability | Exploration experiment on enhanced compression beyond VVC capability (EE2)
41 | Enhanced compression beyond VVC capability | Algorithm description of enhanced compression model 12 (ECM 12)
46 | Audio Coding for Machines | Scope and Roadmap for Audio coding for Machines (ACoM)
47 | Metadata Definition and Carriage for Split Rendering | Exploration on Metadata Definition and Carriage for Split Rendering
48 | Indicating AI generated/altered content using the MPEG Systems technologies | Exploration on indicating AI generated/altered content using the MPEG Systems technologies

MPEG-AI

# | Part | Title
2 | Video coding for machines | Common test conditions for video coding for machines
4 | Feature coding for machines | Common test and training conditions for FCM

Other documents published in MPEG 145

Type | Title
AhG | WG2 AHGs established at the 14th WG2 meeting (MPEG 145)
Output | Assets of communication
Time line | MPEG Roadmap after the MPEG 145 meeting
Time line | MPEG Roadmap after the MPEG 145 meeting (extended version)
Administration | Calling Notice of the 14th SC 29/WG 03 Meeting (MPEG 145), 2024-01-22 ~ 26, OnLine
Administrative Matters | Request for offers to host a MPEG meeting (MPEG 149 - MPEG 154)
Administrative Matters | Meeting Notice of the 146th MPEG meeting including the 15th meeting of SC29/AG2,3,5, WG2,3,4,5,6,7,8