Check that hvcC matches `ispe` and `pixi` #58

leo-barnes · 2023-04-28T08:09:52Z

I investigated the file mentioned in this comment and concluded that the issue is that the ispe does not match the actual dimensions of the coded item.

In this case the hvcC says that the dimensions of the coded item are 1600x1096, while the ispe says 1599x1096. This in turn leads to the clap property specifying a crop rect that starts at a negative offset.
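
To illustrate how a one-pixel ispe mismatch can push the crop rect negative, here is a minimal sketch of the clean aperture left-edge computation as I understand it from ISO/IEC 14496-12 (the aperture is centred on the image and shifted by the offsets). The actual clap values of the file are not given in this thread, so the clean aperture width of 1600 and the zero horizontal offset below are hypothetical, and `clap_left_edge` is an illustrative helper, not compliance warden code.

```python
from fractions import Fraction


def clap_left_edge(image_width, clean_aperture_width, horiz_off=Fraction(0)):
    """Return the x position of the leftmost clean aperture pixel.

    The centre of the clean aperture is horiz_off + (image_width - 1) / 2,
    and the aperture extends (clean_aperture_width - 1) / 2 to each side.
    """
    pc_x = horiz_off + Fraction(image_width - 1, 2)
    return pc_x - Fraction(clean_aperture_width - 1, 2)


# Hypothetical clap authored against the decoded width of 1600.
print(clap_left_edge(1600, 1600))  # 0
# Same clap evaluated against the ispe width of 1599.
print(clap_left_edge(1599, 1600))  # -1/2
```

Exact fractions are used because clap stores its values as rationals, so the half-pixel offset is not lost to rounding.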

It would be great if the compliance warden could do some minimal validation of the HEVC codec config against the ispe. Other properties that could be checked include pixi, colr and potentially the MIAF brands that set limits on profiles.
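
As a rough idea of what the pixi part of such a check could look like, here is a minimal sketch. It assumes the chroma format and bit depth fields have already been parsed out of the hvcC; `HvccFields`, `expected_pixi_bits` and `pixi_matches_hvcc` are hypothetical names, not part of the compliance warden, and the field names simply mirror the HEVC syntax elements.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class HvccFields:
    # Assumed to be read from the HEVC decoder configuration record.
    chroma_format_idc: int        # 0 means monochrome
    bit_depth_luma_minus8: int
    bit_depth_chroma_minus8: int


def expected_pixi_bits(cfg: HvccFields) -> List[int]:
    """Bits per channel that pixi would be expected to list for this config."""
    luma = cfg.bit_depth_luma_minus8 + 8
    if cfg.chroma_format_idc == 0:
        return [luma]             # single channel for monochrome
    chroma = cfg.bit_depth_chroma_minus8 + 8
    return [luma, chroma, chroma]


def pixi_matches_hvcc(pixi_bits: List[int], cfg: HvccFields) -> bool:
    """True when the pixi channel list agrees with the codec config."""
    return list(pixi_bits) == expected_pixi_bits(cfg)


# A 10-bit 4:2:0 stream paired with an 8-bit pixi is flagged as a mismatch.
print(pixi_matches_hvcc([8, 8, 8], HvccFields(1, 2, 2)))  # False
```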

leo-barnes · 2023-04-28T08:16:34Z

I realize this might be a large ask, but given that the compliance warden can handle the av1C, it would be a nice addition.

rbouqueau · 2023-04-28T09:53:16Z

We need two things:

Parse the NALUs here
Define a rule. Is this something you intend to make normative (i.e. add to HEIF or MIAF, as either a SHALL or a SHOULD)?

leo-barnes · 2023-04-28T10:44:06Z

> Define a rule. Is this something you intend to make normative (i.e. add to HEIF or MIAF, as either a SHALL or a SHOULD)?

I would say that this is already a requirement, though it may not be spelled out clearly enough.
@podborski and @cconcolato may know more details.

From HEIF:

6.3 Derivation of an output image of an image item

The reconstructed image of an image item is derived as follows:

if the image item contains a coded image, the coded image is decoded and the reconstructed image is the output of the decoding process specified for the item type of the image item;
...

6.5.3 Image spatial extents

...
The ImageSpatialExtentsProperty documents the width and height of the associated image item. Every image item shall be associated with one property of this type, prior to the association of all transformative properties.
...
image_width specifies the width of the reconstructed image in pixels, as specified in 6.3.
image_height specifies the height of the reconstructed image in pixels, as specified in 6.3.

So we have:

ispe shall be present
ispe documents the dimensions of the reconstructed image
If the image item is a coded item, the reconstructed image is the output of the decoding process specified for that type of item

I don't know my way around the HEVC spec well enough to immediately say which section describes the output dimensions, but it's most likely a combination of pic_width_in_luma_samples, pic_height_in_luma_samples and the conformance cropping window (if specified) in the SPS.
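
A minimal sketch of that derivation, assuming the SPS has already been parsed: the conformance window offsets are expressed in chroma sample units, so they are scaled by SubWidthC and SubHeightC before being subtracted from the coded size. `SpsFields`, `output_dimensions` and `ispe_matches_sps` are hypothetical names for illustration, not compliance warden code.

```python
from dataclasses import dataclass

# (SubWidthC, SubHeightC) indexed by chroma_format_idc.
SUB_SAMPLING = {0: (1, 1), 1: (2, 2), 2: (2, 1), 3: (1, 1)}


@dataclass
class SpsFields:
    pic_width_in_luma_samples: int
    pic_height_in_luma_samples: int
    chroma_format_idc: int = 1
    # All offsets stay 0 when conformance_window_flag is 0.
    conf_win_left_offset: int = 0
    conf_win_right_offset: int = 0
    conf_win_top_offset: int = 0
    conf_win_bottom_offset: int = 0


def output_dimensions(sps: SpsFields):
    """Luma width and height of the decoder output after conformance cropping."""
    sub_w, sub_h = SUB_SAMPLING[sps.chroma_format_idc]
    width = sps.pic_width_in_luma_samples - sub_w * (
        sps.conf_win_left_offset + sps.conf_win_right_offset)
    height = sps.pic_height_in_luma_samples - sub_h * (
        sps.conf_win_top_offset + sps.conf_win_bottom_offset)
    return width, height


def ispe_matches_sps(image_width: int, image_height: int, sps: SpsFields) -> bool:
    """True when ispe documents the same size as the decoder output."""
    return (image_width, image_height) == output_dimensions(sps)


# The numbers from this issue: coded item is 1600x1096, ispe says 1599x1096.
sps = SpsFields(pic_width_in_luma_samples=1600, pic_height_in_luma_samples=1096)
print(output_dimensions(sps))             # (1600, 1096)
print(ispe_matches_sps(1599, 1096, sps))  # False
```

Note that with 4:2:0 or 4:2:2 content the conformance window can only remove an even number of luma columns, so an odd width such as 1599 cannot be expressed through SPS cropping alone.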

rbouqueau · 2023-06-01T15:11:45Z

Is anyone able to comment here?

cconcolato · 2023-06-01T20:04:34Z

I agree with @leo-barnes's analysis
