Improve use of layered images. #118

cconcolato · 2020-10-29T00:23:51Z

closes #103

leo-barnes

Looks good. Found a typo which I commented on inline. Do we also want to add recommendations, or do we do that as part of #104 ?

index.bs

leo-barnes

Looks good to me. My only concern is for the case where you have two output images (for instance two viewpoints) sharing a base layer. If you don't use lsel, a decoder may try to use the layers for progressive refinement which is going to look a bit weird. On the other hand, the spec does note that if you want full control over what should happen you should be using lsel.

index.bs

leo-barnes

Looks good to me, minor comments inline.
Sorry for the delay, must have missed the notification that you updated the review.

index.bs

…elector property

leo-barnes

I think this looks good. Some minor questions inline.

index.bs

…g property

leo-barnes

👍

wantehchang

LGTM. I suggest a few small changes.

index.bs

wantehchang · 2021-01-20T01:19:11Z

index.bs

+
+[[!AV1]] supports encoding a frame using multiple spatial layers. A spatial layer may improve the resolution or quality of the image decoded based on one or more of the previous layers. A layer may also provide an image that does not depend on the previous layers. Additionally, not all layers are expected to produce an image meant to be rendered. Some decoded images may be used only as intermediate decodes. Finally, layers are grouped into one or more [=Operating Points=]. The [=Sequence Header OBU=] defines the list of [=Operating Points=], provides required decoding capabilities, and indicates which layers form each [=Operating Point=].
+
+[[!AV1]] delegates the selection of which [=Operating Point=] to process to the application. AVIF defines the [=OperatingPointSelectorProperty=] to control this selection. In the absence of an [=OperatingPointSelectorProperty=] associated with an [=AV1 Image Item=], the AVIF renderer is free to process any [=Operating Point=] present in the [=AV1 Image Item Data=]. In particular, when the [=AV1 Image Item=] is composed of a unique [=Operating Point=], the [=OperatingPointSelectorProperty=] should not be present. If an [=OperatingPointSelectorProperty=] is associated with an [=AV1 Image Item=], the op_index field indicates which [=Operating Point=] is expected to be presented for this item.


[Just a comment. No change requested.] Just FYI: The op_index field here corresponds to the operatingPoint variable in the AV1 spec. There is also an OperatingPointIdc variable in the AV1 spec, but OperatingPointIdc is a bit mask, not an index. Assuming "Idc" stands for "index", I find the names used in the AV1 spec very confusing.

Should the semantics of op_index indicate that it gives the value of the operatingPoint variable?

Should the semantics of op_index indicate that it gives the value of the operatingPoint variable?

If it is not too verbose, we can mention that OperatingPointSelectorProperty implements the choose_operating_point() function and op_index gives the value of the operatingPoint variable in the AV1 spec. In other words, this is equivalent to the following statement in Section. 5.5.1 of the AV1 spec:

operatingPoint = choose_operating_point( )

wantehchang

Hi Cyril: The new version of the PR looks good! I noticed a contradiction with HEIF's requirement on whether the 'lsel' property may be absent for a layered image item.

index.bs

wantehchang · 2021-01-20T20:00:24Z

index.bs

+
+NOTE: When an author wants to offer the ability to render multiple [=Operating Points=] from the same AV1 image (e.g. in the case of multi-view images), multiple [=AV1 Image Items=] can be created that share the same [=AV1 Image Item Data=] but have different [=OperatingPointSelectorProperty=]s.
+
+[[!AV1]] expects the renderer to display only one frame within the selected [=Operating Point=], which should be the highest spatial layer that is both within the [=Operating Point=] and present within the temporal unit, but [[!AV1]] leaves the option for other applications to set their own policy about which frames are output. AVIF sets a different policy, and defines how the 'lsel' property (defined in [[!HEIF]]) is used to control which layer is rendered. In the absence of a 'lsel' property associated with an [=AV1 Image Item=], the renderer is free to render either: only the output image of the highest spatial layer, or to render all output images of all the intermediate layers, resulting in a form of progressive decoding. If a 'lsel' property is associated with an [=AV1 Image Item=], the renderer is expected to render only the output image for that layer.


The first sentence says "..., but [[!AV1]] leaves the option for other applications to set their own policy about which frames are output." We could note that this refers to the second paragraph in Section 7.18.1 of the AV1 spec:

If scalability is being used (OperatingPointIdc not equal to 0), an application-specific function is called to decide whether this frame will be output. If this function returns a value equal to 0, then this process terminates immediately.

I think this cross reference would be useful because that "application-specific function" does not have a name and it is hard to find that paragraph when one reads the AV1 spec.

I'm not sure we want to add a reference to a specific section, but I added an indication that it's in the "general output process" section.

index.bs

cconcolato added 2 commits October 28, 2020 17:20

adding initial version of the section about multi-layered images (#103)

9974868

update change log for layered images

3888adf

cconcolato requested a review from leo-barnes October 29, 2020 00:23

leo-barnes requested changes Oct 29, 2020

View reviewed changes

index.bs Outdated Show resolved Hide resolved

cconcolato added 2 commits November 11, 2020 08:26

Merge branch 'master' into layered

c76858e

fix typo

2afcab8

leo-barnes approved these changes Nov 18, 2020

View reviewed changes

add layer index property

1b54926

cconcolato requested review from leo-barnes, wantehchang and joedrago November 25, 2020 23:39

Merge branch 'master' into layered

4ff2e6e

wantehchang reviewed Dec 9, 2020

View reviewed changes

index.bs Outdated Show resolved Hide resolved

index.bs Outdated Show resolved Hide resolved

index.bs Outdated Show resolved Hide resolved

index.bs Outdated Show resolved Hide resolved

index.bs Outdated Show resolved Hide resolved

index.bs Show resolved Hide resolved

leo-barnes approved these changes Dec 9, 2020

View reviewed changes

index.bs Outdated Show resolved Hide resolved

index.bs Show resolved Hide resolved

leo-barnes reviewed Dec 11, 2020

View reviewed changes

index.bs Outdated Show resolved Hide resolved

cconcolato added 3 commits January 5, 2021 17:56

address nitpicks

1c211e8

rewrite of layered properties section and addition of operatingpoints…

c84559c

…elector property

Merge branch 'master' into layered

a869073

leo-barnes approved these changes Jan 12, 2021

View reviewed changes

index.bs Outdated Show resolved Hide resolved

index.bs Outdated Show resolved Hide resolved

index.bs Outdated Show resolved Hide resolved

cconcolato added 3 commits January 14, 2021 17:53

clarify which byte ranges can be found with the layered image indexin…

7ddab8a

…g property

fix some section hyper links

aaee9b2

clarify value range for lsel property

4b39200

leo-barnes approved these changes Jan 15, 2021

View reviewed changes

wantehchang approved these changes Jan 20, 2021

View reviewed changes

cconcolato added 2 commits January 20, 2021 10:57

Merge branch 'master' into layered

a995ee7

address editorial comments

22a7572

wantehchang approved these changes Jan 20, 2021

View reviewed changes

fix quantity field of new boxees

b824aaa

cconcolato changed the title ~~Define use of LayerSelectorProperty for multi-layered images.~~ Improve use of layered images. Feb 3, 2021

Merge branch 'master' into layered

e6727ba

cconcolato merged commit d1220be into master Feb 4, 2021

cconcolato deleted the layered branch February 4, 2021 21:15


		[[!AV1]] supports encoding a frame using multiple spatial layers. A spatial layer may improve the resolution or quality of the image decoded based on one or more of the previous layers. A layer may also provide an image that does not depend on the previous layers. Additionally, not all layers are expected to produce an image meant to be rendered. Some decoded images may be used only as intermediate decodes. Finally, layers are grouped into one or more [=Operating Points=]. The [=Sequence Header OBU=] defines the list of [=Operating Points=], provides required decoding capabilities, and indicates which layers form each [=Operating Point=].

		[[!AV1]] delegates the selection of which [=Operating Point=] to process to the application. AVIF defines the [=OperatingPointSelectorProperty=] to control this selection. In the absence of an [=OperatingPointSelectorProperty=] associated with an [=AV1 Image Item=], the AVIF renderer is free to process any [=Operating Point=] present in the [=AV1 Image Item Data=]. In particular, when the [=AV1 Image Item=] is composed of a unique [=Operating Point=], the [=OperatingPointSelectorProperty=] should not be present. If an [=OperatingPointSelectorProperty=] is associated with an [=AV1 Image Item=], the `op_index` field indicates which [=Operating Point=] is expected to be presented for this item.


		NOTE: When an author wants to offer the ability to render multiple [=Operating Points=] from the same AV1 image (e.g. in the case of multi-view images), multiple [=AV1 Image Items=] can be created that share the same [=AV1 Image Item Data=] but have different [=OperatingPointSelectorProperty=]s.

		[[!AV1]] expects the renderer to display only one frame within the selected [=Operating Point=], which should be the highest spatial layer that is both within the [=Operating Point=] and present within the temporal unit, but [[!AV1]] leaves the option for other applications to set their own policy about which frames are output. AVIF sets a different policy, and defines how the 'lsel' property (defined in [[!HEIF]]) is used to control which layer is rendered. In the absence of a 'lsel' property associated with an [=AV1 Image Item=], the renderer is free to render either: only the output image of the highest spatial layer, or to render all output images of all the intermediate layers, resulting in a form of progressive decoding. If a 'lsel' property is associated with an [=AV1 Image Item=], the renderer is expected to render only the output image for that layer.

Improve use of layered images. #118

Improve use of layered images. #118

Uh oh!

Conversation

cconcolato commented Oct 29, 2020 • edited by pr-preview bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

leo-barnes left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

leo-barnes left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

leo-barnes left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

leo-barnes left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

leo-barnes left a comment

Choose a reason for hiding this comment

Uh oh!

wantehchang left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

wantehchang Jan 20, 2021

Choose a reason for hiding this comment

Uh oh!

cconcolato Jan 20, 2021

Choose a reason for hiding this comment

Uh oh!

wantehchang Jan 20, 2021

Choose a reason for hiding this comment

Uh oh!

wantehchang left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

wantehchang Jan 20, 2021

Choose a reason for hiding this comment

Uh oh!

cconcolato Feb 3, 2021

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cconcolato commented Oct 29, 2020 •

edited by pr-preview bot

Loading