Michael Graczyk wrote:
https://datatracker.ietf.org/doc/draft-graczyk-codec-ambisonics/

Thanks for presenting this document to the group.

With my chair hat on: I think it would be helpful for us to gage the level of interest if people would comment on the list if they think this is something useful for this group to work on.

As an individual, I had a few comments on the draft:

The Ogg format is a container for transmission and storage of audio.

Ogg is a general purpose container, supporting audio, video, subtitles, etc. (though its most common use in current applications is for audio).

I think it may be useful to say that this channel mapping number will also likely impact other formats which use the same channel mapping families, e.g., Maktroska, MP4, MPEG TS. Just adding a sentence to the introduction along the lines of, "This mapping can also be used in other contexts which make use of the channel mappings defined by the Opus Channel Mapping Families registry."

Allowed numbers of channels: (1 + n)^2 for n = 0...14.  Explicitly 4,
9, 16, 25, 36, 49, 64, 81, 100, 121, 144, 169, 196, 225.

For n = 0, (1 + n)^2 == 1, but that's not on your explicit list. Was the intention here to exclude mono as redundant with other channel mapping families? We didn't do that for channel mapping family 1 (since, for example, the presence of an explicit channel mapping table allows you to extract a single channel from an already-encoded coupled stream, which is not possible with channel mapping family 0). But if you did intend to exclude mono, you should explicitly say why, and adjust the range of n.

sqrt((2 - delta(m)) * ((n - m)! / (n + m)!)),

I've read the [ambix] reference before, and it still took me a while to figure out what this normalization coefficient is actually referring to. It might be useful to say in more detail what "normalization" even means: is this a factor you are expecting encoders to apply to the audio before encoding? Are you defining what decoders will apply during reconstruction? The coefficient in general seems highly dependent on how you express the spherical harmonic basis functions. E.g., part of this just comes from the Legendre polynomials, which could have simply been expressed by saying to use Legendre polynomials with unit norm (under some norm).

It may be better to simply refer to [ambix] Section 2.1, equation (1) for the definition of the spherical harmonics basis functions (the explicit section/equation makes it easy to find), and leave out this equation entirely. Conversely, it would be reasonable to spell out the full equation for those basis functions in this document (to make it more self-contained). But this kind of sits in-between, where it looks like it's trying to say something that you can understand on its own, but in fact it's impossible to interpret outside of the context of that reference (which already says it).

Figure 1: Stereo Downmixing Matrix

I doubt this will really cause confusion, but you don't define W and Y. You might want to say they correspond to the first two ambisonics channels, explicitly.

Updates: 7845 (if approved)

Does this actually update RFC7845? I think the intent was to allow updates to the channel mapping registry to be done without an explicit update of 7845, since it's possible it might not even be done with an IETF RFC (e.g., if there were some broadcast-specific standard done at a different standards body).

_______________________________________________
codec mailing list
[email protected]
https://www.ietf.org/mailman/listinfo/codec

Reply via email to