ref: e37262c97f0a4eef999ec78e1e192a541838efb7
parent: 3f3cd994242814f0d8ab187d8635a626b0ea17ce
author: Ron <[email protected]>
date: Sun Jan 19 10:20:05 EST 2014
Formally introduce the terms mode and configuration and use them more strictly in the rest of the text.
--- a/doc/draft-ietf-codec-oggopus.xml
+++ b/doc/draft-ietf-codec-oggopus.xml
@@ -179,13 +179,19 @@
regular, undelimited framing from Section 3 of <xref target="RFC6716"/>.
All of the Opus packets in a single Ogg packet MUST be constrained to have the
same duration.
-The duration and coding modes of each Opus packet are contained in the
- TOC (table of contents) sequence in the first few bytes.
A decoder SHOULD treat any Opus packet whose duration is different from that of
the first Opus packet in an Ogg packet as if it were an Opus packet with an
illegal TOC sequence.
</t>
<t>
+The coding mode (SILK, Hybrid, or CELT), audio bandwidth, channel count,
+ duration (frame size), and number of frames per packet, are indicated in the
+ TOC (table of contents) in the first byte of each Opus packet, as described
+ in Section 3.1 of <xref target="RFC3533"/>.
+The combination of mode, audio bandwidth, and frame size, is referred to as
+ the configuration of an Opus packet.
+</t>
+<t>
The first audio data page SHOULD NOT have the 'continued packet' flag set
(which would indicate the first audio data packet is continued from a previous
page).
@@ -329,7 +335,7 @@
</t>
<t>
-Since medium-band audio is only supported in the SILK mode, wideband frames
+Since medium-band audio is an option only in the SILK mode, wideband frames
SHOULD be generated if switching from it to CELT mode, to ensure that
any PLC implementation that does try to migrate state between the modes
will be able to preserve all of the available audio bandwidth.
@@ -340,9 +346,10 @@
<section anchor="preskip" title="Pre-skip">
<t>
There is some amount of latency introduced during the decoding process, to
- allow for overlap in the CELT modes, stereo mixing in the SILK modes, and
- resampling, and the encoder will introduce even more latency (though the exact
- amount is not specified).
+ allow for overlap in the CELT mode, stereo mixing in the SILK mode, and
+ resampling.
+The encoder will also introduce latency (though the exact amount is not
+ specified).
Therefore, the first few samples produced by the decoder do not correspond to
real input audio, but are instead composed of padding inserted by the encoder
to compensate for this latency.
@@ -603,8 +610,8 @@
This field is <spanx style="emph">not</spanx> the sample rate to use for
playback of the encoded data.
<vspace blankLines="1"/>
-Opus has a handful of coding modes, with internal audio bandwidths of 4, 6, 8,
- 12, and 20 kHz.
+Opus can switch between internal audio bandwidths of 4, 6, 8, 12, and
+ 20 kHz.
Each packet in the stream may have a different audio bandwidth.
Regardless of the audio bandwidth, the reference decoder supports decoding any
stream at a sample rate of 8, 12, 16, 24, or 48 kHz.
@@ -760,8 +767,8 @@
mono (a single channel) or stereo (two channels) by appropriate initialization
of the decoder.
The 'coupled stream count' field indicates that the first M Opus decoders are
- to be initialized in stereo mode, and the remaining N-M decoders are to be
- initialized in mono mode.
+ to be initialized for stereo output, and the remaining N-M decoders are to be
+ initialized for mono only.
The total number of decoded channels, (M+N), MUST be no larger than 255, as
there is no way to index more channels than that in the channel mapping.
<vspace blankLines="1"/>
@@ -827,7 +834,7 @@
</t>
<t>
Each channel is assigned to a speaker location in a conventional surround
- configuration.
+ arrangement.
Specific locations depend on the number of channels, and are given below
in order of the corresponding channel indicies.
<list style="symbols">
@@ -840,8 +847,8 @@
<t>7 channels: 6.1 surround (front left, front center, front right, side left, side right, rear center, LFE).</t>
<t>8 channels: 7.1 surround (front left, front center, front right, side left, side right, rear left, rear right, LFE)</t>
</list>
-This set of surround configurations and speaker location orderings is the same
- as the one used by the Vorbis codec <xref target="vorbis-mapping"/>.
+This set of surround options and speaker location orderings is the same
+ as those used by the Vorbis codec <xref target="vorbis-mapping"/>.
The ordering is different from the one used by the
WAVE <xref target="wave-multichannel"/> and
FLAC <xref target="flac"/> formats,