ref: 63396f3904ac03379372f64d337c5f4038797841
parent: 849669547e7282366119fac5b18f164475316f0a
author: Jean-Marc Valin <[email protected]>
date: Tue Jun 30 19:54:18 EDT 2009
ietf doc: misc
--- a/doc/ietf/draft-valin-celt-codec.xml
+++ b/doc/ietf/draft-valin-celt-codec.xml
@@ -243,9 +243,15 @@
<section anchor="CELT Encoder" title="CELT Encoder">
-<!--Insert encoder overview-->
+<t>
+The basic block diagram of the CELT encoder is illustrated in <xref target="encoder-diagram"></xref>.
+The encoder contains most of the building blocks of the decoder and can,
+with very little extra computation, compute the signal that would be decoded by the decoder.
+The top-level function for encoding a CELT frame in the reference implementation is
+celt_encode() (<xref target="celt.c">celt.c</xref>).
+</t>
-<figure>
+<figure anchor="encoder-diagram">
<artwork>
<![CDATA[
+-----------+ +--+
@@ -274,13 +280,9 @@
+--------------------------------------+--------------------+
]]>
</artwork>
-<postamble>Overview of the CELT encoder</postamble>
+<postamble>Block diagram of the CELT encoder</postamble>
</figure>
-<t>The top-level function for encoding a CELT frame in the reference implementation is
-celt_encode() (<xref target="celt.c">celt.c</xref>).
-</t>
-
<!--
<texttable anchor="bitstream">
<ttcol align='center'>Parameter(s)</ttcol>
@@ -322,7 +324,7 @@
</section> <!-- pre-emphasis -->
-<section anchor="range-coder" title="Range Coder">
+<section anchor="range-encoder" title="Range Coder">
<t>
CELT uses an entropy coder based upon <xref target="range-coding"></xref>,
which is itself a rediscovery of the FIFO arithmetic code introduced by <xref target="coding-thesis"></xref>.
@@ -335,7 +337,7 @@
The range coder also acts as the bit-packer for CELT. It is
used in three different ways to encode:
<list style="symbols">
-<t>symbols with a probability model using ec_encode() (<xref target="rangedec.c">rangeenc.c</xref>)</t>
+<t>entropy-coded symbols with a fixed probability model using ec_encode() (<xref target="rangedec.c">rangeenc.c</xref>)</t>
<t>integers from 0 to 2^M-1 using ec_enc_uint() or ec_enc_bits() (<xref target="entenc.c">encenc.c</xref>)</t>
<t>integers from 0 to N-1 (where N is not a power of two) using ec_enc_uint() (<xref target="entenc.c">encenc.c</xref>)</t>
</list>
@@ -650,10 +652,11 @@
<section anchor="CELT-decoder" title="CELT Decoder">
<t>
-Like for most audio codecs, the CELT decoder is less complex than the encoder.
+Like for most audio codecs, the CELT decoder is less complex than the encoder, as can be
+observed in the decoder block diagram in <xref target="decoder-diagram"></xref>.
</t>
-<figure>
+<figure anchor="decoder-diagram">
<artwork>
<![CDATA[
+--+
@@ -679,7 +682,7 @@
+--------------------+
]]>
</artwork>
-<postamble>Overview of the CELT decoder</postamble>
+<postamble>Block diagram of the CELT decoder</postamble>
</figure>
<t>
@@ -692,7 +695,8 @@
<section anchor="range-decoder" title="Range Decoder">
<t>
-derf?
+The range decoder extracts the symbols and integers encoded using the range encoder
+<xref target="range-encoder"></xref>.
</t>
</section>
@@ -706,15 +710,17 @@
<t>
The spherical codebook is decoded by alg_unquant() (<xref target="vq.c">vq.c</xref>).
The index of the PVQ entry is obtained from the range coder and converted to
-a pulse vector by decode_pulses() (<xref target="cwrs.c">cwrs.c</xref>). Derf??
+a pulse vector by decode_pulses() (<xref target="cwrs.c">cwrs.c</xref>).
</t>
+<t>The decoded normalised vector for each band is equal to</t>
+<t>X' = P + g_f * y,</t>
+<t>where g_f = ( sqrt( (y^T*P)^2 + ||y||^2*(1-||P||^2) ) - y^T*P ) / ||y||^2. </t>
+
<t>
-mix_pitch_and_residual() (<xref target="vq.c">vq.c</xref>).
+This operation is implemented in mix_pitch_and_residual() (<xref target="vq.c">vq.c</xref>),
+which is the same function as used in the encoder.
</t>
-</section>
-
-<section anchor="index-decoding" title="Index Decoding">
</section>
<section anchor="denormalization" title="Denormalization">