shithub: opus

--- a/celt/ecintrin.h

+++ b/celt/ecintrin.h

@@ -38,21 +38,6 @@

   We define macros for them to allow easy incorporation of these non-ANSI

    features.*/

-/*Note that we do not provide a macro for abs(), because it is provided as a

-   library function, which we assume is translated into an intrinsic to avoid

-   the function call overhead and then implemented in the smartest way for the

-   target platform.

-  With modern gcc (4.x), this is true: it uses cmov instructions if the

-   architecture supports it and branchless bit-twiddling if it does not (the

-   speed difference between the two approaches is not measurable).

-  Interestingly, the bit-twiddling method was patented in 2000 (US 6,073,150)

-   by Sun Microsystems, despite prior art dating back to at least 1996:

-   http://web.archive.org/web/19961201174141/www.x86.org/ftp/articles/pentopt/PENTOPT.TXT

-  On gcc 3.x, however, our assumption is not true, as abs() is translated to a

-   conditional jump, which is horrible on deeply piplined architectures (e.g.,

-   all consumer architectures for the past decade or more) when the sign cannot

-   be reliably predicted.*/

 /*Modern gcc (4.x) can compile the naive versions of min and max with cmov if

    given an appropriate architecture, but the branchless bit-twiddling versions

    are just as fast, and do not require any special target architecture.

--- a/celt/entdec.c

+++ b/celt/entdec.c

@@ -47,10 +47,6 @@

   This only seems true when using near-infinite precision arithmetic so that

    the process is carried out with no rounding errors.

-  IBM (the author's employer) never sought to patent the idea, and to my

-   knowledge the algorithm is unencumbered by any patents, though its

-   performance is very competitive with proprietary arithmetic coding.

-  The two are based on very similar ideas, however.

   An excellent description of implementation details is available at

    http://www.arturocampos.com/ac_range.html

   A recent work \cite{MNW98} which proposes several changes to arithmetic

--- a/doc/draft-ietf-codec-opus.xml

+++ b/doc/draft-ietf-codec-opus.xml

@@ -516,12 +516,18 @@

 Support for that variant is OPTIONAL.

 </t>

+<t>

+All bit diagrams in this document number the bits so that bit 0 is the most

+ significant bit of the first byte, and bit 7 is the least significant.

+Bit 8 is thus the most significant bit of the second byte, etc.

+</t>

 <section anchor="toc_byte" title="The TOC Byte">

<t>

 An Opus packet begins with a single-byte table-of-contents (TOC) header that

  signals which of the various modes and configurations a given packet uses.

-It is composed of a frame count code, "c", a stereo flag, "s", and a

- configuration number, "config", arranged as illustrated in

+It is composed of a configuration number, "config", a stereo flag, "s", and a

+ frame count code, "c", arranged as illustrated in

  <xref target="toc_byte_fig"/>.

 A description of each of these fields follows.

 </t>

@@ -531,7 +537,7 @@

  0 1 2 3 4 5 6 7

 +-+-+-+-+-+-+-+-+

-| c |s| config  |

+| config  |s| c |

 +-+-+-+-+-+-+-+-+

 ]]></artwork>

 </figure>

@@ -656,7 +662,7 @@

  0                   1                   2                   3

  0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

-|0|0|s| config  |                                               |

+| config  |s|0|0|                                               |

 +-+-+-+-+-+-+-+-+                                               |

 |                    Compressed frame 1 (N-1 bytes)...          :

 :                                                               |

@@ -680,7 +686,7 @@

  0                   1                   2                   3

  0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

-|1|0|s| config  |                                               |

+| config  |s|0|1|                                               |

 +-+-+-+-+-+-+-+-+                                               :

 |             Compressed frame 1 ((N-1)/2 bytes)...             |

 :                               +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

@@ -716,7 +722,7 @@

  0                   1                   2                   3

  0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

-|0|1|s| config  | N1 (1-2 bytes):                               |

+| config  |s|1|0| N1 (1-2 bytes):                               |

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+                               :

 |               Compressed frame 1 (N1 bytes)...                |

 :                               +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

@@ -737,8 +743,8 @@

  Opus layer, rather than at the transport layer.

 Code 3 packets MUST have at least 2 bytes.

 The TOC byte is followed by a byte encoding the number of frames in the packet

- in bits 0 to 5 (marked "M" in the figure below), with bit 6 indicating whether

- or not Opus padding is inserted (marked "p" in the figure below), and bit 7

+ in bits 2 to 7 (marked "M" in the figure below), with bit 1 indicating whether

+ or not Opus padding is inserted (marked "p" in the figure below), and bit 0

  indicating VBR (marked "v" in the figure below).

 M MUST NOT be zero, and the audio duration contained within a packet MUST NOT

  exceed 120&nbsp;ms.

@@ -752,7 +758,7 @@

  0 1 2 3 4 5 6 7

 +-+-+-+-+-+-+-+-+

-|     M     |p|v|

+|v|p|     M     |

 +-+-+-+-+-+-+-+-+

 ]]></artwork>

 </figure>

@@ -764,14 +770,25 @@

 If the value is 255, then the size of the additional padding is 254&nbsp;bytes,

  plus the padding value encoded in the next byte.

 There MUST be at least one more byte in the packet in this case.

-By using the value 255 multiple times, it is possible to create a packet of any

- specific, desired size.

 The additional padding bytes appear at the end of the packet, and MUST be set

  to zero by the encoder to avoid creating a covert channel.

 The decoder MUST accept any value for the padding bytes, however.

-Let P be the total amount of padding, including both the trailing padding bytes

- themselves and the header bytes used to indicate how many trailing bytes there

- are.

+</t>

+<t>

+Although this encoding provides multiple ways to indicate a given number of

+ padding bytes, each uses a different number of bytes to indicate the padding

+ size, and thus will increase the total packet size by a different amount.

+For example, to add 255 bytes to a packet, set the padding bit, p, to 1, insert

+ a single byte after the frame count byte with a value of 254, and append 254

+ padding bytes with the value zero to the end of the packet.

+To add 256 bytes to a packet, set the padding bit to 1, insert two bytes after

+ the frame count byte with the values 255 and 0, respectively, and append 254

+ padding bytes with the value zero to the end of the packet.

+By using the value 255 multiple times, it is possible to create a packet of any

+ specific, desired size.

+Let P be the number of header bytes used to indicate the padding size plus the

+ total amount of padding bytes (i.e., the total number of bytes added to the

+ packet).

 Then P MUST be no more than N-2.

 </t>

<t>

@@ -788,7 +805,7 @@

  0                   1                   2                   3

  0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

-|1|1|s| config  |     M     |p|0|  Padding length (Optional)    :

+| config  |s|1|1|0|p|     M     |  Padding length (Optional)    :

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

 |                                                               |

 :            Compressed frame 1 ((N-2-P)/M bytes)...            :

@@ -832,7 +849,7 @@

  0                   1                   2                   3

  0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

-|1|1|s| config  |     M     |p|1| Padding length (Optional)     :

+| config  |s|1|1|1|p|     M     | Padding length (Optional)     :

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

 : N1 (1-2 bytes): N2 (1-2 bytes):     ...       :     N[M-1]    |

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

@@ -869,7 +886,7 @@

  0                   1                   2                   3

  0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

-|0|0|0|    1    |               compressed data...              :

+|    1    |0|0|0|               compressed data...              :

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

 ]]></artwork>

 </figure>

@@ -883,7 +900,7 @@

  0                   1                   2                   3

  0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

-|1|0|0|   29    |               compressed data...              :

+|   29    |0|0|1|               compressed data...              :

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

 ]]></artwork>

 </figure>

@@ -897,7 +914,7 @@

  0                   1                   2                   3

  0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

-|1|1|0|   15    |     2     |0|1|      N1       |               |

+|   15    |0|1|1|1|0|     2     |      N1       |               |

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+               |

 |                       compressed data...                      :

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

@@ -913,18 +930,22 @@

  0                   1                   2                   3

  0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

-|1|1|1|   31    |     4     |0|0|      compressed data...       :

+|   31    |1|1|1|0|0|     4     |      compressed data...       :

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

 ]]></artwork>

 </figure>

 </section>

-<section title="Extending Opus">

+<section title="Receiving Malformed Packets">

<t>

 A receiver MUST NOT process packets which violate any of the rules above as

  normal Opus packets.

 They are reserved for future applications, such as in-band headers (containing

  metadata, etc.).

+Packets which violate these constraints may cause implementations of

+ <em>this</em> specification to treat them as malformed, and discard them.

+</t>

+<t>

 These constraints are summarized here for reference:

 <list style="symbols">

 <t>Packets are at least one byte.</t>

@@ -935,9 +956,9 @@

  that length is no larger than the number of bytes remaining in the packet.</t>

 <t>Code 3 packets contain at least one frame, but no more than 120&nbsp;ms of

  audio total.</t>

-<t>The length of a CBR code 3 packet, N, is at least two bytes, the size of the

- padding, P (including both the padding length bytes in the header and the

- trailing padding bytes) is no more than N-2, and the frame count, M, satisfies

+<t>The length of a CBR code 3 packet, N, is at least two bytes, the number of

+ bytes added to indicate the padding size plus the trailing padding bytes

+ themselves, P, is no more than N-2, and the frame count, M, satisfies

  the constraint that (N-2-P) is a non-negative integer multiple of M.</t>

 <t>VBR code 3 packets are large enough to contain all the header bytes (TOC

  byte, frame count byte, any padding length bytes, and any frame length bytes),

@@ -999,8 +1020,8 @@

 <figure anchor="rawbits-example" title="Illustrative example of packing range

  coder and raw bits data">

 <artwork align="center"><![CDATA[

-               0               1               2               3

- 7 6 5 4 3 2 1 0 7 6 5 4 3 2 1 0 7 6 5 4 3 2 1 0 7 6 5 4 3 2 1 0

+ 0                   1                   2                   3

+ 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

 | Range coder data (packed MSB to LSB) ->                       :

 +                                                               +

@@ -1160,8 +1181,8 @@

 <figure anchor="finalize-example" title="Illustrative example of raw bits

  overlapping range coder data">

 <artwork align="center"><![CDATA[

-               n              n+1             n+2             n+3

- 7 6 5 4 3 2 1 0 7 6 5 4 3 2 1 0 7 6 5 4 3 2 1 0 7 6 5 4 3 2 1 0

+ n              n+1             n+2             n+3

+ 0 1 2 3 4 5 6 7 0 1 2 3 4 5 6 7 0 1 2 3 4 5 6 7 0 1 2 3 4 5 6 7

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

 :     | <----------- Overlap region ------------> |             :

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

@@ -3324,6 +3345,10 @@

  coefficient to 32767, though it is both less than 0.999 and, for

  k&nbsp;&gt;&nbsp;0 when maxabs_Q12 is much greater than 32767, still slightly

  too large.

+The upper bound on maxabs_Q12, 163838, was chosen because it is equal to

+ ((2**31&nbsp;-&nbsp;1)&nbsp;&gt;&gt;&nbsp;14)&nbsp;+&nbsp;32767, i.e., the

+ largest value of maxabs_Q12 that would not overflow the numerator in the

+ equation above when stored in a signed 32-bit integer.

 </t>

<t>

 silk_bwexpander_32() (bwexpander_32.c) performs the bandwidth expansion (again,

@@ -7801,7 +7826,7 @@

  0                   1                   2                   3

  0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

-|0|0|s| config  | N1 (1-2 bytes):                               |

+| config  |s|0|0| N1 (1-2 bytes):                               |

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+                               |

 |               Compressed frame 1 (N1 bytes)...                :

 :                                                               |

@@ -7816,7 +7841,7 @@

  0                   1                   2                   3

  0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

-|1|0|s| config  | N1 (1-2 bytes):                               |

+| config  |s|0|1| N1 (1-2 bytes):                               |

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+                               :

 |               Compressed frame 1 (N1 bytes)...                |

 :                               +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

@@ -7835,7 +7860,7 @@

  0                   1                   2                   3

  0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

-|0|1|s| config  | N1 (1-2 bytes): N2 (1-2 bytes :               |

+| config  |s|1|0| N1 (1-2 bytes): N2 (1-2 bytes :               |

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+               :

 |               Compressed frame 1 (N1 bytes)...                |

 :                               +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

@@ -7854,7 +7879,7 @@

  0                   1                   2                   3

  0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

-|1|1|s| config  |     M     |p|0| Pad len (Opt) : N1 (1-2 bytes):

+| config  |s|1|1|0|p|     M     | Pad len (Opt) : N1 (1-2 bytes):

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

 |                                                               |

 :               Compressed frame 1 (N1 bytes)...                :

@@ -7883,7 +7908,7 @@

  0                   1                   2                   3

  0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

-|1|1|s| config  |     M     |p|1| Padding length (Optional)     :

+| config  |s|1|1|1|p|     M     | Padding length (Optional)     :

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

 : N1 (1-2 bytes):     ...       :     N[M-1]    |     N[M]      :

 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+