Spectral band replication explained

Spectral band replication (SBR) is a technology that enhances audio or speech codecs, especially at low bit rates, and is based on harmonic redundancy in the frequency domain.

It can be combined with any audio compression codec: the codec itself transmits the lower and middle frequencies of the spectrum, while SBR replicates higher-frequency content at the decoder by transposing up harmonics from the lower and middle frequencies.[1] Some guidance information for reconstructing the high-frequency spectral envelope is transmitted as side information.
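As a rough illustration of the decoder-side idea described above, the following Python sketch fills a high band by reusing low-band spectral values and then rescaling each band so that its energy matches a transmitted envelope value. It is a toy model under stated assumptions: the helper name `replicate_high_band`, the uniform band layout, and the use of plain magnitude lists are all hypothetical, and a simple copy-up patch stands in for whatever transposition a real codec performs.

```python
import math


def replicate_high_band(low_band, envelope, band_size):
    """Toy SBR-style reconstruction (hypothetical helper, not a codec API).

    low_band  -- decoded low-band spectral values from the core codec
    envelope  -- target energy for each high-band envelope band (side info)
    band_size -- number of spectral bins per envelope band (assumed uniform)
    """
    high_band = []
    for band_index, target_energy in enumerate(envelope):
        # Copy-up patch: reuse low-band bins, wrapping around if the
        # high band needs more bins than the low band provides.
        start = (band_index * band_size) % len(low_band)
        patch = [low_band[(start + k) % len(low_band)] for k in range(band_size)]

        # Rescale the patch so its energy matches the transmitted envelope.
        patch_energy = sum(x * x for x in patch)
        gain = math.sqrt(target_energy / patch_energy) if patch_energy > 0 else 0.0
        high_band.extend(gain * x for x in patch)
    return high_band


low = [4.0, 3.0, 2.0, 1.0]   # core-codec output (made-up values)
side_info = [1.0, 0.25]      # made-up envelope energies for two bands
high = replicate_high_band(low, side_info, band_size=2)
full_spectrum = low + high   # low band as decoded, high band as reconstructed
```

Only the two envelope values travel as side information in this sketch; the fine structure of the high band is borrowed entirely from the low band.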

When needed, it also reconstructs or adaptively mixes in noise-like information in selected frequency bands in order to faithfully replicate signals that originally contained few or no tonal components.
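The noise mixing described above can be pictured with a small Python sketch. It assumes a hypothetical side-information value `noise_ratio` between 0 and 1 that states what share of a band's energy should be noise-like; the function name and the Gaussian noise source are illustrative choices, not taken from any standard.

```python
import math
import random


def mix_in_noise(replicated, noise_ratio, rng):
    """Blend noise into a replicated band (toy model, hypothetical helper).

    replicated  -- band values produced by replication (tonal content)
    noise_ratio -- assumed side-info value in [0, 1]: share of the band's
                   energy that should come from noise
    rng         -- random.Random instance used as the noise source
    """
    band_energy = sum(x * x for x in replicated)
    noise = [rng.gauss(0.0, 1.0) for _ in replicated]
    noise_energy = sum(n * n for n in noise)
    if band_energy == 0 or noise_energy == 0:
        return list(replicated)

    # The tonal part keeps (1 - ratio) of the band energy and the noise part
    # carries ratio of it. The total is only approximately preserved, because
    # the two parts are not exactly orthogonal.
    tonal_gain = math.sqrt(1.0 - noise_ratio)
    noise_gain = math.sqrt(noise_ratio * band_energy / noise_energy)
    return [tonal_gain * x + noise_gain * n for x, n in zip(replicated, noise)]


rng = random.Random(0)                       # fixed seed for repeatability
band = [1.0, -2.0, 0.5, 0.0]                 # made-up replicated band
mostly_tonal = mix_in_noise(band, 0.1, rng)  # keeps most of the copied content
mostly_noise = mix_in_noise(band, 0.9, rng)  # suits noise-like originals
```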

The SBR idea is based on the principle that the human auditory system tends to analyse higher frequencies with less accuracy; thus harmonic phenomena associated with the spectral band replication process need only be accurate in a perceptual sense, not technically or mathematically exact.

History and use

The Swedish company Coding Technologies (acquired by Dolby in 2007) developed and pioneered the use of SBR in its MPEG-2 AAC-derived codec called aacPlus, which first appeared in 2001. This codec was submitted to MPEG and formed the basis of MPEG-4 High-Efficiency AAC (HE-AAC), standardized in 2003.[2] Lars Liljeryd, Kristofer Kjörling, and Martin Dietz received the IEEE Masaru Ibuka Consumer Electronics Award in 2013 for their work developing and marketing HE-AAC.[3][4] Coding Technologies' SBR method has also been used with WMA 10 Professional to create WMA 10 Pro LBR, and with MP3 to create mp3PRO.

HE-AAC, which uses SBR, is used in broadcast systems such as DAB+, Digital Radio Mondiale (including xHE-AAC), HD Radio, and XM Satellite Radio.[5]

If the player is not capable of using the side information that has been transmitted alongside the "normal" compressed audio data, it may still be able to play the "baseband" data (e.g. sampled at 22.05 kHz instead of 44.1 kHz) as usual, resulting in a dull (since the high frequencies are missing), but otherwise mostly acceptable sound. This is, for example, the case if an mp3PRO file is played back with MP3 software incapable of utilizing the SBR information.
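The effect of this fallback on audio bandwidth follows from simple arithmetic, sketched below in Python. The function `playback_properties` is a hypothetical illustration that assumes the 2:1 arrangement from the example above (core codec at half the output sample rate); the frequencies it returns are upper bounds set by the Nyquist limit, not the bandwidth a particular encoder actually codes.

```python
def playback_properties(core_sample_rate_hz, decoder_supports_sbr):
    """Return (output_sample_rate_hz, max_audio_frequency_hz) for a toy player.

    Assumes the core codec runs at half the full output sample rate and
    that SBR, when available, restores the upper half of the spectrum.
    """
    if decoder_supports_sbr:
        output_rate = 2 * core_sample_rate_hz
    else:
        # Legacy decoder: side information is ignored, only the core plays.
        output_rate = core_sample_rate_hz
    # Nyquist limit: a sampled signal can carry frequencies up to half its rate.
    return output_rate, output_rate / 2


print(playback_properties(22050, decoder_supports_sbr=True))   # (44100, 22050.0)
print(playback_properties(22050, decoder_supports_sbr=False))  # (22050, 11025.0)
```

Without SBR, nothing above 11.025 kHz can be reproduced in this example, which is why the result sounds dull.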

Opus's CELT part performs spectral folding on the MDCT bin level, making it a far less advanced but lower-delay technique compared to SBR.[6]

Dolby Digital Plus (E-AC3) performs Spectral Extension (SPX). SPX reduces high-frequency components to metadata and is similar to the E-AC3 multichannel coupling calculation.[7] Dolby AC-4 expands the technique to Advanced Spectral Extension (A-SPX), with the option of interleaving with regular, non-extended data in the time or frequency domain. As a result, SPX can be selectively disabled for difficult portions.[8]

Methods

Encoding of SBR produces a downsampled (usually 2:1) audio signal and guidance information. In an early publication, the guiding data is described as being produced by quadrature mirror filter (QMF) analysis and an envelope estimator.[9]
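A minimal Python sketch of the envelope-estimation step follows. It assumes the filter bank analysis has already produced complex subband samples for one frame, and it simply averages the energy in each envelope band; the function name `estimate_envelope`, the `band_edges` layout, and the sample values are hypothetical. The 2:1 downsampling of the core signal is not shown.

```python
def estimate_envelope(subband_frames, band_edges):
    """Average energy per envelope band over one frame (toy estimator).

    subband_frames -- list of time slots; each slot is a list of complex
                      subband samples, as a filter bank analysis might yield
    band_edges     -- subband indices delimiting the envelope bands, e.g.
                      [4, 6, 8] describes bands 4-5 and 6-7
    """
    envelope = []
    for lo, hi in zip(band_edges, band_edges[1:]):
        total = 0.0
        count = 0
        for slot in subband_frames:
            for sample in slot[lo:hi]:
                total += abs(sample) ** 2
                count += 1
        envelope.append(total / count if count else 0.0)
    return envelope


# Two time slots of eight subbands each; subbands 4-7 form the high band.
frames = [
    [0j, 0j, 0j, 0j, 1 + 0j, 1j, 0.5 + 0j, 0j],
    [0j, 0j, 0j, 0j, -1 + 0j, 1 + 0j, 0j, 0.5j],
]
guidance = estimate_envelope(frames, band_edges=[4, 6, 8])
```

In this sketch the encoder would transmit `guidance` as side information instead of the high-band samples themselves.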

Decoding of SBR requires transposing harmonics, a case of audio time stretching and pitch scaling.[10]
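Why transposition is treated as pitch scaling, rather than as a plain frequency shift, can be seen with a little arithmetic. The Python sketch below is an illustration with made-up frequencies and hypothetical helper names: it compares shifting partials by a fixed offset with multiplying them by an integer factor, and checks whether the results still fall on the harmonic series of the original fundamental.

```python
def copy_up(frequencies_hz, shift_hz):
    """Shift every partial by a fixed offset (simple patching)."""
    return [f + shift_hz for f in frequencies_hz]


def harmonic_transpose(frequencies_hz, factor):
    """Multiply every partial by an integer factor (pitch-scaling view)."""
    return [f * factor for f in frequencies_hz]


def on_harmonic_grid(frequencies_hz, fundamental_hz):
    """True if every frequency is an integer multiple of the fundamental."""
    return all(f % fundamental_hz == 0 for f in frequencies_hz)


fundamental = 300
low_band_partials = [1800, 2100, 2400]   # harmonics 6, 7, 8 of 300 Hz

shifted = copy_up(low_band_partials, shift_hz=2000)        # [3800, 4100, 4400]
scaled = harmonic_transpose(low_band_partials, factor=2)   # [3600, 4200, 4800]

print(on_harmonic_grid(shifted, fundamental))  # False
print(on_harmonic_grid(scaled, fundamental))   # True
```

Multiplying by an integer keeps each partial on the original harmonic series, whereas a fixed shift generally does not unless the offset happens to be a multiple of the fundamental.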

Notes and References

  1. Novak, Clark. "Spectral Band Replication and aacPlus Coding - An Overview". Retrieved February 8, 2010. Archived from the original on November 30, 2010: https://web.archive.org/web/20101130235115/http://telos-systems.com/techtalk/aacplus/aacPlus_overview.pdf
  2. "Bandwidth extension, ISO/IEC 14496-3:2001/Amd 1:2003". ISO. 2003. Retrieved 2009-10-13.
  3. "IEEE Masaru Ibuka Consumer Electronics Award". IEEE.org. Retrieved 7 July 2015. Archived from the original on April 8, 2010: https://web.archive.org/web/20100408000509/http://www.ieee.org/about/awards/bios/ibuka_recipients.html
  4. "Interview with Martin Dietz, Kristofer Kjörling, and Lars Liljeryd". YouTube. Retrieved 7 July 2015.
  5. "XM Radio – Fast Facts". Retrieved February 8, 2010. Archived from the original on November 15, 2006: https://web.archive.org/web/20061115191347/http://sounds.xmradio.com/about/fast-facts/sound.xmc
  6. Valin, Jean-Marc; Maxwell, Gregory; Terriberry, Timothy B.; Vos, Koen. "High-Quality, Low-Delay Music Coding in the Opus Codec". www.xiph.org. Xiph.Org Foundation. New York, NY. October 17–20, 2013. p. 2. Retrieved 19 August 2014. Archived from the original on 14 July 2018: https://web.archive.org/web/20180714000735/http://jmvalin.ca/papers/aes135_opus_celt.pdf
  7. Andersen, Robert Loring; Crockett, B.; Davidson, G.; Davis, Mark; Fielder, L.; Turner, Stephen C.; Vinton, M.; Williams, P. "Introduction to Dolby Digital Plus, an Enhancement to the Dolby Digital Coding System". Journal of The Audio Engineering Society. 1 October 2004. Archived from the original on 2016-11-19: https://web.archive.org/web/20161119192949/https://www.dolby.com/us/en/technologies/aes-convention-paper-intro-to-dolby-digital-plus.pdf
  8. "Dolby® AC-4: Audio delivery for next-generation entertainment services".
  9. Ekstrand, Per. "Bandwidth extension of audio signals by spectral band replication". Proc. 1st IEEE Benelux Workshop on Model based Processing and Coding of Audio (MPCA-2002), Leuven, Belgium. November 2022.
  10. Zhong, Haishan; Villemoes, Lars; Ekstrand, Per; Disch, Sascha; Nagel, Frederik; Wilde, Stephan; Chong, Kok Seng; Norimatsu, Takeshi. "QMF Based Harmonic Spectral Band Replication". Audio Engineering Society. 19 October 2011. In English.