ecodis :: Efficient Audio Codecs

MPEG-4 High-Efficiency Advanced Audio Coding (HE-AAC) in Winamp
^2008–2013

MPEG-D Unified Speech and Audio Coding (USAC) and MPEG-H Audio
^2010–2016

exhale achieves «excellent» overall audio quality (MOS above 3.9) at all bit-rates,
it outperforms all other tested similar-bit-rate audio encoders at 64 and 96 kbit/s,
with only one exception, its worst-case per-item quality remains above-average,
no critical software issues were reported recently, so the implementation is stable.

3GPP Enhanced Voice Services (EVS) for Voice-over-IP Communication
^2010–2014

FSLAC: A FLAC Backward-Compatible Free Semi-Lossless Audio Coder
^{Nov–Dec 2016}

OPUS: A General-Purpose Speech and Audio Codec for Streaming
^2009–2018

Summary: Audio Coding has Matured, Future: More Post-Processing
^{May 2018}

Further, Lesser Known Audio Codecs and Links to Additional Resources
^2019–2020

AVS3, China's latest-generation Audio Video System standard. Tech-
nically, its audio coding part (also known as China 3D Audio) seems
to be similar to some parts of the MPEG-H Audio codec, thus with
probably similar compression performance. See this press release.

The Bluetooth™ LC3 and ETSI LC3plus speech and audio codecs are very low
complexity and low-delay equivalents of the 3GPP EVS codec for Bluetooth™ or
DECT enabled low-energy wireless devices like earphones. See also this page.
An interactive demo of the LC3 codec at five bit-rates is published on this page.

MPEG-4 High-Efficiency Advanced Audio Coding (HE-AAC) in Winamp
^2008–2013

MPEG-D Unified Speech and Audio Coding (USAC) and MPEG-H Audio
^2010–2016

3GPP Enhanced Voice Services (EVS) for Voice-over-IP Communication
^2010–2014

FSLAC: A FLAC Backward-Compatible Free Semi-Lossless Audio Coder
^{Nov–Dec 2016}

OPUS: A General-Purpose Speech and Audio Codec for Streaming
^2009–2018

Summary: Audio Coding has Matured, Future: More Post-Processing
^{May 2018}

Further, Lesser Known Audio Codecs and Links to Additional Resources
^2019–2020

efficient coding of digital signals

by C. Helmrich

Audio codecs which I worked on: HE-AAC Encoder in Winamp 5.6 A free high-quality MPEG-4 encoder MPEG-D USAC, MPEG-H Audio Broadcasting and streaming codecs 3GPP Enhanced Voice Services Speech & audio voice-over-IP codec Free semi-lossless audio coder Constrained VBR coding using FLAC

Other state-of-the-art codecs: OPUS codec (IETF RfC 6716) General-purpose codec for streaming

Comments, further information: My comments on audio coding Coding mature, more post-processing Further codecs and resources Other audio codecs, further reading

Wikipedia page on efficiency, 2017

On this page, some more recent and much more efficient audio compression formats are presented. I had the privilege to participate in the development of some of them.

MPEG-4 High-Efficiency Advanced Audio Coding (HE-AAC) in Winamp2008–2013

Download Winamp 5.666 (Build 3516)with Fraunhofer's HE-AAC Encoder v03.02.16

Figure 1. How to convert audio files into the HE-AACformat using the Format Converter dialog in Winamp.Note that more recent Winamp versions fromwww.winamp.com may look differently and/orthe illustrated dialogs may not exist anymore.

MPEG-D Unified Speech and Audio Coding (USAC) and MPEG-H Audio2010–2016

LG Licenses MPEG-H Software fromFraunhofer IIS (article on Hugh's News) New Products Supporting MPEG-HAudio Hitting the Market (blog article)

So it seems that we simply have to wait for more MPEG-H Audio ready hardware and software to arrive before we can draw ultimate conclusions on its coding performance. The results of the USAC and MPEG-H Audio «Baseline» verification tests are as follows:

Figure 2. Results of theformal USAC verificationtest (higher-rate stereo)conducted by MPEG. (a)Speech input, (b) mixedspeech-and-music input,(c) music input, (d) all,averaged across (a)–(c). See section 2.6 of mydissertation for details.

Figure 3. Results of the'Baseline' MPEG-H Audioverification test. (a) 2.0stereo, (b) 5.1, (c) 5.1plus 2 additional heightchannels, (d) all, avera-ges of (a)–(c). 7.0-kHz anchor, 3.5-kHzanchor. Data taken fromMPEG output document.

Download exhale 1.2.1, exhale Wiki pagesopen Extended HE-AAC encoding and documentation

exhale achieves «excellent» overall audio quality (MOS above 3.9) at all bit-rates,

it outperforms all other tested similar-bit-rate audio encoders at 64 and 96 kbit/s,

with only one exception, its worst-case per-item quality remains above-average,

no critical software issues were reported recently, so the implementation is stable.

Figure 4. Results of thefirst two personal compa-rative blind listening testsincluding exhale, reportedon the HydrogenAudioforum in summer of 2020.Left: low, center: medium,right: high coding bit-rate.The low-bit-rate scores areper-codec averages of twosubtests (classical & pop).

3GPP Enhanced Voice Services (EVS) for Voice-over-IP Communication2010–2014

Figure 5. Results of theEVS verification tests (allmono input signals) con-ducted by Nokia in 2014.See section 3 of Nokia'sIEEE paper for details.(Fig. copyright A. Rämö)Bit-rate (kbit/s)

The latest version of the EVS software encoder and decoder is available via this link:

Download EVS Software version 14.2.0floating-point software edition from 3GPP site

FSLAC: A FLAC Backward-Compatible Free Semi-Lossless Audio CoderNov–Dec 2016

Having finished the first edition of my dissertation [ Helmrich, 2017], I finally had the time for an after-work project which had long been on my to-do list: a constrained VBR (CVBR) version of the publicly available open-source lossless audio coder FLAC.

Download FSLAC 1.3.4 (Build 05-2022)Windows (32-bit), compiled with Visual Studio Download FSLAC 1.3.4 source code filereplaces the stream_encoder.c in FLAC source

OPUS: A General-Purpose Speech and Audio Codec for Streaming2009–2018

OPUS download page, Mozilla's buildsoffering libraries, executables, and source code

For all historians: some early documentation of OPUS's transform coding core, called Constrained Energy Lapped Transform (CELT), is archived here (papers from 2009). A further subjective comparison of OPUS and EVS on music content is available here.

Summary: Audio Coding has Matured, Future: More Post-ProcessingMay 2018

Further, Lesser Known Audio Codecs and Links to Additional Resources2019–2020

AVS3, China's latest-generation Audio Video System standard. Tech-nically, its audio coding part (also known as China 3D Audio) seemsto be similar to some parts of the MPEG-H Audio codec, thus withprobably similar compression performance. See this press release.

page last modified in Aug. 2024, updated IVAS release text

Audio codecs which I worked on:

HE-AAC Encoder in Winamp 5.6
A free high-quality MPEG-4 encoder

MPEG-D USAC, MPEG-H Audio
Broadcasting and streaming codecs

3GPP Enhanced Voice Services
Speech & audio voice-over-IP codec

Free semi-lossless audio coder
Constrained VBR coding using FLAC

Other state-of-the-art codecs:

OPUS codec (IETF RfC 6716)
General-purpose codec for streaming

Comments, further information:

My comments on audio coding
Coding mature, more post-processing

Further codecs and resources
Other audio codecs, further reading

MPEG-4 High-Efficiency Advanced Audio Coding (HE-AAC) in Winamp
^2008–2013

Download Winamp 5.666 (Build 3516)
with Fraunhofer's HE-AAC Encoder v03.02.16

Figure 1. How to convert audio files into the HE-AAC
format using the Format Converter dialog in Winamp.

_{Note that more recent Winamp versions from
www.winamp.com may look differently and/or
the illustrated dialogs may not exist anymore.}

MPEG-D Unified Speech and Audio Coding (USAC) and MPEG-H Audio
^2010–2016

LG Licenses MPEG-H Software from
Fraunhofer IIS (article on Hugh's News)

New Products Supporting MPEG-H
Audio Hitting the Market (blog article)

Figure 2. Results of the
formal USAC verification
test (higher-rate stereo)
conducted by MPEG. (a)
Speech input, (b) mixed
speech-and-music input,
(c) music input, (d) all,
averaged across (a)–(c).

See section 2.6 of my
dissertation for details.

Figure 3. Results of the
'Baseline' MPEG-H Audio
verification test. (a) 2.0
stereo, (b) 5.1, (c) 5.1
plus 2 additional height
channels, (d) all, avera-
ges of (a)–(c). 7.0-
kHz anchor, 3.5-kHz
anchor. Data taken from
MPEG output document.

Download exhale 1.2.1, exhale Wiki pages
open Extended HE-AAC encoding and documentation

Figure 4.Results of the
first two personal compa-
rative blind listening tests
including exhale, reported
on the HydrogenAudio
forum in summer of 2020.

Left: low, center: medium,
right: high coding bit-rate.
The low-bit-rate scores are
per-codec averages of two
subtests (classical & pop).

3GPP Enhanced Voice Services (EVS) for Voice-over-IP Communication
^2010–2014

Figure 5.Results of the
EVS verification tests (all
mono input signals) con-
ducted by Nokia in 2014.

See section 3 of Nokia's
IEEE paper for details.

(Fig. copyright A. Rämö)
_{Bit-rate (kbit/s)}

Download EVS Software version 14.2.0
floating-point software edition from 3GPP site

FSLAC: A FLAC Backward-Compatible Free Semi-Lossless Audio Coder
^{Nov–Dec 2016}

Download FSLAC 1.3.4 (Build 05-2022)
Windows (32-bit), compiled with Visual Studio

Download FSLAC 1.3.4 source code file
replaces the stream_encoder.c in FLAC source

OPUS: A General-Purpose Speech and Audio Codec for Streaming
^2009–2018

OPUS download page, Mozilla's builds
offering libraries, executables, and source code

Summary: Audio Coding has Matured, Future: More Post-Processing
^{May 2018}

Further, Lesser Known Audio Codecs and Links to Additional Resources
^2019–2020

AVS3, China's latest-generation Audio Video System standard. Tech-
nically, its audio coding part (also known as China 3D Audio) seems
to be similar to some parts of the MPEG-H Audio codec, thus with
probably similar compression performance. See this press release.