Comparison of audio coding formats
The following tables compare general and technical information for a variety of audio coding formats.
For listening tests comparing the perceived audio quality of audio formats and codecs, see the article Codec listening test.
General information
Operating system support
Multimedia frameworks support
| Audio compression format | ACM | DirectShow | QuickTime | GStreamer | FFmpeg | Media Foundation |
| 8SVX | ||||||
| AAC | ||||||
| AC3 | ||||||
| AC4 | ||||||
| ALS | ||||||
| AMR | ||||||
| ALAC | ||||||
| aptX | ||||||
| ATRAC3 | ||||||
| ATRAC3plus | ||||||
| CELT | ||||||
| Cook Codec | ||||||
| DCA aka DTS | ||||||
| DTS-HD | ||||||
| E-AC-3 | ||||||
| FLAC | ||||||
| L2HC | ||||||
| HE-AAC | ||||||
| Monkey's Audio | ||||||
| WavPack | ||||||
| Shorten | ||||||
| Sonic Audio | ||||||
| MP3 | ||||||
| RealAudio | ||||||
| Speex | ||||||
| Vorbis | ||||||
| La ??? | ||||||
| LPAC | ||||||
| Windows Media Audio v1 | ||||||
| Windows Media Audio v2 | ||||||
| Windows Media Audio Pro | ||||||
| Windows Media Audio Lossless | ||||||
| Windows Media Audio Voice | ||||||
| Musepack | ||||||
| Meridian Lossless Packing | ||||||
| Codec in Flash | ||||||
| OptimFROG | ||||||
| Truespeech | ||||||
| True Audio | ||||||
| QCELP | ||||||
Technical details
| Audio compression format | Algorithm | Sample rate | Bit rate | Latency | CBR | VBR | Stereo | Multichannel |
| AAC | MDCT, Hybrid Subband | 8–192 kHz; 7.35 kHz is also defined but rarely used | 8–529 kbit/s 8–576 kbit/s | 20–405 ms | : Dual, Mid/Side, Intensity, Parametric | : Up to 48 channels | ||
| AC3 | MDCT | 32, 44.1, 48 kHz | 32–640 kbit/s | 32–48 ms | : Up to 6 channels | |||
| AC4 | MDCT | 48 kHz | 24–1536 kbit/s | ? | : Up to 24 channels | |||
| AMBE | 8 kHz | 2–9.6 kbit/s | ||||||
| AMR | ACELP | 8 kHz | 4.75, 5.15, 5.90, 6.70, 7.40, 7.95, 10.20, 12.20 kbit/s | 25 ms | ||||
| AMR-WB | ACELP | 16 kHz | 6.60, 8.85, 12.65, 14.25, 15.85, 18.25, 19.85, 23.05, 23.85 kbit/s | 25 ms | : only in MPEG-4 Part 12 container | |||
| AMR-WB+ | ACELP | 8, 11.025, 16, 22.05, 32, 44.1, 48 kHz | 6–36 kbit/s 7–48 kbit/s | 60–90 ms | ||||
| aptX | Subband ADPCM | 24–48 kHz | 192–384 kbit/s | 2 ms | : Up to 8 channels | |||
| ATRAC1 | MDCT-Hybrid Subband | 44.1 kHz | 292 kbit/s | >100 ms | : Dual Only | |||
| ATRAC3 | MDCT-Hybrid Subband | 44.1 kHz | 66, 105, 132, 146, 176, 264, 352 kbit/s | >100 ms | : Dual, Mid/Side | |||
| ATRAC3plus | MDCT-Hybrid Subband | 44.1, 48 kHz | 32–768 kbit/s | >100 ms | : Up to 8 channels | |||
| ATRAC9 | MDCT-Hybrid Subband | 12, 24, 48 kHz | 36–672 kbit/s | >100 ms | : Up to 8 channels | |||
| BroadVoice | Two-Stage Noise Feedback Coding | 8, 16 kHz | 16, 32 kbit/s | 5 ms | ||||
| Codec2 | Speech | 8 kHz | 0.7, 1.2, 1.3, 1.4, 1.6, 2.4, 3.2 kbit/s | 20–40 ms | ||||
| E-AC3 | MDCT | 32, 44.1, 48 kHz | 32–6144 kbit/s | 5.33–48 ms | : Up to 15 channels | |||
| E-aptX | Subband ADPCM | 15–48 kHz | 60, 384, 767, 1024 kbit/s, 1.28 Mbit/s | : Up to 8 channels | ||||
| Encodec | Neural networks | 24 kHz 48 kHz | 1.5, 3, 6, 12, 24 kbit/s | ? | ||||
| EVS | Speech | 8, 16, 32, 48 kHz | 5.9, 7.2, 8, 9.6, 13.2, 16.4, 24.4, 32, 48, 64, 96, 128 kbit/s + AMR-WB bitrate modes | 20 ms | ||||
| GSM-HR | VSELP | 8 kHz | 5.6 kbit/s | 25 ms | ||||
| GSM-FR | RPE-LTP | 8 kHz | 13 kbit/s | 20–30 ms | ||||
| GSM-EFR | ACELP | 8 kHz | 12.2 kbit/s | 20–30 ms | ||||
| HE-AAC | 22–96 kHz | 16–80 kbit/s | ~130 ms | : Dual, Mid/Side, Intensity, Parametric | : Up to 48 channels | |||
| HVXC | Speech | 8 kHz | 2, 4 kbit/s | 36 ms | ||||
| iLBC | Block Independent LPC | 8 kHz | 13.33, 15.20 kbit/s | 25, 40 ms | ||||
| iSAC | Transform coding | 16, 32 kHz | 10–52 kbit/s | 33, 63 ms | ||||
| LC3 | Speech | 8, 16, 24, 32, 48 kHz | 16–320 kbit/s | 7.5–10 ms | ||||
| LDAC | Subband ADPCM | 44.1, 48, 88.2, 96, 176.4, 192 kHz | 303, 606, 909 kbit/s 330, 660, 990 kbit/s | ? | ||||
| LHDC | Subband ADPCM | 44.1, 48, 96 kHz | 400, 560, 900 kbit/s | ? | ||||
| L2HC | Subband ADPCM | 44.1, 48, 96, 192 kHz | 400, 560, 960, 1920 kbit/s | |||||
| Lyra | Speech | 8, 16, 32, 48 kHz | 3.2, 6, 9.2 kbit/s | 20 ms | ||||
| MP3 | MDCT, Hybrid Subband | 8, 11.025, 12, 16, 22.05, 24, 32, 44.1, 48 kHz | 8, 16, 24, 32, 40, 48, 56, 64, 80, 96, 112, 128, 144, 160, 192, 224, 256, 288, 320 kbit/s | >100 ms | : Dual, Mid/Side, Intensity | |||
| MPEG-1 Audio Layer II | Subband | 32, 44.1, 48 kHz | 32, 48, 56, 64, 80, 96, 112, 128, 160, 192, 224, 256, 320, 384 kbit/s | : but decoders are not required to support it | : Dual, Intensity | |||
| MPEG-2 Audio Layer II | Subband | 16, 22.05, 24 kHz | 8, 16, 24, 32, 40, 48, 56, 64, 80, 96, 112, 128, 144, 160 kbit/s | : but decoders are not required to support it | : up to 5 full range audio channels and an LFE-channel with MPEG Multichannel | |||
| Musepack | Subband | 32, 37.8, 44.1, 48 kHz | 20–350 kbit/s | : Up to 8 channels | ||||
| Opus | MDCT, LPC, LTP | 8–48 kHz | 6–510 kbit/s | 5–66.5 ms | : Up to 255 channels | |||
| RealAudio | MDCT | Varies | Varies | Varies | : Up to 6 channels | |||
| Satin | Speech | 8, 16, 32, 48 kHz | 6–36 kbit/s | ? | ||||
| SILK | LPC, LTP | 8, 12, 16, 24 kHz | 6–40 kbit/s | 25 ms | ||||
| Siren 7 | Derived from PT716plus, MLT | 16 kHz | 16, 24, 32 kbit/s | 40 ms | ||||
| Siren 14 | MLT | 32 kHz | 24, 32, 48 kbit/s 48, 64, 96 kbit/s | 40 ms | ||||
| Siren 22 | MLT | 48 kHz | 32, 48, 64 kbit/s 64, 96, 128 kbit/s | 40 ms | ||||
| Neural networks | 24, 32, 44.1 kHz | 0.98, 1.9, 2.6 kbit/s | ||||||
| Speex | CELP | 8, 16, 32 kHz | 2.15–24.6 kbit/s 4–44.2 kbit/s | 30 ms 34 ms | : Intensity | |||
| Neural networks; a modified version of Descript Audio Codec, extended for stereo with a transformer model to reduce the bit rate further while keeping quality high | 44.1 kHz; other sample rates uncertain | ~0.45–5.5 kbit/s ~0.6–7.5 kbit/s | ? | |||||
| USAC | ? | 7.35, 8, 11.025, 12, 16, 19.2, 22.05, 24, 32, 38.4, 44.1, 48, 57.6, 64, 88.2, 96 kHz | 6–128 kbit/s 12–320 kbit/s | |||||
| VMR-WB | ACELP | 16 kHz | 8.55, 4.0, 0.8, 13.3, 6.2, 2.7, 1.0 kbit/s | 33.75 ms | ||||
| Vorbis | MDCT | 8–192 kHz | 45–500 kbit/s | >100 ms | /ABR | : Dual, Lossless, Phase, Point | : Up to 255 channels | |
| WavPack Lossy | Prediction, Quantization | 1 Hz to 16.777216 MHz | 196 kbit/s and up in lossy mode | 3523.8 ms | : Up to 256 channels | |||
| Windows Media Audio Standard | MDCT | 8, 11.025, 16, 22.05, 32, 44.1, 48 kHz | 8–768 kbit/s | >100 ms | ||||
| Windows Media Audio Pro | MDCT | 8, 11.025, 16, 22.05, 32, 44.1, 48, 88.2, 96 kHz | 4–768 kbit/s | >100 ms | : At least 8 channels, expandable | |||
| Windows Media Audio Voice | ACELP? | 8, 11.025, 16, 22.05 kHz officially | 4–20 kbit/s officially |
| Audio compression format | Algorithm | Sample rate | Bits per sample | Latency | Stereo | Multichannel |
| ALAC | Lossless | 1–384 kHz | 16, 20, 24, 32 | : Up to 8 channels | ||
| ALS | Lossless | 0–4 GHz | 8, 16, 24, 32 | : Up to 65536 channels | ||
| ATRAC Advanced Lossless | Lossless | 44.1 kHz | 16 | |||
| Dolby TrueHD | Lossless | 48, 96, 192 kHz | 16, 20, 24 | : 5.1, 7.1, and Dolby Atmos object-based audio | ||
| DTS-HD Master Audio | Lossless | 48, 96, 192 kHz | 16, 24 | : Up to 7.1 | ||
| FLAC | Lossless | 1 Hz to 1.048575 MHz | 4–32 | 4.3–92 ms | : Up to 8 channels | |
| HD-AAC | ||||||
| Lossless | 0–4 GiHz | 16 | ? | |||
| L2HC | Lossless | 44.1 kHz | 16 | |||
| Monkey's Audio | Lossless | 1–655.350 kHz | 8, 16, 24, 32 and 32-bit float | 1670 to 26,749 ms | : Up to 32 channels since version 5.50 | |
| OptimFROG | Lossless | 0–4 GiHz | 8, 16, 24, 32 | |||
| OSQ | Lossless | 6, 8, 11.025, 16, 22.05, 32, 44.1, 48, 64, 88.2, 96 kHz | 8, 16, 20, 24 | |||
| RealAudio Lossless | Lossless | 8, 11.025, 16, 22.05, 32, 44.1 kHz | 16 | Varies | ||
| Lossless | 1–48 kHz | 1–16 | ? | |||
| Lossless | 8–192 kHz | 8, 16, 24 | ? | : Up to 6 channels | ||
| True Audio | Lossless | 0–4 GiHz | 8, 16, 24 | approx. 1045 ms | : Up to 16, but no channel allocation | |
| WavPack Lossless | Lossless, Hybrid | 1 Hz to 1 GiHz | Up to 32, and 1-bit DSD. | : Up to 256 channels | ||
| Windows Media Audio Lossless | Lossless | 8, 11.025, 16, 22.05, 32, 44.1, 48, 88.2, 96 kHz | 16, 24 | >100 ms | : Up to 6 channels |
| Audio compression format | Algorithm | Sample rate | Bit rate | Bits per sample | Latency | CBR | VBR | Stereo | Multichannel |
| G.711 | companding A-law or μ-law, PCM | 8 kHz | 64 kbit/s | 8 bit | 125 μs | ||||
| G.711.0 | Lossless compression of G.711 | 8 kHz | 0.2–65.6 kbit/s | 8 bit | 5–40 ms | ||||
| G.711.1 | MDCT, A-law, μ-law | 8, 16 kHz | 64, 80, 96 kbit/s | 16 bit | 11.875 ms | ||||
| G.718 | CELP, MDCT, Lossy | 8, 16 kHz | 8, 12, 12.65, 16, 24, 32 kbit/s | 16 bit | 42.875–43.875 ms | ||||
| G.718B | CELP, MDCT, Lossy, Sinusoidal Coding | 32 kHz | 36, 40, 48 kbit/s | 16 bit | 49.625 ms | ||||
| G.719 | MDCT, Lossy | 48 kHz | 32–88 kbit/s in 4 kbit/s steps, 88–128 kbit/s in 8 kbit/s steps | 16 bit | 40 ms | : only in MPEG-4 Part 12 container | : only in MPEG-4 Part 12 container | ||
| G.721 | ADPCM, Lossy | 8 kHz | 32 kbit/s | 13 bit | |||||
| G.722 | sub-band ADPCM, Lossy | 16 kHz | 64 kbit/s | 14 bit | 4 ms | ||||
| G.722.1 | Modulated Lapped Transform, Lossy | 16 kHz | 24, 32 kbit/s | 16 bit | 40 ms | ||||
| G.722.1C | Modulated Lapped Transform, Lossy | 32 kHz | 24, 32, 48 kbit/s | 16 bit | 40 ms | ||||
| G.722.2 | multi-rate wideband ACELP, Lossy | 16 kHz | 6.60, 8.85, 12.65, 14.25, 15.85, 18.25, 19.85, 23.05, 23.85 kbit/s | 14 bit | 25 ms | : only in MPEG-4 Part 12 container | |||
| G.723 | ADPCM, Lossy | 8 kHz | 24, 40 kbit/s | 13 bit | |||||
| G.723.1 | MP-MLQ, ACELP, Lossy | 8 kHz | 5.3, 6.3 kbit/s | 13 bit | 37.5 ms | ||||
| G.726 | ADPCM, Lossy | 8 kHz | 16, 24, 32, 40 kbit/s | 13 bit | 125 μs | ||||
| G.727 | ADPCM, Lossy | 8 kHz | 16, 24, 32, 40 kbit/s | 13 bit | |||||
| G.728 | low-delay CELP, Lossy | 8 kHz | 16 kbit/s | 13 bit | 0.625 ms | ||||
| G.729 | CS-ACELP, Lossy | 8 kHz | 8 kbit/s | 13 bit | 15 ms | ||||
| G.729D | CS-ACELP, Lossy | 8 kHz | 6.4 kbit/s | 13 bit | |||||
| G.729E | CS-ACELP, Lossy | 8 kHz | 11.8 kbit/s | 13 bit | 15 ms | ||||
| G.729.1 | CELP, TDBWE, TDAC, Lossy | 8, 16 kHz | 8 kbit/s, 12–32 kbit/s in 2 kbit/s steps | 16 bit | 48.9375 ms |
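Several figures in the tables above follow from simple arithmetic on sample rate and sample size. The sketch below is illustrative only: the function names are hypothetical, and the CD audio parameters (44.1 kHz, 16 bit, stereo) are an assumption brought in for comparison rather than a row from the tables. It shows how the 64 kbit/s and 125 μs entries for G.711 correspond to 8-bit samples at 8 kHz with a delay of one sample period, and how a lossy bit rate can be related to an uncompressed PCM stream.

```python
def pcm_bit_rate_kbit_s(sample_rate_hz, bits_per_sample, channels=1):
    """Bit rate of an uncompressed PCM stream, in kbit/s."""
    return sample_rate_hz * bits_per_sample * channels / 1000


def sample_period_us(sample_rate_hz):
    """Duration of one sample, in microseconds."""
    return 1_000_000 / sample_rate_hz


def compression_ratio(uncompressed_kbit_s, coded_kbit_s):
    """How many times smaller the coded stream is than the PCM source."""
    return uncompressed_kbit_s / coded_kbit_s


# G.711 row: 8 kHz sample rate, 8 bits per sample, one channel.
g711_rate = pcm_bit_rate_kbit_s(8000, 8)
g711_delay = sample_period_us(8000)

# Assumed reference: CD audio (44.1 kHz, 16 bit, 2 channels),
# compared against the highest MP3 bit rate listed above (320 kbit/s).
cd_rate = pcm_bit_rate_kbit_s(44100, 16, channels=2)
mp3_ratio = compression_ratio(cd_rate, 320)

print(f"G.711: {g711_rate} kbit/s, one sample = {g711_delay} us")
print(f"CD PCM: {cd_rate} kbit/s, MP3 at 320 kbit/s ratio = {mp3_ratio:.2f}")
```

The same helpers can be applied to any row that lists a sample rate and a sample size, although the latency of frame-based codecs depends on frame length and look-ahead and cannot be derived from the sample rate alone.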