RTP payload formats


The Real-time Transport Protocol (RTP) specifies a general-purpose data format and network protocol for transmitting digital media streams on Internet Protocol (IP) networks. The details of media encoding, such as signal sampling rate, frame size and timing, are specified in an RTP payload format. The format parameters of the RTP payload are typically communicated between transmission endpoints with the Session Description Protocol (SDP), but other protocols, such as the Extensible Messaging and Presence Protocol (XMPP), may be used.
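As an illustration of how SDP carries these parameters, the following minimal sketch parses an SDP `a=rtpmap` attribute, which binds a payload type number to an encoding name, a clock rate and, optionally, a channel count. The `RtpMap` type, the `parse_rtpmap` helper and the sample attribute lines are assumptions made for this example; a real SDP parser would handle far more of the session description.

```python
from typing import NamedTuple, Optional


class RtpMap(NamedTuple):
    """Fields of one 'a=rtpmap' attribute (hypothetical helper type)."""
    payload_type: int
    encoding_name: str
    clock_rate: int
    channels: Optional[int]  # None when the attribute omits the field


def parse_rtpmap(line: str) -> RtpMap:
    """Parse 'a=rtpmap:<payload type> <name>/<clock rate>[/<channels>]'."""
    prefix = "a=rtpmap:"
    if not line.startswith(prefix):
        raise ValueError("not an rtpmap attribute")
    # Split "<payload type> <encoding>" at the first space.
    pt_text, _, encoding = line[len(prefix):].strip().partition(" ")
    parts = encoding.strip().split("/")
    if len(parts) < 2:
        raise ValueError("missing clock rate")
    channels = int(parts[2]) if len(parts) > 2 else None
    return RtpMap(int(pt_text), parts[0], int(parts[1]), channels)


if __name__ == "__main__":
    # Illustrative attribute lines, not taken from a real session.
    print(parse_rtpmap("a=rtpmap:0 PCMU/8000"))
    print(parse_rtpmap("a=rtpmap:96 opus/48000/2"))
```

The second sample line shows the typical case for a dynamically assigned payload type: the number 96 has no fixed meaning until the session description maps it to a format.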

Payload types and formats

The technical parameters of payload formats for audio and video streams are standardised.
The standard also describes the process of registering new payload types with IANA.
Payload formats and types for text messaging and for MIDI are defined in separate specifications.

Audio and video payload types

The payload formats and types defined for audio and video are listed in the table at the end of this section.
Payload identifiers 96–127 are used for payloads defined dynamically during a session. Dynamic assignment of port numbers is recommended, although port numbers 5004 and 5005 have been registered for use with the profile when a dynamically assigned port is not required.
Applications should always support PCMU. Previously, DVI4 was also recommended, but this was removed in 2013.
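The payload type occupies the low seven bits of the second octet of the RTP header, with the marker bit in the top bit of the same octet. The sketch below, which uses hypothetical helper names, splits that octet, checks for the dynamic range 96–127, and shows why payload types 72–76 are reserved: with the marker bit set, the octet takes the values 200–204, which RTCP uses as packet types in the same position.

```python
from typing import Tuple


def split_second_octet(octet: int) -> Tuple[int, int]:
    """Return (marker, payload_type) from the second octet of an RTP header."""
    if not 0 <= octet <= 0xFF:
        raise ValueError("octet must be in the range 0..255")
    return octet >> 7, octet & 0x7F


def build_second_octet(marker: int, payload_type: int) -> int:
    """Combine the marker bit and a 7-bit payload type into one octet."""
    if marker not in (0, 1) or not 0 <= payload_type <= 127:
        raise ValueError("marker must be 0 or 1, payload type 0..127")
    return (marker << 7) | payload_type


def is_dynamic(payload_type: int) -> bool:
    """True for identifiers in the dynamically assigned range 96-127."""
    return 96 <= payload_type <= 127


if __name__ == "__main__":
    print(split_second_octet(0x80 | 96))   # marker set, dynamic type 96
    # Payload types 72-76 with the marker bit set alias RTCP types 200-204.
    print([build_second_octet(1, pt) for pt in range(72, 77)])
```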
Payload type | Name | Type | No. of channels | Clock rate (Hz) | Frame size (ms) | Default packet interval (ms) | Description
0 | PCMU | audio | 1 | 8000 | any | 20 | ITU-T G.711 PCM μ-Law audio 64 kbit/s
1 | reserved | audio | 1 | 8000 | | | reserved, previously FS-1016 CELP audio 4.8 kbit/s
2 | reserved | audio | 1 | 8000 | | | reserved, previously ITU-T G.721 ADPCM audio 32 kbit/s or ITU-T G.726 audio 32 kbit/s
3 | GSM | audio | 1 | 8000 | 20 | 20 | European GSM Full Rate audio 13 kbit/s
4 | G723 | audio | 1 | 8000 | 30 | 30 | ITU-T G.723.1 audio
5 | DVI4 | audio | 1 | 8000 | any | 20 | IMA ADPCM audio 32 kbit/s
6 | DVI4 | audio | 1 | 16000 | any | 20 | IMA ADPCM audio 64 kbit/s
7 | LPC | audio | 1 | 8000 | any | 20 | Experimental Linear Predictive Coding audio 5.6 kbit/s
8 | PCMA | audio | 1 | 8000 | any | 20 | ITU-T G.711 PCM A-Law audio 64 kbit/s
9 | G722 | audio | 1 | 8000 | any | 20 | ITU-T G.722 audio 64 kbit/s
10 | L16 | audio | 2 | 44100 | any | 20 | Linear PCM 16-bit Stereo audio 1411.2 kbit/s, uncompressed
11 | L16 | audio | 1 | 44100 | any | 20 | Linear PCM 16-bit audio 705.6 kbit/s, uncompressed
12 | QCELP | audio | 1 | 8000 | 20 | 20 | Qualcomm Code Excited Linear Prediction
13 | CN | audio | 1 | 8000 | | | Comfort noise. Payload type used with audio codecs that do not support comfort noise as part of the codec itself such as G.711, G.722.1, G.722, G.726, G.727, G.728, GSM 06.10, Siren, and RTAudio.
14 | MPA | audio | 1, 2 | 90000 | 8–72 | | MPEG-1 or MPEG-2 audio only
15 | G728 | audio | 1 | 8000 | 2.5 | 20 | ITU-T G.728 audio 16 kbit/s
16 | DVI4 | audio | 1 | 11025 | any | 20 | IMA ADPCM audio 44.1 kbit/s
17 | DVI4 | audio | 1 | 22050 | any | 20 | IMA ADPCM audio 88.2 kbit/s
18 | G729 | audio | 1 | 8000 | 10 | 20 | ITU-T G.729 and G.729a audio 8 kbit/s; Annex B is implied unless the annexb=no parameter is used
19 | reserved | audio | | | | | reserved, previously comfort noise
25 | CELLB | video | | 90000 | | | Sun CellB video
26 | JPEG | video | | 90000 | | | JPEG video
28 | nv | video | | 90000 | | | Xerox PARC's Network Video
31 | H261 | video | | 90000 | | | ITU-T H.261 video
32 | MPV | video | | 90000 | | | MPEG-1 and MPEG-2 video
33 | MP2T | audio/video | | 90000 | | | MPEG-2 transport stream
34 | H263 | video | | 90000 | | | H.263 video, first version
72–76 | reserved | | | | | | reserved because RTCP packet types 200–204 would otherwise be indistinguishable from RTP payload types 72–76 with the marker bit set
77–95 | unassigned | | | | | | note that RTCP packet type 207 would be indistinguishable from RTP payload type 79 with the marker bit set
dynamic | H263-1998 | video | | 90000 | | | H.263 video, second version
dynamic | H263-2000 | video | | 90000 | | | H.263 video, third version
dynamic | H264 AVC | video | | 90000 | | | H.264 video
dynamic | H264 SVC | video | | 90000 | | | H.264 video
dynamic | H265 | video | | 90000 | | | H.265 video
dynamic | theora | video | | 90000 | | | Theora video
dynamic | iLBC | audio | 1 | 8000 | 20, 30 | 20, 30 | Internet low Bitrate Codec 13.33 or 15.2 kbit/s
dynamic | PCMA-WB | audio | 1 | 16000 | 5 | | ITU-T G.711.1 A-law
dynamic | PCMU-WB | audio | 1 | 16000 | 5 | | ITU-T G.711.1 μ-law
dynamic | G718 | audio | | 32000 | 20 | | ITU-T G.718
dynamic | G719 | audio | | 48000 | 20 | | ITU-T G.719
dynamic | G7221 | audio | | 16000, 32000 | 20 | | ITU-T G.722.1 and G.722.1 Annex C
dynamic | G726-16 | audio | 1 | 8000 | any | 20 | ITU-T G.726 audio 16 kbit/s
dynamic | G726-24 | audio | 1 | 8000 | any | 20 | ITU-T G.726 audio 24 kbit/s
dynamic | G726-32 | audio | 1 | 8000 | any | 20 | ITU-T G.726 audio 32 kbit/s
dynamic | G726-40 | audio | 1 | 8000 | any | 20 | ITU-T G.726 audio 40 kbit/s
dynamic | G729D | audio | 1 | 8000 | 10 | 20 | ITU-T G.729 Annex D
dynamic | G729E | audio | 1 | 8000 | 10 | 20 | ITU-T G.729 Annex E
dynamic | G7291 | audio | | 16000 | 20 | | ITU-T G.729.1
dynamic | GSM-EFR | audio | 1 | 8000 | 20 | 20 | ITU-T GSM-EFR
dynamic | GSM-HR-08 | audio | 1 | 8000 | 20 | | ITU-T GSM-HR
dynamic | AMR | audio | | 8000 | 20 | | Adaptive Multi-Rate audio
dynamic | AMR-WB | audio | | 16000 | 20 | | Adaptive Multi-Rate Wideband audio
dynamic | AMR-WB+ | audio | 1, 2 or omit | 72000 | 13.3–40 | | Extended Adaptive Multi Rate – WideBand audio
dynamic | vorbis | audio | | | | | Vorbis audio
dynamic | opus | audio | 1, 2 | 48000 | 2.5–60 | 20 | Opus audio
dynamic | speex | audio | 1 | 8000, 16000, 32000 | 20 | | Speex audio
dynamic | mpa-robust | audio | 1, 2 | 90000 | 24–72 | | Loss-Tolerant MP3 audio
dynamic | MP4A-LATM | audio | | 90000 or others | | | MPEG-4 Audio
dynamic | MP4V-ES | video | | 90000 or others | | | MPEG-4 Visual
dynamic | mpeg4-generic | audio/video | | 90000 or other | | | MPEG-4 Elementary Streams
dynamic | VP8 | video | | 90000 | | | VP8 video
dynamic | VP9 | video | | 90000 | | | VP9 video
dynamic | AV1 | video | | 90000 | | | AV1 video
dynamic | L8 | audio | | | any | 20 | Linear PCM 8-bit audio with 128 offset
dynamic | DAT12 | audio | | | any | 20 | IEC 61119 12-bit nonlinear audio
dynamic | L16 | audio | | | any | 20 | Linear PCM 16-bit audio
dynamic | L20 | audio | | | any | 20 | Linear PCM 20-bit audio
dynamic | L24 | audio | | | any | 20 | Linear PCM 24-bit audio
dynamic | raw | video | | 90000 | | | Uncompressed Video
dynamic | ac3 | audio | | 32000, 44100, 48000 | | | Dolby AC-3 audio
dynamic | eac3 | audio | | 32000, 44100, 48000 | | | Enhanced AC-3 audio
dynamic | t140 | text | | 1000 | | | Text over IP
dynamic | EVRC, EVRC0, EVRC1 | audio | | 8000 | | | EVRC audio
dynamic | EVRCB, EVRCB0, EVRCB1 | audio | | 8000 | | | EVRC-B audio
dynamic | EVRCWB, EVRCWB0, EVRCWB1 | audio | | 16000 | | | EVRC-WB audio
dynamic | jpeg2000 | video | | 90000 | | | JPEG 2000 video
dynamic | UEMCLIP | audio | | 8000, 16000 | | | UEMCLIP audio
dynamic | ATRAC3 | audio | | 44100 | | | ATRAC3 audio
dynamic | ATRAC-X | audio | | 44100, 48000 | | | ATRAC3+ audio
dynamic | ATRAC-ADVANCED-LOSSLESS | audio | | | | | ATRAC Advanced Lossless audio
dynamic | DV | video | | 90000 | | | DV video
dynamic | BT656 | video | | | | | ITU-R BT.656 video
dynamic | BMPEG | video | | | | | Bundled MPEG-2 video
dynamic | SMPTE292M | video | | | | | SMPTE 292M video
dynamic | RED | audio | | | | | Redundant Audio Data
dynamic | VDVI | audio | | | | | Variable-rate DVI4 audio
dynamic | MP1S | video | | | | | MPEG-1 Systems Streams video
dynamic | MP2P | video | | | | | MPEG-2 Program Streams video
dynamic | tone | audio | | 8000 | | | tone
dynamic | telephone-event | audio | | 8000 | | | DTMF tone
dynamic | aptx | audio | 2–6 | | 4000 ÷ sample rate | 4 | aptX audio
dynamic | jxsv | video | | 90000 | | | JPEG XS video
dynamic | scip | audio/video | | 8000 or 90000 | | | SCIP
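
The clock rate and default packet interval columns determine how far the RTP timestamp advances from one audio packet to the next, and the bit rates quoted for linear PCM follow directly from sample rate, sample size and channel count. The following sketch works through that arithmetic for a few table entries; the function names are hypothetical and the code assumes a constant packet interval.

```python
def timestamp_increment(clock_rate_hz: int, packet_ms: float) -> int:
    """RTP timestamp units covered by one packet of the given duration."""
    ticks = clock_rate_hz * packet_ms / 1000
    if ticks != int(ticks):
        raise ValueError("interval is not a whole number of clock ticks")
    return int(ticks)


def pcm_bitrate_kbps(sample_rate_hz: int, bits_per_sample: int, channels: int) -> float:
    """Bit rate of uncompressed linear PCM in kbit/s."""
    return sample_rate_hz * bits_per_sample * channels / 1000


if __name__ == "__main__":
    # PCMU: 8000 Hz clock, 20 ms default packet interval.
    print(timestamp_increment(8000, 20))
    # Opus: 48000 Hz clock, 20 ms default packet interval.
    print(timestamp_increment(48000, 20))
    # L16 stereo and mono at 44100 Hz, matching the quoted bit rates.
    print(pcm_bitrate_kbps(44100, 16, 2), pcm_bitrate_kbps(44100, 16, 1))
```

For PCMU the result is 160 timestamp units per 20 ms packet; at 64 kbit/s the same packet also carries 160 bytes of payload, because G.711 encodes one byte per sample.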