When you make a VoIP call or join a video meeting, something has to carry your voice and video across the internet in real time. That “something” is typically RTP. In this glossary entry, we’ll explain what RTP is, how it works, why it runs over UDP, and why it’s essential for technologies like VoIP, SIP trunking, and video conferencing.
RTP (Real-Time Transport Protocol) is a network protocol used to deliver real-time audio and video over IP networks. It plays a critical role in technologies like VoIP calls, video conferencing, and live media streaming, where data must arrive quickly to maintain natural communication. RTP prioritises low latency and smooth playback over perfect reliability, making it ideal for time-sensitive media transmission.
How Does RTP Work?
RTP works by taking live audio or video and breaking it into small packets that can be transmitted quickly across an IP network. The process is optimised for speed so that conversations and video streams feel natural and uninterrupted.
Here’s a simple view of how RTP operates in real time:
Media is captured — A microphone captures voice or a camera captures video. The media is converted into digital data.
RTP packetizes the media — The digital audio or video stream is divided into small data packets suitable for network transmission.
Sequence numbers are added — Each RTP packet includes a sequence number so the receiving device can detect missing packets and reorder them correctly.
Timestamps are added — RTP adds timestamps to help synchronise audio and video streams and ensure proper playback timing.
Packets are sent over UDP — RTP typically runs over UDP to minimise delay. UDP does not retransmit lost packets, which helps maintain low latency.
The receiver reorders and plays media — The receiving system uses the sequence numbers and timestamps to reorder packets, smooth out jitter, and play the media in real time.
Key Features of RTP
RTP is specifically designed to handle real-time media transmission. Here are its core features in simple terms:
Packetisation of real-time media — RTP divides live audio and video into small packets so they can be transmitted efficiently across IP networks.
Sequence numbers (packet ordering) — Each packet includes a sequence number, allowing the receiver to detect lost packets and restore the correct order during playback.
Timestamps (synchronisation) — RTP adds timestamps to ensure audio and video streams stay synchronised and are played at the correct speed.
Designed for low latency — RTP prioritises speed and smooth delivery, making it suitable for live communication like VoIP and video conferencing.
Does not guarantee delivery — RTP does not retransmit lost packets. Instead, it tolerates minor packet loss to avoid delays that would disrupt real-time conversations.
RTP vs UDP — What’s the Difference?
RTP and UDP are closely related, but they are not the same thing. UDP is a transport protocol that moves data from one device to another without guaranteeing delivery. RTP runs on top of UDP and adds media-specific features needed for real-time audio and video transmission.
Here’s a simple comparison:
Feature
RTP
UDP
Purpose
Media transport (audio/video)
General data transport
Reliability
No guarantee
No guarantee
Adds timestamps
Yes
No
Adds sequence numbers
Yes
No
In short, UDP handles the delivery, while RTP structures and manages real-time media on top of UDP. UDP keeps things fast by avoiding retransmissions, and RTP adds the intelligence needed to keep voice and video synchronised and in the correct order.
RTP vs RTCP vs SRTP
RTP is often used alongside other related protocols. Understanding how they differ helps clarify how real-time communication systems work.
Here’s a simple comparison:
Protocol
Purpose
RTP
Carries audio and video media
RTCP
Monitors quality and provides synchronisation feedback
SRTP
Encrypts and secures RTP traffic
In a typical VoIP call, RTP carries the actual voice data between devices. At the same time, RTCP (RTP Control Protocol) sends quality reports, such as packet loss and jitter statistics, helping systems monitor performance and adjust if needed. When security is required, SRTP (Secure RTP) encrypts the media stream to protect conversations from interception. Together, these protocols ensure real-time media is delivered quickly, monitored for quality, and secured when necessary.
Why Is RTP Important for VoIP and SIP Trunking?
RTP is essential to VoIP and SIP trunking because it carries the actual voice data during a call. While SIP is responsible for signalling—setting up, managing, and ending the call—RTP is the protocol that transports the live audio between participants. Without RTP, a call could be established, but no sound would be transmitted.
In VoIP environments, RTP enables real-time communication by delivering voice packets quickly and in the correct order. It works alongside SIP to separate signalling from media, which improves flexibility and scalability in business phone systems.
In SIP trunking setups, RTP flows between the PBX (or cloud phone system) and the provider’s network, carrying inbound and outbound voice traffic. Performance factors like latency, jitter, and packet loss directly affect RTP streams, which is why network quality and proper configuration are critical for clear call audio.
FAQ About RTP
What does RTP stand for?
RTP stands for Real-Time Transport Protocol. It is a network protocol used to deliver live audio and video over IP networks.
Is RTP the same as UDP?
No. UDP (User Datagram Protocol) is a transport protocol that moves data across networks without guaranteeing delivery. RTP runs on top of UDP and adds features like sequence numbers and timestamps specifically designed for real-time media.
Does RTP guarantee delivery?
No. RTP does not guarantee packet delivery or retransmit lost data. It prioritizes low latency over reliability, tolerating minor packet loss to keep audio and video flowing smoothly.
What port does RTP use?
RTP does not use a single fixed port. It typically uses dynamic UDP ports negotiated during call setup (often through SIP). Many VoIP systems use a configurable UDP port range, commonly starting around 10000–20000, but this varies by implementation.
What is the difference between RTP and SRTP?
RTP carries real-time audio and video data, while SRTP (Secure RTP) encrypts that data to protect it from interception. SRTP adds confidentiality, message authentication, and integrity protection to standard RTP streams.