Latency is the delay between when data is sent and when it is received. It plays a crucial role in applications that require real-time responsiveness.
What is latency?
Latency refers to the time delay between a source sending data and the destination receiving it. It is a key performance indicator in computer networks and digital communications, especially when evaluating the responsiveness of systems. Unlike bandwidth or bit rate, which measure the amount of data transmitted over time, latency focuses purely on how long it takes for data to travel from point A to point B.
The impact of latency becomes most apparent in systems where timing and real-time interaction are essential. These include applications like video conferencing, online gaming, and voice communication. Even a small delay can lead to noticeable performance issues in these cases.
Important characteristics of latency:
Measured in milliseconds (ms).
Low latency means quicker communication and better responsiveness.
High latency results in noticeable lag, which can degrade user experience.
Latency is often referred to in terms of round-trip time (RTT)—the time it takes for data to go to the destination and back again.
Types of latency
Understanding different types of latency helps identify the sources of delay in a system. These types contribute to the total latency experienced during communication.
Propagation latency
Practice Questions
FAQ
Satellite communication introduces significantly higher latency because of the physical distance that the signals must travel. Most communication satellites are in geostationary orbit, located around 35,786 kilometres above the Earth’s surface. When data is transmitted via satellite, it has to travel from the Earth to the satellite and back down again—this journey occurs twice for a round-trip (up to the satellite and back down to the receiver). Even though electromagnetic signals travel at the speed of light, this long distance results in a round-trip delay of approximately 480 to 600 milliseconds, depending on atmospheric conditions and equipment used. In contrast, fibre optic cables run directly across land or sea beds and cover much shorter distances. Signals in fibre optics also travel slightly slower than in a vacuum but still much faster than the delay introduced by satellite routing. As a result, satellite communication is unsuitable for latency-sensitive applications like online gaming or voice calls unless specifically engineered to reduce this delay.
Jitter refers to the variation in latency—in other words, the inconsistency in the time it takes for data packets to travel from sender to receiver. While latency is the overall delay, jitter measures how stable or predictable that delay is. For example, if one packet arrives in 50 milliseconds and the next arrives in 90 milliseconds, the jitter is 40 milliseconds. This becomes a significant concern in real-time communication systems, such as VoIP or video conferencing, where consistent timing is crucial for maintaining quality. High jitter can cause packets to arrive out of order, leading to issues like choppy audio, broken video streams, or dropped frames. Jitter is usually caused by congestion, routing inconsistencies, or network hardware performance. Buffering can help smooth out jitter but introduces additional latency. Therefore, a network must not only aim for low latency but also low jitter to ensure reliable and smooth data transmission for real-time applications.
Yes, there is a notable difference between latency in wired and wireless networks. Wired networks generally offer lower and more stable latency due to the direct physical connection between devices. Signals in wired networks, especially those using fibre optic cables, face minimal interference and travel through controlled environments, resulting in consistent and predictable transmission times. On the other hand, wireless networks introduce additional latency because they are subject to environmental interference, signal attenuation, and complex encoding and decoding processes. Wireless signals may be affected by obstacles such as walls, weather conditions, and competing devices operating on similar frequencies. Additionally, wireless communication often includes more error checking and retransmission mechanisms to ensure data integrity, which further increases processing time. Wireless networks may also use shared channels, where multiple devices compete for bandwidth, leading to queueing delays. As a result, although wireless offers mobility and convenience, it tends to be less reliable in terms of latency performance compared to wired connections.
Operating systems and application-level decisions significantly impact system latency beyond the physical transmission of data. The operating system controls how processes are scheduled, how memory is managed, and how input/output operations are prioritised. If a system is multitasking heavily, the scheduler may delay a process from executing immediately, increasing processing latency. Additionally, context switching between processes adds time before data can be handled. At the application level, poor software design—such as inefficient algorithms, blocking operations, or excessive resource use—can create bottlenecks. Applications that make unnecessary network requests, use large payloads, or fail to cache data locally can also introduce delays. Furthermore, decisions like polling instead of using interrupts or waiting synchronously for network responses rather than handling them asynchronously may result in higher latency. Developers must therefore consider how their code interacts with both the operating system and network services to minimise delays and provide responsive applications, especially in systems requiring real-time feedback.
Packet size has a direct impact on transmission latency. Smaller packets transmit faster because they require less time to be placed on the network medium (shorter transmission delay). This is beneficial in applications where responsiveness is more critical than throughput, such as in real-time voice communication or online gaming. However, smaller packets carry more protocol overhead relative to data payload, as each packet includes headers and possibly checksums, increasing network load. On the other hand, larger packets are more efficient for bulk data transfer, as they reduce overhead per byte of data. However, if a large packet encounters transmission errors, the entire packet must be retransmitted, which introduces a larger delay compared to smaller packets. In low-latency scenarios, packet size must strike a balance between speed and reliability. Typically, smaller or medium-sized packets are preferred, especially when communication needs to be highly interactive and delays in transmission—even by a few milliseconds—can affect user experience.
