DetNet B. Varga
Internet-Draft J. Sachs
Intended status: Informational Ericsson
Expires: 9 March 2025 F. Duerr
University of Stuttgart
S. Mostafavi
KTH Royal Institute of Technology
5 September 2024
Latency analysis of mobile transmission
draft-varga-detnet-mobile-latency-analysis-01
Abstract
Dependable time-critical communication over a mobile network has its
own challenges. This document focuses on a comprehensive analysis of
mobile systems latency in order to incorporate its specifics in
developments of latency specific network functions. The analysis
provides valuable insights for the development of wireless-friendly
methods ensuring bounded latency as well as future approaches using
data-driven latency characterization.
Status of This Memo
This Internet-Draft is submitted in full conformance with the
provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF). Note that other groups may also distribute
working documents as Internet-Drafts. The list of current Internet-
Drafts is at https://datatracker.ietf.org/drafts/current/.
Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."
This Internet-Draft will expire on 9 March 2025.
Copyright Notice
Copyright (c) 2024 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents (https://trustee.ietf.org/
license-info) in effect on the date of publication of this document.
Varga, et al. Expires 9 March 2025 [Page 1]
Internet-Draft Mobile Latency September 2024
Please review these documents carefully, as they describe your rights
and restrictions with respect to this document. Code Components
extracted from this document must include Revised BSD License text as
described in Section 4.e of the Trust Legal Provisions and are
provided without warranty as described in the Revised BSD License.
Table of Contents
1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . 2
2. Comparison between a wired and a mobile virtual DetNet
router . . . . . . . . . . . . . . . . . . . . . . . . . 6
3. Mobile Transmission Latency Breakdown . . . . . . . . . . . . 7
3.1. Mobile communication targets . . . . . . . . . . . . . . 7
3.2. Transmission Latency Breakdown . . . . . . . . . . . . . 8
3.3. QoS architecture within the mobile network . . . . . . . 9
3.4. Latency contributions in different layers of radio
protocols . . . . . . . . . . . . . . . . . . . . . . . . 11
3.5. Latency Analysis . . . . . . . . . . . . . . . . . . . . 14
3.5.1. Processing delays in gNB and UE . . . . . . . . . . . 15
3.5.2. Traffic handling and queuing . . . . . . . . . . . . 15
3.5.3. Data transmission over the radio interface . . . . . 15
3.5.4. Wireless transmission reliability . . . . . . . . . . 16
4. Example: Observed characteristics in real network . . . . . . 18
5. Scheduling related future work . . . . . . . . . . . . . . . 20
6. Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
7. Acknowledgements . . . . . . . . . . . . . . . . . . . . . . 21
8. References . . . . . . . . . . . . . . . . . . . . . . . . . 21
Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . 23
1. Introduction
Digital transformation of industries and society is resulting in the
emergence of a larger family of time-critical services with unique
requirements distinct from traditional Internet applications. Such
time-critical communication has in the past been mainly prevalent to
wired communication system, which is limited to local and isolated
network domains. Wireless communication provides flexibility and
simplicity, but with inherently stochastic components that lead to
packet delay distributions metrics exceeding significantly those
found in wired counterparts. These deviations of stochastic
characteristics have to be addressed in Deterministic Networking
(DetNet) [RFC8655].
The 5G mobile communication system is specified in the Third
Generation Partnership Project (3GPP) and it supports communication
with unprecedented reliability and very low latency through the
Ultra-Reliable Low Latency Communications (URLLC) enhancements
introduced in Release 16. URLLC features targeted reliability,
Varga, et al. Expires 9 March 2025 [Page 2]
Internet-Draft Mobile Latency September 2024
latency and QoS (e.g., automatic repetitions, antenna techniques,
robust physical channels, Orthogonal Frequency Division Multiplex
(OFDM) numerology, mini-slots, grant-free access, pre-emption, 5G QoS
identifier (5QI) values for multiple time-critical services, QoS
monitoring). Providing synchronization and exposure functionality
are covered as well.
DetNet support started in Release 18 based on the concept developed
for Time-sensitive Networking (TSN) in former releases. The 5G
system is represented in the end-to-end architecture as a set of
virtual DetNet routers. The 5G network comprises a 5G core network
and a Radio Access Network (RAN). A User Plane Function (UPF) of the
5G core network acts as a gateway towards the DetNet network. The
RAN can span over a wider geographical area to provide wireless
connectivity to one or more User Equipment (UEs).
Note: In general bridging/routing service is out-of-scope for 3GPP
specifications, therefore in real network scenarios bridging and
routing are for example implemented by additional (external)
functions located mainly within or next to the UPF.
Note2: According to TSC (Time Sensitive Communication) concept of
3GPP the whole 5GS is presented towards the wired DetNet network as a
virtual DetNet router. Using DetNet technology between the 5GS
components is not precluded but out-of-scope in this document.
Varga, et al. Expires 9 March 2025 [Page 3]
Internet-Draft Mobile Latency September 2024
+--------+ +--------+
| TSCTSF |<--------------->| DetNet |
+--------+ | Ctrl. |
^ +--------+
|
v
+---+
|PCF|
+---+
^
|
v
+---+ +---+
+------>|AMF|<--->|SMF|<----------+
| +---+ +---+ |
| ^ |
| | |
v v v
+------+ +--+----------------+ +-----+ +--------+ +---------+
|DetNet|<->|UE| RAN +--+Trans+--+Core Net|<->| DetNet |
| end | | +----+-------+---+ |port | | (CN) | | network |
|System| | TE | radio |gNB| | NW | | UPF / | +---------+
+------+ | | link | | | | | NW-TT |
+--+----+ +---+ +-----+ +--------+
5GS virtual DetNet Router
<---------------------------------------->
SMF: Sessions Management Function UE, TE: User, Terminal Equipment
AMF: Access & Mobility Management Function UPF: User Plane Function
PCF: Policy Control Function RAN: Radio Access Network
TSCTSF: Time Sensitive Communication gNB: Base Station
and Time Synchronization Function NW-TT: Network-side TSC Translator
Figure 1: Internal components of the 5G system acting as a
virtual DetNet router
Varga, et al. Expires 9 March 2025 [Page 4]
Internet-Draft Mobile Latency September 2024
Figure 1 shows the interconnection of the DetNet nodes and the 5G
network. The Time-Sensitive Communication and Time Synchronization
Function (TSCTSF) connects the DetNet Controller and the 5G control
plane. The TSCTSF collects information from the 5G system, and it
reports to the DetNet Controller. The DetNet Controller configures
the 5G as a virtual DetNet router through the TSCTSF, which maps
parameters and sets the configuration via the 5G control plane. Data
plane connectivity at the UPF is achieved via the TSC Translators
(TT) on the network-side at the UPF (NW-TT). Using a TT function on
the device side (DS-TT) is optional (e.g., if time synchronization
has to be provided).
The interaction between the DetNet controller and the 5G system is
specified in [M23.501] e.g., for the support of periodic
deterministic communication. It describes how the a-priori traffic
pattern characteristics in the downlink and the uplink direction
could be provided from an external network into the 5GS and used by
the NG-RAN to optimize resource utilization and to lower the latency
and latency variation. The TSC Assistance Information (TSCAI)
feature is described as a method how the QoS flow traffic
characteristics could be transferred within the 5GS. The TSCAI
feature can be helpful e.g., in scenarios where there is an offset
between the traffic burst sending times and the reserved resources on
the air interface, a mismatch between the periodicity of traffic and
scheduled resources, a clock drift of 5GS with a reference to the
clock of an external network, or in a combination of these cases.
Note: 3GPP systems do not support directly the MPLS data plane of
DetNet due to the lack of support for MPLS. DetNet IP data plane is
supported via the IP PDU session.
Wireless system and its external interfaces are by nature distributed
and with dynamic variations due to radio propagation. The radio
transmission suffers from interferences, reflections, scattering and
diffraction that affect the reliability of data communications which
results in high variable forwarding latency, see a deeper review in
[NR-5G].
There are multiple extension directions to overcome the limitations
inherited by wireless systems, especially 3GPP ones. The common
characteristics of them are that they provide a wireless-friendly
toolset to achieve the required latency distribution between the
endpoints. The latency analysis described here is intended to help
the developers of such wireless-friendly toolsets and provide
motivation for new approaches as well. Such new approaches can be
based on the predictability of the system, for example via usage of
data-driven latency characterization, where network entities have the
ability to estimate the evolution of a system metric or state in the
future.
Varga, et al. Expires 9 March 2025 [Page 5]
Internet-Draft Mobile Latency September 2024
Note: this document was written in order to support DetNet WG related
discussions but it can be interesting for non-DetNet discussions as
well.
2. Comparison between a wired and a mobile virtual DetNet router
The same 5G network can form multiple virtual routers, each of which
is realized via the UPF instance in the 5G core network. An UPF
configured for DetNet support and all UEs connected to that UPF with
IP PDU sessions jointly form the virtual 5G DetNet router and its
ports. There exist significant differences in the characteristics of
such a virtual and a legacy wired DetNet router [D6G-D2.1]:
* Physical distance of ports: In a wired router the physical
distance between ingress and egress ports is in the order of a
decimeter. In a virtual 5G router the distance is between the UPF
and the UE, or between two UEs and can be up to 100's of meters or
even kilometers. This can remarkably impact on network topology.
For example, in an industrial wired DetNet network connecting two
end points may require many 10's of hops to be traversed for E2E
connectivity. With a 5G virtual router only few hops are needed
(e.g., 1-2 (or up to a few) hops to reach the 5G ingress (UE or
UPF) and 1-2 (or up to a few) hops to reach the end node from the
5G egress (UE or UPF).
* Number of ports: The number of router ports in a wired router is
decided at the design and production of the router; router ports
are at fixed locations (in the chassis of the router). In the
virtual 5G router, the number of ports depends on the number of
UEs connected to the UPF that outlines the virtual router. If a
new UE connects to the UPF the number of ports owned by the
virtual router increases. This new UE may require interaction(s)
with the DetNet Controller (e.g., reporting latency to/from the
new port, updating router configurations).
* Latency characteristics: The latency performance of a wired router
is in the single-digit microsecond range, with a PDV in the range
of some 100's of nanoseconds. In a wireless router the typical
latency values are in the range of milliseconds (without specific
configurations for low latency bounds they can reach up to some
10's of milliseconds). Even by using URLLC and proper DetNet
configuration the PD and PDV of a 5G virtual router is
substantially larger than for a wired router.
* Dynamicity of characteristics: Characteristics of a (wired) DetNet
router are mostly determined at design and production time. A
wired router that is tested in a lab prior to normal deployment
can be expected to behave in the same way during operations as
Varga, et al. Expires 9 March 2025 [Page 6]
Internet-Draft Mobile Latency September 2024
during the lab test. In contrast, for a wireless system - and a
virtual 5G DetNet router - the performance depends on the radio
environment and deployment. So, the characteristics of the
virtual 5G router are determined during the operation phase. With
a well-planned and deployed RAN, the general 5GS performance is
expected to perform according to requirements, but in case of
major changes in the radio environment (e.g., walls or large
blocking installations being added) changes in the performance
might occur.
While the 5GS has been specified to be compatible to DetNet by its
external interfaces, the differences in characteristics of the
virtual 5G router and a wired DetNet router requires the development
of new wireless-friendly solutions, those are able to efficiently
ensure bounded latency in mixed (wired and wireless) DetNet
scenarios.
3. Mobile Transmission Latency Breakdown
3.1. Mobile communication targets
In traditional mobile communication networks, the primary key
performance indicators of interest have been the achievable data rate
and spectral efficiency. In 5G, latency has been added as a further
key performance indicator by URLLC. The ambition of 5G URLLC was to
provide low-latency communication while providing high reliability
for maintaining the latency below a specified latency bound. For
example, the objective for the 5G standard is to guarantee that a RAN
latency of 1 ms can be achieved with 99.999% probability.
A solution for reliable wireless transmission with high spectral
efficiency is to apply Hybrid Automatic Repeat Request (HARQ)
retransmissions to recover from unsuccessful transmissions. However,
HARQ leads to an increase in latency due to multiple transmissions
causing a notable disturbance in the packet delay distribution.
URLLC has introduced two major sets of tools: (i) reducing the radio
transmission structure for lower latencies (e.g., processing delays,
channel access delays), and (ii) providing higher robustness in the
transmission to achieve the same latency reliability with fewer
transmission attempts, at the costs of reduced spectral efficiency
due to extremely conservative transmission modes.
To give an example, an uplink transmission in a millimeter wave
carrier can be made in two different configurations [FGS15]:
* Normal 5G New Radio (NR) configuration with up to 3
retransmissions for reliability with packet delay from ~500 us to
2.8 ms, with low resource usage,
Varga, et al. Expires 9 March 2025 [Page 7]
Internet-Draft Mobile Latency September 2024
* 5G URLLC NR configuration with single-transmission reliability
with packet delay from ~500 us to 900 us, involving high resource
usage.
Furthermore, very low latencies enabled by URLLC require a thorough
network deployment plan (e.g., location and density of base station
antennas) to ensure that the capabilities are available throughout
the service area. More relaxed latencies are less sensitive to the
radio network design.
5G URLLC is the main enabler to support time-critical communication
standards that have been defined for fixed networks, like IEEE 802.1
TSN and IETF DetNet.
3.2. Transmission Latency Breakdown
Generally, the latency contributions in a 5G network are dominated by
the RAN [D6G-D3.1]. The transport network only plays a role if a UPF
is far away from the gNB; the amount of packet processing at the UPF
(and related processing times) is limited in comparison to RAN.
In the 5G RAN the main latency contributors are:
1. Time-domain reliability based on HARQ
2. Mobility with handover interruptions
3. Time-division duplex structure
4. Congestion due to resource sharing and queuing
HARQ allows for a better utilization of the resources while being
robust for a defined loss bound. Retransmissions inherently
contribute to the latency of the packet with defined probability of
retransmission(s). HARQ should be used as reliability tool, in case
that it is permitted by the latency bound; it is a tool that combines
high reliability with spectral efficiency (at the cost of increased
PDV). Reliability can be achieved without HARQ, by using more robust
transmission modes. If a (low) latency bound is provided with
99.999% reliability by a robust single transmission, then the large
majority of (i.e., 99.99%) of the packets are over-protected with too
high resource allocations in order to ensure that also the worst-case
packets mostly achieve the latency bound.
Mobility is ensured by handover, where a UE switches connection from
one base station to another, which can lead to handover interruption
times. There are tools to minimize this impact, e.g., L3 make-
before-break handover where the resources are allocated and ready
Varga, et al. Expires 9 March 2025 [Page 8]
Internet-Draft Mobile Latency September 2024
before performing the handover, L1 or L2 mobility with multiple
transmission-reception point (multi-TRP), multi-connectivity. These
options are dependent on deployment and spectrum.
Time Division Duplexing (TDD) pattern is sometimes prescribed by
national regulation and subject to harmonization of multiple
networks. This can place restrictions on applicable configurations.
Each TDD pattern introduces at least PDV at transmission time
interval (TTI) level since packets need to wait for their time slots
to be transmitted.
When the network is undergoing congestion at high loads, the
opportunities for transmission are restricted and, consequently,
additional delay is experienced by the packet. Possible solutions
are to apply prioritization, resource partitioning, admission
control, traffic policing, reservations, or preconfigured access. In
most cases there are implications for the implementation, as well as
utilization inefficiencies.
3.3. QoS architecture within the mobile network
The packet delay of individual packet is strongly dependent on how
the packet is handled within the mobile network. Different packets
are treated differently according to the service requirements they
are associated with. This allows to provide latency-optimized
treatment for dependable time-critical services by applying the
Quality of Service (QoS) mechanisms of the mobile network. The
handling of QoS for traffic passing through the 5G network is defined
in the 5G QoS framework [M23.501][FGAQoS], as summarized in Figure 2.
The end-to-end traffic flows passing through the 5G network, denoted
as service data flows, are mapped at the ingress to the 5G system at
the UE and UPF to QoS flows via traffic filter rules. The QoS flow
is the finest level of granularity for specifying the service
specific traffic treatment in the 5G system. Each QoS flow can have
different traffic forwarding treatment configured in the network,
according to the defined QoS requirements.
Varga, et al. Expires 9 March 2025 [Page 9]
Internet-Draft Mobile Latency September 2024
NG-RAN 5GC
<------------------------------------->.<------------------
. .
+------------+ . +------------+ . +-------------+
| UE | . | gNB | . | UPF |
|+----+ +----------------------------------------+ +----+|
||TFTs| | PDU Session | |SDF ||
|+----+ | +---------------+ #----------------# | |Fltr||
| | | Radio Bearer | | GTP-U Tunnel | | +----+|
| +--------------------------------------------+ |
| | QoS Flow | |
| +--------------------------------------------+ |
| +--------------------------------------------+ |
| | QoS Flow | |
| +--------------------------------------------+ |
| | +---------------+ | | | |
| | +---------------+ | | | |
| | | Radio Bearer | | | | |
| +--------------------------------------------+ |
| | QoS Flow | |
| +--------------------------------------------+ |
| | +---------------+ #----------------# | |
| +----------------------------------------+ |
| | . | | . | |
+------------+ . +------------+ . +-------------+
. .
Uu N3
QoS rules QoS Profiles UL and DL
Packet Detection
Rules
Figure 2: 5G QoS architecture
The QoS flow is transported through the 5G core network via a GTP-U
tunnel between the UPF and the gNB over a transport network. In
large networks, the UPF can be placed flexibly in the network
topology; this allows the UPF to be placed close to the device (UE)
and its application and thereby enabling the shortest possible
transport connection and reducing latency [ETR20]. In local
deployments (e.g., industrial scenarios) a UPF is typically very
close to the gNB and can be even located in the same rack. In the
RAN, the QoS flow is transported via a radio bearer over the radio
interface between the user equipment (UE) and the gNB.
Varga, et al. Expires 9 March 2025 [Page 10]
Internet-Draft Mobile Latency September 2024
3.4. Latency contributions in different layers of radio protocols
The Service Data Adaption (SDAP) layer maps the QoS flows to Data
Radio Bearers (DRBs) and marks the packets with the QoS flow
identifier. DRBs can be configured to be either in acknowledged mode
(AM) or unacknowledged mode (UM) (see Figure 4); for an acknowledged
mode DRB lossless data forwarding at handover is enabled for the
Packet Data Convergence Protocol (PDCP) layer and Radio Link Control
(RLC) operates in acknowledged mode. The latency impact of SDAP on
data transfer is negligible.
Ethernet or IP Traffic
<------------------------------------------>
| QoS Flow |
|<---------------------------------->|
| |
+--------+ +--------------+ +---------+
| +----+ | | +----+-----+ | | +-----+ |
| |SDAP| | | |SDAP|GTP-U| | | |GTP-U| |
| +----+ Radio Bearer +----+-----+ | | +-----+ |
| |<----------------->| | | | | |
| +----+ | | +----+-----+ | | +-----+ |
| |PDCP| | | |PDCP| | | | | | |
| +----+ RLC Channel +----+ | | | | | |
| |<----------------->| | | | | | | |
| +----+ | | +----+ | | | | | |
| |RLC | | | |RLC | IP | | | | IP | |
| +-- -+ Logical Ch. +----+ | | | | | |
| |<----------------->| | | | | | | |
| +----+ | | +----+ | | | | | |
| |MAC | | | |MAC | | | | | | |
| +-- -+ Transport Ch.+----+ | | | | | |
| |<----------------->| | | | | | | |
| +----+ | | +----+ | | | | | |
| |PHY | | | |PHY | | | | | | |
| +----+ | | +----+-----+ | | +-----+ |
| ^-------------------^ ^-----------^
| | | | | |
+---UE---+ +------gNB-----+ +---UPF---+
SDAP: ServiceData Adaptation Protocol
PDCP: Packet Data Convergence Protocol
RLC: Radio Link Control
MAC: Medium Access Control
PHY: Physical Layer
Figure 3: 5G protocol stack for user plane with focus on RAN
Varga, et al. Expires 9 March 2025 [Page 11]
Internet-Draft Mobile Latency September 2024
QoS QoS QoS
Flow1 Flow2 Flow3
| | |
+-----|------|-----------------|---------+
| v v SDAP v |
| +----------+ +----------+ |
+---| DRB #1 |----------| DRB #2 |---+
+---| AM |---+ +---| UM |---+
| +----------+ | | +----------+ |
| | | |
| PDCP Entity | | PDCP Entity |
| | | |
+------------------+ +------------------+
^ ^ |
| | |
v | v
+------------+ +-----------+ +-----------+
| RLC AM | | RLC UM UL | | RLC UM DL |
+------------+ +-----------+ +-----------+
+----------------------------------------+
| MAC and PHY |
+----------------------------------------+
....................... Air interface (Uu)
Figure 4: 3GPP 5G protocol stack and data flow
At the next layer, the PDCP (Packet Data Convergence Protocol) layer
provides ciphering for encryption of user plane data and optionally
also integrity protection and verification via a message
authentication code that is calculated for each data Protocol Data
Unit (PDU). PDCP assigns a sequence number for each data PDU and
forwards it to the underlying RLC layer. PDCP can also perform
header compression and decompression over the radio link for the IP
headers or Ethernet headers of the end-to-end data flow.
For acknowledged mode DRBs a copy of each PDCP PDU is stored in a
local buffer. At changes of the RLC entity, due to either handover
or (re-)configuration of dual connectivity or carrier aggregation, a
lossless continuation of data transfer is ensured by forwarding not-
yet-acknowledged PDCP PDUs to the new RLC entity.
As the underlying protocol layers can lead to packet re-ordering, the
PDCP performs packet re-ordering to ensure in-order transmission of
data over the DRB. For this, the receiver holds back the received
packets until all earlier packets of the DRB have been received and
are delivered first. A reordering timer determines how long packets
Varga, et al. Expires 9 March 2025 [Page 12]
Internet-Draft Mobile Latency September 2024
are held back before delivery. In-order delivery leads to head-of-
line blocking, which means that a long packet delay of one PDU (e.g.,
due to a larger number of retransmissions) affects also earlier
packets. The impact of this head-of-line blocking is controlled via
the reordering timer, which may reduce head-of-line-induced latencies
at an increased risk of sending packets out of order. It is possible
to configure the PDCP also for explicit out-of-order delivery, in
which case no packet delay propagation within a group of PDUs
appears.
The PDCP can be configured for Service Data Unit (SDU) discard, which
enables to set a maximum lifetime on a packet in the radio
transmission. If a configured SDU discard timer expires, the PDCP
sender removes the packet from its buffer and requests the lower
layer to purge the related data. SDU discard can be considered as a
latency-based active queue management scheme.
The PDCP allows to aggregate multiple radio links over different
frequency carriers, based on the NR functionality of carrier
aggregation or dual-connectivity. The PDCP connection uses, in this
case, multiple RLC entities; this can be used to aggregate the
capacity of multiple radio links for the data radio bearer, but it
can also be used to provide redundant transmission. For redundant
transmission the PDCP entity duplicates PDCP PDUs and transmits them
via multiple links; at the PDCP receiver, duplicates are then
filtered out.
The PDCP uses one or more RLC channels, via one or more RLC
instances. RLC provides reliable data transmission over the radio
link via its acknowledged mode (AM); it can also be configured to
apply the unacknowledged mode (UM) In AM mode, a selective-repeat ARQ
protocol is used, in which correct reception of packets is ensured by
detecting packet errors or losses and triggering retransmissions as
needed. RLC transmitter and receiver entities maintain a sliding-
window buffer, and the receiver entity updates the transmitter entity
via status reports about correctly received or missing PDUs. The RLC
receiver forwards correctly received PDUs to the PDCP receiving
entity, which may comprise packets being delivered out-of-sequence.
Reordering for in-sequence delivery is then performed in PDCP. RLC
applies segmentation of SDUs towards the Medium Access Control (MAC)
layer, so that the MAC protocol can multiplex RLC PDUs into the
transport blocks sent by MAC to the physical layer.
From a packet delay perspective, minor latency contributions are made
by packet processing. The larger possible latency contribution in
acknowledged mode comes from the ARQ operation. A packet is
maintained in the receiver buffer until it is successfully
transmitted. For this, several RLC retransmissions can be used,
Varga, et al. Expires 9 March 2025 [Page 13]
Internet-Draft Mobile Latency September 2024
where the maximum number of retransmissions is configurable. An RLC
retransmission takes in the order of some tens of milliseconds, so
that it can lead to some increased delay of packets that are not
correctly transmitted in the first RLC transmission attempt. The
need for RLC retransmission depends strongly on the configuration of
the reliability that is configured for the lower MAC/PHY layers. For
time critical low latency communication, typically the MAC/PHY is
configured very reliably so that RLC retransmissions are not
necessary. This trade-off we discuss more.
MAC entities are responsible for scheduling the radio resources for
all bearers in UEs and gNB in both uplink and downlink directions.
The RLC data segments received from multiple logical channels are
concatenated along with MAC headers, padded if required, and then
encoded to fit inside the scheduled Transport Block (TB) to be
transmitted through the radio physical layer [NR-5G]. After the
successful reception of the TB, the counterpart MAC entity decodes
the TB and demultiplexes to the logical channels. Furthermore, the
HARQ process of the MAC layer is responsible for handling most of the
radio link errors. HARQ combines ARQ with Forward Error Correction
(FEC) to efficiently enhance the reliability of communication in
wireless channels. Via fast feedback the receiving MAC provides
positive (ACK) or negative acknowledgments (NACK) back to the
transmitter about successful TB decoding. One of the key functions
of the MAC entity at gNB is to perform radio resource allocation for
both Uplink (UL) and Downlink (DL) directions every TTI. The exact
resource allocation process, considering factors such as Channel
State Indicator (CSI), QoS requirements, and buffer occupancy, is
beyond the scope of this document. However, it is important to note
that the scheduler plays a crucial role in ensuring that the TB size
(TBS) aligns with the chosen Modulation and Coding Scheme (MCS) and
the number of Physical Resource Blocks (PRBs) allocated for the
transmission. In addition to the above functions, the MAC also
manages random access control during the initial access of UEs.
3.5. Latency Analysis
The latency analysis focuses on the following areas as contributors
to the latency:
* Processing delays at gNB and UE
* Traffic handling / queuing
* Data transmission over the radio interface
* Reliability mechanisms (like HARQ)
Varga, et al. Expires 9 March 2025 [Page 14]
Internet-Draft Mobile Latency September 2024
In addition, further delays may be incurred due to mobility of
devices or activating devices from power-saving idle states.
3.5.1. Processing delays in gNB and UE
For RAN processing in both UE and gNB, the most processing-intensive
functions are found in the physical layer. They comprise, e.g.,
channel equalization, channel encoding and decoding, Multiple-input
Multiple-output (MIMO) processing. As part of the 5G standardization
for URLLC, different UE capabilities with regards to processing times
have been defined. For UEs that support faster processing (i.e. "UE
capability 2"), this allows the scheduler in the gNB to accelerate
certain radio transmission procedures that depend on UE processing
times.
3.5.2. Traffic handling and queuing
In practical network situations a 5G network provides connectivity
for a large number of UEs and a potentially even larger number of
traffic flows. The gNB scheduler allocates the radio resources to
all UEs and traffic flows in a radio cell for both uplink and
downlink. In case that more traffic packets arrive at the wireless
5G transmitter than can be served in the next transmission time
interval, which is the scheduling period for which radio resources
are allocated, queuing occurs as not all traffic can be handled
instantaneously. The queuing of packets thus can introduce
additional packet delays.
To ensure that time-critical traffic flows are not impacted by large
queuing delays, traffic prioritization is defined. 5G applies a QoS
framework, where different traffic flows are separated (into so-
called QoS flows), and traffic handling and prioritization is
performed between those flows. By appropriate prioritization in the
scheduler, the impact of queuing can be minimized for time-critical
traffic flows. For this to work, it is also important that the total
number and aggregate traffic of time-critical traffic flows, that
should obtain priority in scheduling decisions, stays below some
threshold fraction of the total 5G network capacity. To this end,
admission control is applied when admitting new traffic flows.
3.5.3. Data transmission over the radio interface
The data transmission over the radio interface is significantly
impacted by the radio interface design and the frame structure. A
radio slot consists of 14 Orthogonal Frequency Division Multiplexing
(OFDM) symbols, where a flexible numerology with different options of
sub-carrier spacing can be applied, which leads to different slot
durations [SWD18][LSW19]. The common slot lengths in deployed 5G
Varga, et al. Expires 9 March 2025 [Page 15]
Internet-Draft Mobile Latency September 2024
networks have a length of 0.5 ms (based on 30 kHz sub-carrier
spacing) in frequency bands up to 6 GHz, and a length of 0.125 ms
(based on 120 kHz sub-carrier spacing). The transmission of user
data is scheduled by the scheduler per slot. 5G can be deployed in a
wide range of spectrum bands; multiple spectrum bands can be combined
by a 5G network. This includes frequency bands from 450 MHz up to
2.6 GHz which are based on frequency division duplex (FDD), which
means that uplink and downlink transmission is ongoing simultaneously
on different spectrum carriers. But above 2 GHz typically time-
division duplex (TDD) is applied, where the same spectrum carrier is
alternatingly used for uplink and downlink transmission. The
majority of 5G network deployments are based on TDD spectrum
allocations.
In principle, the 5G standard allows a very flexible configuration of
TDD patterns. In practice, there are constraints due to coexistence:
if two networks use different TDD patterns, this can cause
interference between these two networks. For local 5G network
deployments the choice of TDD pattern is more flexible, in particular
when indoors, since such networks are more isolated from other
networks and coexistence is easier. In today's (public) 5G networks
only a set of TDD patterns is used, which are often even with a
larger portion of radio resources being allocated to downlink, as
most data in public networks is downloaded to devices. From a
latency perspective the TDD pattern has a large impact on the
transmission latency, as it restricts at what time instances the
scheduler can allocate downlink or uplink resources for the
transmission of user data or control information (like HARQ
feedback).
Other latency-related improvements of the radio transmission include
pre-configured transmission opportunities for time-critical devices;
this can significantly reduce the time for a UE to obtain access to
the radio channel by avoiding an initial request procedure to the gNB
[SWD18][LSW19].
3.5.4. Wireless transmission reliability
A new paradigm has been introduced with the 5G standard to address
time-critical communications, for which features for URLLC have been
standardized. Those include shortened transmission procedures and
very robust transmission modes for data and control channels, to
significantly reduce the probability of unsuccessful radio
transmissions. In addition, a very effective way to provide
reliability in a time-varying wireless transmission context is the
application of ARQ.
Varga, et al. Expires 9 March 2025 [Page 16]
Internet-Draft Mobile Latency September 2024
By identifying packet losses and recovering them by retransmissions a
reliable transmission over 5G can be provided. Thereby a two-level
ARQ mechanism has proven to be very effective [LLM09]. A stop-and-
wait Hybrid ARQ mechanism with multiple parallel HARQ processes is
implemented in the MAC layer tightly coupled with the physical layer.
Fast HARQ feedback (i.e., acknowledgement of negative acknowledgement
of successful transmission, ACK or NACK) is enabled via physical
channels and allows for fast error recovery. In addition, HARQ is
integrated with channel coding by allowing to provide incremental
redundancy in the retransmission. This provides a very spectral
efficient recovery of transmission errors.
Moreover, a sliding window ARQ mechanisms is provided by the RLC
layer. It operates with full ARQ status reports about missing and
correctly received RLC PDUs, which are transmitted as RLC control
messages including a cyclic redundancy check and normal transmission
over the lower MAC/PHY layers. While the majority of transmission
errors are recovered by the MAC HARQ, there is a risk of residual
HARQ errors, for example due to failure of the binary HARQ feedback,
where HARQ NACK may be erroneously misinterpreted as ACK and lead to
a packet failure. It is not spectrally efficient to protect such
small HARQ signals with very high reliability. The RLC ARQ protocol
is well capable at recovering such HARQ failures to provide very high
reliability of data transmission. However, the retransmission round-
trip time (RTT) of RLC ARQ is significantly larger than the HARQ RTT.
For mobile broadband services the benefit of this coordinated two-
layer ARQ has been acknowledged as an efficient solution.
As shown in Figure 5, by expanding the service range of 5G to a wider
set of critical communication services the focus of latency
performance has shifted away from the best-effort latency
performance, e.g. expressed as mean packet delay, and which is a
relevant latency metric for typical mobile broadband (MBB)
applications. For time-critical services, the latency bound comes
into focus. To this end, the concept of reliability has been defined
in the 5G standardization, which expresses the probability that a
packet can be transmitted in a defined maximum delay. Latency
performance is thus expressed by a pair of metrics: the latency bound
and the reliability with which this bound can be provided.
Varga, et al. Expires 9 March 2025 [Page 17]
Internet-Draft Mobile Latency September 2024
Mobile BroadBand Time-critical Communication
Probability Probability
^ ^ Latency
| | Bound
| | .
| xx | xxxx .
| x x | x x .
| x x | x x .
| x x | x x .
| x x | x x .
| x x | x x .
| x xx | x x .
| x xxxxxx | x x.
+ xxxxx | x x
+-x----------------------x---> +-x---------.x----->
^ Latency .^ Latency
| |
Long tail Too late or lost
Figure 5: Time-critical communication with URLLC: from best
effort to bounded latency performance
The focus of 5G standardization so far was on the latency bound and
the reliability for time-critical services. For the integration of
5G with dependable end-to-end communication, e.g., based on TSN or
DetNet, packet delay variation may also be of importance.
Independent from the latency bound that is provided by 5G, it is
clear from the description above that 5G introduces a large PDV; the
relative PDV is significantly larger than the one found e.g., in
wired nodes.
4. Example: Observed characteristics in real network
This section contains real-world observations on packet delay
distribution from a 5G system. Through an empirical analysis
framework developed for 5G networks as described in [EDAF24], the
internal mechanisms in 5G contributing to the packet delay
distribution were investigated.
Varga, et al. Expires 9 March 2025 [Page 18]
Internet-Draft Mobile Latency September 2024
+--------------------------------------------------+
| Packet Delay |
| | 10 ms | 15 ms | 20 ms |
+==================================================+
|Cumulative probability| 99% | 99.99% | 99.999% |
+--------------------------------------------------+
|HARQ Re-transmission | 0.01% | 15% | 45% |
+--------------------------------------------------+
|RAN Transmission | 27% | 25% | 10% |
+--------------------------------------------------+
|RAN Segmentation | 43% | 40% | 30% |
+--------------------------------------------------+
|RAN Queuing Delay | 29.99% | 20% | 15% |
+--------------------------------------------------+
Figure 6: Measurement on internal mechanisms in 5G contributing
to the packet delay distribution
In an experiment conducted on OpenAirInterface 5G network [OAI5G] , a
traffic generator was deployed on a static UE with ideal coverage to
push packets every 10 ms on uplink direction. The end-to-end uplink
delay of each packet on a live 5G network was measured and
decomposed. Figure 6 on second row displays the cumulative
distribution of the packet delays which also indicates the packet's
delay violation probability for different delay targets. For
instance, it can be observed that 15 ms target was violated with
probability of 10e-2 while 20 ms target was violated with probability
of 10e-3. Such insight can be useful to incorporate when it comes to
determining end-to-end schedules as the violation probability
indicates the ratio of packets that will arrive later than the
determined window.
In addition, we measured the contribution percentage of 4 distinct
delay components to the packet delay violations: HARQ
retransmissions, RAN transmission, RAN segmentation, and RAN queuing.
Each of these processes contributes on a different level to the delay
violations, which is reported in percentage in Figure 6. For
instance, 15 ms target delay with violation probability of 10e-2 has
20% contribution from queuing delay, 40% from segmentation delay, 25%
from RAN transmission, and 15% from HARQ re-transmissions.
Varga, et al. Expires 9 March 2025 [Page 19]
Internet-Draft Mobile Latency September 2024
Regarding larger delay targets, where violations are less likely,
contribution of HARQ retransmissions starts to dominate, accounting
for up to 50% of the e2e delay. This trend was further evident in
all experiments, underscoring that the primary contributor to the
extended tail in packet delay is the infrequent yet impactful HARQ
re-transmissions.
5. Scheduling related future work
The packet delay characteristics of mobile transmissions has to be
considered by the packet scheduling performed at routers to provide
reliable end-to-end delay guarantees. Although scheduling is only
concerned with providing bounds on queuing delay, the node internal
forwarding delay is another integral part of end-to-end delay and
must be considered when calculating scheduling parameters or
analyzing an end-to-end schedule. The node internal forwarding delay
of mobile virtual DetNet routers causes a packet delay that is
stochastic and heavy-tailed, i.e., larger delay values are more
likely compared to exponentially bounded tails and packet delay
variation is relatively large. These properties will lead to the
following problems for end-to-end scheduling.
In case of clock-driven scheduling scenarios, similar to scheduled
traffic (time-aware shaper) [IEEE8021Qbv] of TSN, the end-to-end
scheduling requires the calculation of per-hop time-tables [SOL23] to
control packet forwarding:
* Bad reliability-efficiency trade-off: due to the large packet
delay variation, larger time windows have to be allocated to flows
to isolate flows in time and reliably guarantee delay bounds.
With non-work-conserving scheduling (i.e., exclusively allocated
time windows) this reduces the number of admitted flows or
bandwidth that can be utilized.
* Higher complexity of scheduling problem formulation and solution:
stochastic packet delay must be considered in the formulation for
calculating time-tables (e.g., Integer Linear Programming,
Constrained Programming). This might also increase the time to
calculate a feasible schedule or make decisions for admission
control.
In case of other (non-clock-driven) scheduling mechanisms, e.g.,
using static or dynamic priorities or hop-by-hop traffic shaping like
the TSN Credit-Based Shaper [IEEE8021Qav], Asynchronous Traffic
Shaper [IEEE8021Qcr], etc.:
Varga, et al. Expires 9 March 2025 [Page 20]
Internet-Draft Mobile Latency September 2024
* Higher complexity of end-to-end delay analysis: stochastic delay
with large delay variation needs to be considered in the analysis
methodology, e.g., in definition of arrival curves in network
calculus [MSL18], to derive tight delay bounds.
In future work, a detailed analysis for each individual scheduling
approach is required to analyze the specific impact of the packet
delay characteristic onto end-to-end delay bounds, end-to-end delay
variation, reliability-efficiency trade-off, runtime of schedule
synthesis and analysis, and other KPIs.
6. Summary
Wireless communication provides flexibility and simplicity, but with
inherently stochastic components that lead to packet delay
distributions metrics exceeding significantly those found in wired
counterparts. These deviations of stochastic characteristics make
traditional approaches to planning and configuration of end-to-end
time-critical communication networks such as Time-sensitive
Networking (TSN) or Deterministic Networking (DetNet), fall short in
their performance regarding service performance, scalability, and
efficiency.
Some traffic shaping mechanisms, like time-scheduled transmission
(i.e., IEEE 802.1Qbv), expect very deterministic latency behavior in
every node on the transmissions path. The latency distribution of a
5G system makes it impracticable to implement some legacy time-
schedule configurations. Therefore, to ensure wide integration and
interworking with wired deterministic technology such as TSN and
DetNet, it is desirable to develop wireless-friendly solutions to
ensure the end-to-end latency bounds of deterministic applications.
7. Acknowledgements
Authors extend their appreciation to James Gross, Gourav Prateek
Sharma, Janos Farkas, Marilet De Andrade Jardim, Gyorgy Miklos, and
Damir Hamidovic for their insightful comments and productive
discussion that helped to improve the document.
8. References
[D6G-D2.1] DETERMINISTIC6G Project, "D2.1: First report on 6G-centric
Enablers", 2023, .
Varga, et al. Expires 9 March 2025 [Page 21]
Internet-Draft Mobile Latency September 2024
[D6G-D3.1] DETERMINISTIC6G Project, "D3.1: Report on 6G Convergence
Enablers Towards Deterministic Communication Standards",
2023, .
[EDAF24] Mostafavi, S., Tillner, M., Sharma, G., and J. Gross, "An
End-to-End Delay Analytics Framework for 5G-and-Beyond
Networks", arXiv preprint arXiv:2401.09856, 2024.
[ETR20] Alriksson, F., Bostroem, L., Sachs, J., P. E. Wang, Y.,
and A. Zaidi, "Critical IoT connectivity Ideal for Time-
Critical Communications", Ericsson Technology Review,
DOI 10.23919/ETR.2020.9905508, 2020.
[FGAQoS] 5G-ACIA, "5G QoS for Industrial Automation", 2021,
.
[FGS15] 5G-SMART, "D1.5: Evaluation of radio network deployment
options", 2021, .
[IEEE8021Q]
IEEE, "IEEE Standard for Local and Metropolitan Area
Networks -- Bridges and Bridged Networks",
DOI 10.1109/IEEESTD.2018.8403927, July 2018,
.
[IEEE8021Qav]
IEEE, "IEEE Standard for Local and Metropolitan Area
Networks -- Amendment 12: Forwarding and Queuing
Enhancements for Time-Sensitive Streams",
DOI 10.1109/IEEESTD.2010.8684664, July 2018,
.
[IEEE8021Qbv]
IEEE, "IEEE Standard for Local and Metropolitan Area
Networks -- Amendment 25: Enhancements for Scheduled
Traffic", DOI 10.1109/IEEESTD.2016.8613095, 2015,
.
[IEEE8021Qcr]
IEEE, "IEEE Standard for Local and Metropolitan Area
Networks -- Amendment 34: Asynchronous Traffic Shaping",
DOI 10.1109/IEEESTD.2020.9253013, November 2020,
.
Varga, et al. Expires 9 March 2025 [Page 22]
Internet-Draft Mobile Latency September 2024
[LLM09] Larmo, A., Lindstroem, M., Meyer, M., Pelletier, G.,
Torsner, J., and H. Wiemann, "The LTE link-layer design",
IEEE Communications Magazine, vol. 47, no. 4, pp. 52-59,
DOI 10.1109/MCOM.2009.4907407, 2009.
[LSW19] Liberg, O., Sundberg, M., P. E. Wang, Y., Bergman, J.,
Sachs, J., and G. Wikstroem, "Cellular Internet of Things
- From Massive Deployments to Critical 5G Applications",
Academic Press, second edition, ISBN: 9780081029022, 2019.
[M23.501] 3GPP 23.501, "System architecture for the 5G System
(5GS)",
.
[MSL18] Mohammadpour, E., Stai, E., Mohiuddin, M., and J. Y. Le
Boudec, "Latency and Backlog Bounds in Time-Sensitive
Networking with Credit Based Shapers and Asynchronous
Traffic Shaping", DOI 10.1109/ITC30.2018.10053, 2018,
.
[NR-5G] Dahlman, E., Parkvall, S., and J. Skold, "5G NR - The next
generation wireless access technology", Academic Press ,
2021.
[OAI5G] Kaltenberger, F., Silva, A.P., Gosain, A., Wang, L., and
T.T. Nguyen, "OpenAirInterface: Democratizing innovation
in the 5G Era", Computer Networks 176,p.107284, 2020.
[RFC8655] Finn, N., Thubert, P., Varga, B., and J. Farkas,
"Deterministic Networking Architecture", RFC 8655,
DOI 10.17487/RFC8655, October 2019,
.
[SOL23] Stueber, T., Osswald, L., Lindner, S., and M. Menth, "LA
Survey of Scheduling Algorithms for the Time-Aware Shaper
in Time-Sensitive Networking (TSN)",
DOI 10.1109/ACCESS.2023.3286370, 2023,
.
[SWD18] Sachs, J., Wikstroem, G., Dudda, T., Baldemair, R., and K.
Kittichokechai, "5G radio network design for ultra-
reliable low-latency communication", IEEE Network vol. 32,
pp. 24-31, 2018.
Authors' Addresses
Varga, et al. Expires 9 March 2025 [Page 23]
Internet-Draft Mobile Latency September 2024
Balazs Varga
Ericsson
Magyar Tudosok krt. 11
1117 Budapest
Hungary
Email: balazs.a.varga@ericsson.com
Joachim Sachs
Ericsson
Germany
Email: joachim.sachs@ericsson.com
Frank Duerr
University of Stuttgart
Universitaetsstr. 38
70569 Stuttgart
Germany
Email: frank.duerr@ipvs.uni-stuttgart.de
Samie Mostafavi
KTH Royal Institute of Technology
Stockholm
Sweden
Email: ssmos@kth.se
Varga, et al. Expires 9 March 2025 [Page 24]