The time elapsed between sending a request and receiving the response. Latency affects user experience and may be specified in SLAs. High latency can undermine real-time use cases; latency is often tested under realistic load conditions.
Latency
C
T
See: Inference; Service Level Agreement / Service Level Objective; Throughput
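Example: a minimal sketch of how the latency defined in this entry might be measured and summarized as percentiles, the form in which latency targets are commonly stated. The names `measure_latency`, `percentile`, and `fake_request` are hypothetical and chosen for illustration only; the nearest-rank percentile method is an assumption, not a method prescribed by this glossary.

```python
import math
import time


def measure_latency(send_request, n_requests):
    """Return the wall-clock latency, in seconds, of each of n_requests calls."""
    latencies = []
    for _ in range(n_requests):
        start = time.perf_counter()  # monotonic, high-resolution clock
        send_request()               # blocks until the response has arrived
        latencies.append(time.perf_counter() - start)
    return latencies


def percentile(samples, q):
    """Nearest-rank percentile of samples for 0 <= q <= 100."""
    if not samples:
        raise ValueError("samples must not be empty")
    ordered = sorted(samples)
    # Nearest-rank: smallest value with at least q% of samples at or below it.
    rank = max(1, math.ceil(q / 100 * len(ordered)))
    return ordered[rank - 1]


if __name__ == "__main__":
    def fake_request():
        # Hypothetical stand-in for a real request/response round trip.
        time.sleep(0.001)

    samples = measure_latency(fake_request, 50)
    print(f"p50: {percentile(samples, 50) * 1000:.2f} ms")
    print(f"p95: {percentile(samples, 95) * 1000:.2f} ms")
```

This sketch issues requests one at a time, so it does not reproduce realistic load; a load test would typically send requests concurrently and at a controlled rate before computing the same percentiles.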