QCN Transience and Equilibrium: Response and Stability. Abdul Kabbani, Rong Pan, Balaji Prabhakar and Mick Seaman

Similar documents
Dual frame motion compensation for a rate switching network

ECM and E 2 CM performance under bursty traffic. Cyriel Minkenberg & Mitch Gusat IBM Research GmbH, Zurich April 26, 2007

Using the MAX3656 Laser Driver to Transmit Serial Digital Video with Pathological Patterns

Introduction. Packet Loss Recovery for Streaming Video. Introduction (2) Outline. Problem Description. Model (Outline)

Dual Frame Video Encoding with Feedback

SWITCHED INFINITY: SUPPORTING AN INFINITE HD LINEUP WITH SDV

MPEG-4 Video Transfer with TCP-Friendly Rate Control

Joint source-channel video coding for H.264 using FEC

Joint Optimization of Source-Channel Video Coding Using the H.264/AVC encoder and FEC Codes. Digital Signal and Image Processing Lab

DELTA MODULATION AND DPCM CODING OF COLOR SIGNALS

Processes for the Intersection

Dual frame motion compensation for a rate switching network

CHAPTER 2 SUBCHANNEL POWER CONTROL THROUGH WEIGHTING COEFFICIENT METHOD

Transport Stream. 1 packet delay No delay. PCR-unaware scheme. AAL5 SDUs PCR PCR. PCR-aware scheme PCR PCR. Time

SAVE: An Algorithm for Smoothed Adaptive Video over Explicit Rate Networks

Therefore, HDCVI is an optimal solution for megapixel high definition application, featuring non-latent long-distance transmission at lower cost.

Feasibility Study of Stochastic Streaming with 4K UHD Video Traces

Integrated end-end buffer management and congestion control for scalable video communications

Multimedia Networking

Meeting Embedded Design Challenges with Mixed Signal Oscilloscopes

Precision testing methods of Event Timer A032-ET

Example: compressing black and white images 2 Say we are trying to compress an image of black and white pixels: CSC310 Information Theory.

A look at the MPEG video coding standard for variable bit rate video transmission 1

A GoP Based FEC Technique for Packet Based Video Streaming

Implementation of an MPEG Codec on the Tilera TM 64 Processor

ECSE-323 Digital System Design. Datapath/Controller Lecture #1

Controlling adaptive resampling

PACKET-SWITCHED networks have become ubiquitous

data and is used in digital networks and storage devices. CRC s are easy to implement in binary

Electro-Optic Beam Deflectors

ZONE PLATE SIGNALS 525 Lines Standard M/NTSC

Long and Fast Up/Down Counters Pushpinder Kaur CHOUHAN 6 th Jan, 2003

Motion Video Compression

Optimal Coding Rate Control of Scalable and Multi Bit Rate Streaming Media

Adjusting Forward Error Correction with Temporal Scaling for TCP-Friendly Streaming MPEG

Joint use of LTP and Erasure FEC for space environments (ECLSA 2.0)

A Low-Power 0.7-V H p Video Decoder

Case Study Monitoring for Reliability

Comment #147, #169: Problems of high DFE coefficients

MULTI-STATE VIDEO CODING WITH SIDE INFORMATION. Sila Ekmekci Flierl, Thomas Sikora

An Effective Filtering Algorithm to Mitigate Transient Decaying DC Offset

Using Software Feedback Mechanism for Distributed MPEG Video Player Systems

Technology Scaling Issues of an I DDQ Built-In Current Sensor

Design of Fault Coverage Test Pattern Generator Using LFSR

White Paper. Video-over-IP: Network Performance Analysis

Evaluation of SGI Vizserver

Updates for the Back-to-back Frame Benchmark

MC9211 Computer Organization

Analysis of Video Transmission over Lossy Channels

Digital Representation

6.UAP Project. FunPlayer: A Real-Time Speed-Adjusting Music Accompaniment System. Daryl Neubieser. May 12, 2016

Second-Order Systems

Learning Musicianship for Automatic Accompaniment

AN IMPROVED ERROR CONCEALMENT STRATEGY DRIVEN BY SCENE MOTION PROPERTIES FOR H.264/AVC DECODERS

An Introduction to the Spectral Dynamics Rotating Machinery Analysis (RMA) package For PUMA and COUGAR

MULTIMEDIA COMPRESSION AND COMMUNICATION

Jin-Fu Li Advanced Reliable Systems (ARES) Laboratory. National Central University

Profiling techniques for parallel applications

Bit Rate Control for Video Transmission Over Wireless Networks

Readout techniques for drift and low frequency noise rejection in infrared arrays

Modeling and Evaluating Feedback-Based Error Control for Video Transfer

A HIGH THROUGHPUT CABAC ALGORITHM USING SYNTAX ELEMENT PARTITIONING. Vivienne Sze Anantha P. Chandrakasan 2009 ICIP Cairo, Egypt

Peak Dynamic Power Estimation of FPGA-mapped Digital Designs

An Improved Fuzzy Controlled Asynchronous Transfer Mode (ATM) Network

PTP: Backbone of the SMPTE ST2110 Deployment

IEEE802.11a Based Wireless AV Module(WAVM) with Digital AV Interface. Outline

Performance Evaluation of Error Resilience Techniques in H.264/AVC Standard

Multimedia Communications. Video compression

Performance Driven Reliable Link Design for Network on Chips

medlab One Channel ECG OEM Module EG 01000

Multimedia Communications. Image and Video compression

High Performance Raster Scan Displays

II. SYSTEM MODEL In a single cell, an access point and multiple wireless terminals are located. We only consider the downlink

RECOMMENDATION ITU-R BT Studio encoding parameters of digital television for standard 4:3 and wide-screen 16:9 aspect ratios

Time Domain Simulations

Politecnico di Torino HIGH SPEED AND HIGH PRECISION ANALOG TO DIGITAL CONVERTER. Professor : Del Corso Mahshid Hooshmand ID Student Number:


sr c0 c3 sr c) Throttled outputs Figure F.1 Bridge design models

Chapter 10 Basic Video Compression Techniques

1022 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 19, NO. 4, APRIL 2010

Real-time spectrum analyzer. Gianfranco Miele, Ph.D

A Statistical Framework to Enlarge the Potential of Digital TV Broadcasting

Pattern Smoothing for Compressed Video Transmission

FX Basics. Time Effects STOMPBOX DESIGN WORKSHOP. Esteban Maestre. CCRMA Stanford University July 2011

THE CAPABILITY of real-time transmission of video over

MTL Software. Overview

Fault Detection And Correction Using MLD For Memory Applications

Frame Relay Congestion Control

Synthesis Technology E102 Quad Temporal Shifter User Guide Version 1.0. Dec

25.5 A Zero-Crossing Based 8b, 200MS/s Pipelined ADC

IN A SERIAL-LINK data transmission system, a data clock

IT T35 Digital system desigm y - ii /s - iii

Research Article. ISSN (Print) *Corresponding author Shireen Fathima

INDIAN INSTITUTE OF TECHNOLOGY KHARAGPUR NPTEL ONLINE CERTIFICATION COURSE. On Industrial Automation and Control

FRAMES PER MULTIFRAME SLOTS PER TDD - FRAME

Digital Correction for Multibit D/A Converters

H-Ternary Line Decoder for Digital Data Transmission: Circuit Design and Modelling

A variable bandwidth broadcasting protocol for video-on-demand

A Light Weight Method for Maintaining Clock Synchronization for Networked Systems

WINTER 15 EXAMINATION Model Answer

Transcription:

QCN Transience and Equilibrium: Response and Stability Abdul Kabbani, Rong Pan, Balaji Prabhakar and Mick Seaman

Outline of presentation 2-QCN Overview and method for improving transient response Equilibrium and Scalability Large number of sources and/or large RTTs Conclusions 2

Basic QCN 2-point architecture: Reaction Point -- Congestion Point 1. Congestion Points: Sample packets, compute feedback (Fb), quantize Fb to 6 bits, and reflect only negative Fb values to Reaction Point with a probability proportional to Fb. Fb = -(q off + w q delta ) = -(queue offset + w.rate offset) Reflection Probability Pmin P max Fb 2. Reaction Points: Transmit regular Ethernet frames. When congestion message arrives: perform multiplicative decrease, fast recovery and active increase. Fast recovery similar to BIC-TCP: gives high performance in high bandwidthdelay product networks, while being very simple. 3

Other Features Extra fast recovery Useful for coping with bursty drops, matches offered rate to available capacity Drift: Timer-based, gentle rate increase For failsafe operation, improves fairness by preferentially drifting low rate sources Fb-hat: Congestion estimation at the source Allows a source to recover available bandwidth quicker Will go over this today 4

Grabbing Extra Bandwidth: An Algorithm Estimate congestion at the source Maintain an estimate of Fb, say Fb-hat, at each RL Fb-hat is counted using a 5-bit saturation counter Fb-hat is thought of as a source s estimate of congestion Updating Fb-hat For every Fb recd by RL: Fb-hat <-- Fb-hat + Fb For every 50 pkts transmitted: Fb-hat <-- Fb-hat/2 (just right shift) Using Fb-hat: cycle-shrinking Every time we begin a cycle of FR or AI If Fb-hat is small (e.g. 0 or 1): reduce length of cycle to 50 pkts from 100 pkts Idea: small Fb-hat implies no dings for a while, hence it is likely there is no congestion; so a source can quickly get to AI and grab extra bdwdth Note: in equilibrium, Fb-hat will be more than 1, hence no cycle-shrinking should occur, hence stability is preserved 5

A Pictorial View Consider output-generated hotspot on a 10 G link 10 G 10 G S 1 S 2 0.5G S N Small, well-spaced Fb values Fb-hat is small Large, closely-spaced Fb values Fb-hat is large Fb=0 values (not signaled) Fb-hat goes to zero 6

A Principle Introducing Fb-hat symmetrizes the source and switch behavior Switch Has input: Packets or source rates Observes: Qoff, Qdelta Goal: Drive Q to Qeq and Qdelta to zero Action taken to achieve goal: Send Fb signals to sources RL Has input: Fb signals from network Observes: Fb-hat Goal: Drive Fb-hat close to zero (i.e. just above 1, the threshold) Action taken: Change transmission rate This is a primal-dual algorithm for congestion management Primal variable, source rate: Input to switch but output from RL Dual variable, Fb: Input to RL but output from switch 7

Distributed Control Primal algorithm: The control laws situated at the source, switch has static functions. Dual algorithm: Control laws situated at switch, source has static functions. QCN without Fb-hat already primal-dual, Fb-hat makes it more so. A principle of distributed control: The switch and source (or RL) pass just the right signals to each other so as to solve the global bandwidth partitioning problem in a distributed fashion Clearly, other algorithms can be obtained from this principle; e.g. we have tried 1. Cycle lengths of 25, 50 and 100 pkts depending on Fb-hat values 2. Stretching cycle lengths to 150 pkts if Fb-hat is large 3. Letting Fb-hat go negative; this lets source know with more certainty that bdwdth is available These improve the transient response further But they introduce a complexity--performance trade-off 8

Improvement in Transience 9

Simulations: OG Hotspot Parameters 10 sources share a 10 G link, whose capacity drops to 0.5G during 2-4 secs Max offered rate per source: 1.05G RTT = 40 microseconds Buffer size = 100 pkts; Qeq = 22 Fb-hat saturated at 31 FR cycle-shrinking: 50 pkts if Fb-hat is 0 or 1, 100 pkts otherwise AI: also 50 or 100 pkts depending on Fb-hat as above AI amount: 25 Mbps 10 G 10 G Source 1 Source 2 0.5G Source 10 10

0.5G Bottleneck: Rate Old 2-QCN: 202 msec New 2-QCN: 136 msec 3-QCN: 96 msec Note: We have seen recovery times as low as 113 msecs, the number above is on the high side; the average was around 125 msecs 11

0.5G Bottleneck: Rate, Zoomed-in 12

0.5G Bottleneck: Queue, Zoomed-in 13

0.5G, 1G, 2G Bottleneck: Queue, Zoomed-in 14

0.5G, 1G, 2G Bottleneck: Rate, Zoom 15

Conclusions for Transience Seen a method for the source to monitor congestion and quickly grab available bandwidth The algorithm we have used is the simplest possible Enhanced versions (different cycle lengths, Fb-hat negative, hyperactive increase) certainly improve the recovery time; in fact, one can match or beat 3-QCN. However, it seems better to choose a simple version because it is adequate, esp in real deployments where flows will arrive and depart. Need to draw a line in trade-off space. 16

Scalability, Stability 17

Scalability Analysis This refers to understanding the behavior of QCN when N, the number of sources, and the RTT get very large; main question How does stability depends on parameters? Ultimately this is addressed through a theoretical model. We use a Markov chain based model, different from the standard linearized analysis using differential equations. The Markov model contains more information, because it is the source of the differential equation model. The differential equations describe the mean behavior of the system; i.e. average rates, not instantaneous; whereas the Markov model gives the complete stochastic description. Linearization around the operating point gives local stability. Finally, the Markov model can be used to model both equilibrium and transience. We are developing the theory; for now, we use simulations to observe stability wrt large numbers of sources and RTT. 18

Simulations Parameters N sources share a 10 G link Starting rate of each source = 100/N G RTT = 40, 100, 200, 300, 400 microseconds Buffer size = 100 pkts; Qeq = 22 Fb-hat saturated at 31 FR cycle-shrinking: 50 pkts if Fb-hat is 0 or 1, 100 pkts otherwise AI: also 50 or 100 pkts depending on Fb-hat as above AI amount: 25 Mbps Source 1 Source 2 Source N 10 G Destination 19

Varying # of Sources: Queue Size RTT = 40 usecs 20

Varying # of Sources: Rate RTT = 40 usecs 21

QCN: Distribution of flows in FR, AI 22

Varying RTT: Queue Size Number of Sources = 10 23

Varying RTT: Rate Number of Sources = 10 24

100 sources, RTT = 400 usecs w=2 vs w=4 Since Fb = q off + wq delta, increasing w weights the derivative more and hence leads to less wiggly queues. But because of the springiness of binary increase, even w = 2 performs well at large RTTs. 25

Varying RTT and # of Sources: Queue Size 26

Varying RTT and # of Sources: Rate 27

Conclusions QCN shows good stability when both the number of sources and RTT are varied. We are currently developing a Markov chain model to analyze the scaling behavior that would capture both the equilibrium and transient modes. 28