Efficient Reconciliation and Flow Control for Anti-Entropy Protocols

Similar documents
The CIP Motion Peer Connection for Real-Time Machine to Machine Control

17 October About H.265/HEVC. Things you should know about the new encoding.

Department of Computer Science, Cornell University. fkatej, hopkik, Contact Info: Abstract:

Digital Video Engineering Professional Certification Competencies

Timing Error Detection: An Adaptive Scheme To Combat Variability EE241 Final Report Nathan Narevsky and Richard Ott {nnarevsky,

Building Trust in Online Rating Systems through Signal Modeling

Milestone Solution Partner IT Infrastructure Components Certification Report

National Park Service Photo. Utah 400 Series 1. Digital Routing Switcher.

Gamma instabus. Technical product information

Design for Testability

Cost-Aware Live Migration of Services in the Cloud

Milestone Leverages Intel Processors with Intel Quick Sync Video to Create Breakthrough Capabilities for Video Surveillance and Monitoring

Asynchronous inputs. 9 - Metastability and Clock Recovery. A simple synchronizer. Only one synchronizer per input

Transparent low-overhead checkpoint for GPU-accelerated clusters

R&S VENICE On air. 24/7.

Processes for the Intersection

Broadcast Networks with Arbitrary Channel Bit Rates

A Light Weight Method for Maintaining Clock Synchronization for Networked Systems

Vicon Valerus Performance Guide

GLI-12 V1.1 GLI 12 V2.0

The Promise, and Limitations, of Gossip Protocols

V9A01 Solution Specification V0.1

Multicore Design Considerations

Challenges of Launching DOCSIS 3.0 services. (Choice s experience) Installation and configuration

Point-to-Point Links

Koester Performance Research Koester Performance Research Heidi Koester, Ph.D. Rich Simpson, Ph.D., ATP

Using the VideoEdge IP Encoder with Intellex IP

Frame Processing Time Deviations in Video Processors

Efficient GPU Synchronization without Scopes: Saying No to Complex Consistency Models

Hands-On Real Time HD and 3D IPTV Encoding and Distribution over RF and Optical Fiber

Transparent Computer Shared Cooperative Workspace (T-CSCW) Architectural Specification

Datasheet. Full-Duplex, Point-to-Point Gigabit Radio. Models: AF-24, AF-24HD, AF-5, AF-5U. High Performance Wireless Backhaul

Contents on Demand Architecture and Technologies of Lui

Film Grain Technology

Retiming Sequential Circuits for Low Power

tech paper 2015 Effective feedback control

Benchtop Portability with ATE Performance

QCN Transience and Equilibrium: Response and Stability. Abdul Kabbani, Rong Pan, Balaji Prabhakar and Mick Seaman

Scan. This is a sample of the first 15 pages of the Scan chapter.

CHP Max Headend Optics Platform CHP CORWave II

THE ASTRO LINE SERIES GEMINI 5200 INSTRUCTION MANUAL

Put your sound where it belongs: Numerical optimization of sound systems. Stefan Feistel, Bruce C. Olson, Ana M. Jaramillo AFMG Technologies GmbH

Simulation Study of the Spectral Capacity Requirements of Switched Digital Broadcast

Datasheet. High-Performance airmax Bridge. Models: NBE M5-19, NBE-M5-16. Uniform Beamwidth Maximizes Noise Immunity. Innovative Mechanical Design

SCTE OPERATIONAL PRACTICE

Dual Frame Video Encoding with Feedback

Combinational vs Sequential

Compressed-Sensing-Enabled Video Streaming for Wireless Multimedia Sensor Networks Abstract:

MULTIPLE TPS REHOST FROM GENRAD 2235 TO S9100

Title: Members: Sponsors: Project Narrative: Small Projector Array Display System. Nicholas Futch, Ryan Gallo, Chris Rowe, Gilbert Duverglas

System Quality Indicators

Bridging the Gap Between CBR and VBR for H264 Standard

Optimization of Multi-Channel BCH Error Decoding for Common Cases. Russell Dill Master's Thesis Defense April 20, 2015

PixelNet. Jupiter. The Distributed Display Wall System. by InFocus. infocus.com

Personal Mobile DTV Cellular Phone Terminal Developed for Digital Terrestrial Broadcasting With Internet Services

NTSC/PAL. Network Interface Board for MPEG IMX TM. VTRs BKMW-E2000 TM

Processor time 9 Used memory 9. Lost video frames 11 Storage buffer 11 Received rate 11

With a history dating back to 1770, the University of Technology Berlin (Technische Universität Berlin, also

SWITCHED INFINITY: SUPPORTING AN INFINITE HD LINEUP WITH SDV

AltiumLive 2017: Effective Methods for Advanced Routing

GROUNDBREAKING INNOVATIONS FOR DYNAMIC LIGHTING

medlab One Channel ECG OEM Module EG 01000

100Gb/s Single-lane SERDES Discussion. Phil Sun, Credo Semiconductor IEEE New Ethernet Applications Ad Hoc May 24, 2017

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2011

Boundless Security Systems, Inc.

Datasheet. Dual-Band airmax ac Radio with Dedicated Wi-Fi Management. Model: B-DB-AC. airmax ac Technology for 300+ Mbps Throughput at 5 GHz

HIGH SPEED ASYNCHRONOUS DATA MULTIPLEXER/ DEMULTIPLEXER FOR HIGH DENSITY DIGITAL RECORDERS

John Stankey President and CEO AT&T Operations

Image Acquisition Technology

Appendix Y: Queuing Models and Applications

Performance Evaluation of Error Resilience Techniques in H.264/AVC Standard

spiff manual version 1.0 oeksound spiff adaptive transient processor User Manual

North America, Inc. AFFICHER. a true cloud digital signage system. Copyright PDC Co.,Ltd. All Rights Reserved.

A, B B, C. Internetwork Router. A, C Gossip Server

Development of Multiple Beam Guns for High Power RF Sources for Accelerators and Colliders

VIBRIO. User Manual. by Toast Mobile

Oscilloscopes for debugging automotive Ethernet networks

DVR or NVR? Video Recording For Multi-Site Systems Explained DVR OR NVR? 1

Jupiter PixelNet. The distributed display wall system. infocus.com

Thought Technology Ltd Belgrave Avenue, Montreal, QC H4A 2L8 Canada

2 MHz Lock-In Amplifier

T-Mobile AWS Filter Implementation Progress Report

Synchronization Check Relay ARGUS 7

Double Patterning OPC and Design for 22nm to 16nm Device Nodes

DELTA MODULATION AND DPCM CODING OF COLOR SIGNALS

HELICAL SCAN TECHNOLOGY: ADVANCEMENT BY DESIGN

Introduction to HSR&PRP. HSR&PRP Basics

Digital Video Telemetry System

Jin-Fu Li Advanced Reliable Systems (ARES) Laboratory. National Central University

Solutions to Embedded System Design Challenges Part II

2 2 Relay outputs. M DIN W72 H7mm. LE7 Weekly/Yearly timer

A Video Frame Dropping Mechanism based on Audio Perception

LED control gear Compact dimming. Uconverter LCAI 2x38 W 0500 K013 one4all ECO series. Ordering data

AUTOMATING YOUR SUCCESS

Paul Dauncey For the CALICE-UK electronics group. A. Baird, D. Bowerman, P. Dauncey, R. Halsall, M. Postranecky, M.Warren, O.

Predicting the immediate future with Recurrent Neural Networks: Pre-training and Applications

Datasheet. Full-Duplex, Point-to-Point Gigabit Radio. Tel: +44 (0) Fax: +44 (0)

FORWARD PATH TRANSMITTERS

HYL-080D1750G358. Constant current LED driver DALI Dimmable. LED Driver. Product description. Benefits. Interfaces.

MISO - EPG DATA QUALITY INVESTIGATION

Transcription:

Efficient Reconciliation and Flow Control for Anti-Entropy Protocols Robbert van Renesse Dan Dumitriu Valient Gough Chris Thomas Work done at Amazon.com (2006)

Gossip at Amazon Ubiquitous Monitoring and Configuration (Astrolabe) Eventual Consistency (Dynamo) Failure Detection (S3)

Gossip Protocols Basic idea: each node executes periodically p := selectrandompeer(); peerstate := p.getstate(); mystate := me.getstate(); newstate := merge(mystate, peerstate); p.putstate(newstate); me.putstate(newstate);

Gossip cont d Pioneered by Al Demers et al. 1987 (includes Doug Terry in audience) Salient properties: Propagates in time proportional to log(#peers) Tolerates host failures and message loss Behavior easily modeled

Two types of gossip Rumor Mongering Gossip for some time Every message is important Useful for reliable broadcast Anti-Entropy Gossip until obsolete Only last update is important Useful for eventual consistency

Problems with Anti-Entropy Synchronous communication channel Capacity limited by available network capacity and CPU for handling updates When overloaded, updates may back up Tuning involves Setting gossip rate Setting maximum message size Tuning affects the capacity of the channel

State of a Gossiper (may only write own row) Request Rate Number of Items Number of Clients Venus 0.5 / 21 2300 / 12 3 / 25 Mars 1.3 / 11 1432 / 24 4 / 12 Jupiter 0.2 / 12 13298 / 3 10 / 13 Value: 0.2 Version : 12 Only last versions are relevant

State Merge Request Rate Number of Items Number of Clients Venus 0.5 / 21 2300 / 12 3 / 25 Mars 1.3 / 11 1432 / 24 5 / 14 Request Rate Number of Items Number of Clients Venus 0.5 / 21 2400 / 13 3 / 25 Mars 1.3 / 11 1432 / 24 5 / 14 Jupiter 0.2 / 12 13298 / 3 10 / 13 Request Rate Number of Items Number of Clients Venus Jupiter 0.5 / 21 0.2 / 12 2400 / 13 13298 / 3 3 / 25 10 / 13 Merge protocol exchanges deltas

Bandwidth Limited Limited available b/w per gossip exchange Can t send all deltas every time (or even ever) Limited bandwidth, limited CPU Need to prioritize Two parts to this talk 1. Initially assume b/w is fixed and consider merge 2. Then assume b/w depends on background load, and consider flow control

Baseline: Precise Reconciliation Focus of much research in the area Byers, Considine, Mitzenmacher 2002 Minsky, Trachtenberg, Zippel 2003 If bandwidth is limited, can only send subset. Two obvious choices: 1. Send most out-of-date updates first Seems fair 2. Send most recent updates first Kills obsolete updates faster, but may lead to starvation Both have high CPU overhead Hash functions, Bloom filters, Merkle trees,

Scuttlebutt Reconciliation Simple: one version number per participant

Assigning Version Numbers Key Value Version Key Value Version Reqs/s 0.5 5 Reqs/s 0.6 11 #items 123 8 #items 123 8 #clients 4 9 #clients 4 9 Key Value Version Key Value Version Reqs/s 0.7 10 Reqs/s 0.6 11 #items 123 8 #items 126 12 #clients 4 9 #clients 4 9 Note: never two attributes with the same version number

Gossiping: Two Phases Venus: Jupiter: Max(Version) Max(Version) I Venus Mars 6 12 Venus Mars 4 14 Jupiter 17 Jupiter 18 II Venus attributes with versions 5 and 6 Mars attributes with versions 13 and 14 Jupiter s attribute with version 18

Scuttlebutt convergence May not eliminate all diffs in single exchange But it *does* converge to consistent state, even when only a subset of updates are exchanged

Simulation Experiments 128 gossipers 64 attributes / gossipers (total: 8192 attrs) Updates: uniform (similar results with Zipf) Gossip once a second MTU: 100 diffs

Maximum Staleness Updates / sec: 128 256 128 0

# stale attributes Updates / sec: 128 256 128 0

Flow Control Merge alone cannot solve overload problem Flow Control: determine the maximum rate at which a peer can submit updates Requirements: Optimal Fair Adaptive

Fairness Accomplished through gossip itself Each participant maintains a maximum update rate at which it will submit updates When participants gossip, they split the difference between max. rates

Local Adaptation AIMD approach, a la TCP If gossip message overflows, then reduce maximum rate by a percentage If gossip message underflows, then increase rate additively

Maximum Update Rate MTU: 100 MTU: 50

Maximum Staleness MTU: 100 MTU: 50

Conclusion In overload situation, gossip does not provide predictable performance We contributed A low overhead reconciliation mechanism Flow Control for anti-entropy protocols