Pushing the Communication Barrier in 2PC using Lookup Tables

Similar documents
University of California at Berkeley College of Engineering Department of Electrical Engineering and Computer Science. EECS150, Spring 2011

FPGA Design. Part I - Hardware Components. Thomas Lenzi

Registers and Counters

Registers and Counters

MUHAMMAD NAEEM LATIF MCS 3 RD SEMESTER KHANEWAL

Chapter 7 Memory and Programmable Logic

DIGITAL SYSTEM DESIGN UNIT I (2 MARKS)

Asynchronous IC Interconnect Network Design and Implementation Using a Standard ASIC Flow

Chapter 4. Logic Design

problem maximum score 1 28pts 2 10pts 3 10pts 4 15pts 5 14pts 6 12pts 7 11pts total 100pts

Chapter 3. Boolean Algebra and Digital Logic

MODU LE DAY. Class-A, B, AB and C amplifiers - basic concepts, power, efficiency Basic concepts of Feedback and Oscillation. Day 1

DEDICATED TO EMBEDDED SOLUTIONS

Midterm Exam 15 points total. March 28, 2011

NH 67, Karur Trichy Highways, Puliyur C.F, Karur District UNIT-III SEQUENTIAL CIRCUITS

Digital Electronics II 2016 Imperial College London Page 1 of 8

FE REVIEW LOGIC. The AND gate. The OR gate A B AB A B A B 0 1 1

Chapter Contents. Appendix A: Digital Logic. Some Definitions

Sequential Logic. E&CE 223 Digital Circuits and Systems (A. Kennings) Page 1

Lab #12: 4-Bit Arithmetic Logic Unit (ALU)

1. Convert the decimal number to binary, octal, and hexadecimal.

A Fast Constant Coefficient Multiplier for the XC6200

Laboratory Exercise 3

Sequential Design Basics

PLTW Engineering Digital Electronics Course Outline

Logic. Andrew Mark Allen March 4, 2012

High Performance Carry Chains for FPGAs

Logic and Computer Design Fundamentals. Chapter 7. Registers and Counters

Digital Electronics Course Outline

Combinational Logic Design

Objectives. Combinational logics Sequential logics Finite state machine Arithmetic circuits Datapath

Designing an Efficient and Secured LUT Approach for Area Based Occupations

The Design of Efficient Viterbi Decoder and Realization by FPGA

Microprocessor Design

6.3 Sequential Circuits (plus a few Combinational)

Lecture 2: Digi Logic & Bus

CS 110 Computer Architecture. Finite State Machines, Functional Units. Instructor: Sören Schwertfeger.

ENGG2410: Digital Design Lab 5: Modular Designs and Hierarchy Using VHDL

CHAPTER 6 ASYNCHRONOUS QUASI DELAY INSENSITIVE TEMPLATES (QDI) BASED VITERBI DECODER

Logisim: A graphical system for logic circuit design and simulation

L11/12: Reconfigurable Logic Architectures

Reconfigurable FPGA Implementation of FIR Filter using Modified DA Method

Introduction to Digital Logic Missouri S&T University CPE 2210 Exam 3 Logistics

DIGITAL CIRCUIT LOGIC UNIT 9: MULTIPLEXERS, DECODERS, AND PROGRAMMABLE LOGIC DEVICES

From Theory to Practice: Private Circuit and Its Ambush

Laboratory Exercise 6

Simple Link Protocol (SLP)

ALONG with the progressive device scaling, semiconductor

A Delay-based PUF Design Using Multiplexer Chains

R13 SET - 1 '' ''' '' ' '''' Code No: RT21053

CS 61C: Great Ideas in Computer Architecture

Final Examination (Open Katz, Calculators OK, 3 hours)

Design of Memory Based Implementation Using LUT Multiplier

Department of Computer Science and Engineering Question Bank- Even Semester:

Implementation of Low Power and Area Efficient Carry Select Adder

Using minterms, m-notation / decimal notation Sum = Cout = Using maxterms, M-notation Sum = Cout =

Clock Domain Crossing. Presented by Abramov B. 1

L12: Reconfigurable Logic Architectures

VLSI Based Minimized Composite S-Box and Inverse Mix Column for AES Encryption and Decryption

Chapter 3 Unit Combinational

R13. II B. Tech I Semester Regular Examinations, Jan DIGITAL LOGIC DESIGN (Com. to CSE, IT) PART-A

Department of CSIT. Class: B.SC Semester: II Year: 2013 Paper Title: Introduction to logics of Computer Max Marks: 30

HDL & High Level Synthesize (EEET 2035) Laboratory II Sequential Circuits with VHDL: DFF, Counter, TFF and Timer

ISSN:

FPGA Based Implementation of Convolutional Encoder- Viterbi Decoder Using Multiple Booting Technique

FPGA Design with VHDL

To design a sequential logic circuit using D-Flip-flop. To implement the designed circuit.

Experiment 8 Introduction to Latches and Flip-Flops and registers

Memory elements. Topics. Memory element terminology. Variations in memory elements. Clock terminology. Memory element parameters. clock.

PURBANCHAL UNIVERSITY

UNIVERSITI TEKNOLOGI MALAYSIA

Contents Circuits... 1

Principles of Computer Architecture. Appendix A: Digital Logic

Design And Implimentation Of Modified Sqrt Carry Select Adder On FPGA

Design and Implementation of Data Scrambler & Descrambler System Using VHDL

6.S084 Tutorial Problems L05 Sequential Circuits

Module 4:FLIP-FLOP. Quote of the day. Never think you are nothing, never think you are everything, but think you are something and achieve anything.

1ms Column Parallel Vision System and It's Application of High Speed Target Tracking

CHAPTER 4: Logic Circuits

MODULAR DIGITAL ELECTRONICS TRAINING SYSTEM

Sequential Elements con t Synchronous Digital Systems

Laboratory 1 - Introduction to Digital Electronics and Lab Equipment (Logic Analyzers, Digital Oscilloscope, and FPGA-based Labkit)

CHAPTER 4: Logic Circuits

Logic Design. Flip Flops, Registers and Counters

Synchronization in Asynchronously Communicating Digital Systems

TIME SCHEDULE. MODULE TOPICS PERIODS 1 Number system & Boolean algebra 17 Test I 1 2 Logic families &Combinational logic

CS6201 UNIT I PART-A. Develop or build the following Boolean function with NAND gate F(x,y,z)=(1,2,3,5,7).

An Efficient 64-Bit Carry Select Adder With Less Delay And Reduced Area Application

NH 67, Karur Trichy Highways, Puliyur C.F, Karur District DEPARTMENT OF INFORMATION TECHNOLOGY CS 2202 DIGITAL PRINCIPLES AND SYSTEM DESIGN

Lecture 1: Introduction to Digital Logic Design. CK Cheng CSE Dept. UC San Diego

AC103/AT103 ANALOG & DIGITAL ELECTRONICS JUN 2015

EE292: Fundamentals of ECE

UNIT-3: SEQUENTIAL LOGIC CIRCUITS

C8491 C8000 1/17. digital audio modular processing system. 3G/HD/SD-SDI DSP 4/8/16 audio channels. features. block diagram

Course Plan. Course Articulation Matrix: Mapping of Course Outcomes (COs) with Program Outcomes (POs) PSO-1 PSO-2

Design and FPGA Implementation of 100Gbit/s Scrambler Architectures for OTN Protocol Chethan Kumar M 1, Praveen Kumar Y G 2, Dr. M. Z. Kurian 3.

More Digital Circuits

EECS150 - Digital Design Lecture 12 - Video Interfacing. Recap and Outline

EECS150 - Digital Design Lecture 3 Synchronous Digital Systems Review. Announcements

Reconfigurable Architectures. Greg Stitt ECE Department University of Florida

Transcription:

Pushing the Communication Barrier in 2PC using Lookup Tables Ghada Dessouky*, Farinaz Koushanfar, Ahmad-Reza Sadeghi*, Thomas Schneider*, Shaza Zeitouni*, and Michael Zohner* *Technische Universität Darmstadt University of California, San Diego

Secure 2PC Page 2

Secure 2PC Page 2

Secure 2PC This work: semi-honest (passive) security Page 2

Applications of Secure 2PC Sugar Beet Auction [BCD+9] Page 3

Applications of Secure 2PC Sugar Beet Auction [BCD+9] Face Recognition [EFG+9] Page 3

Applications of Secure 2PC Sugar Beet Auction [BCD+9] Face Recognition [EFG+9] Blind En/Decryption [Dyadic] AES Page 3

Applications of Secure 2PC Sugar Beet Auction [BCD+9] Face Recognition [EFG+9] Blind En/Decryption [Dyadic] Stable Matching [DES6] AES Page 3

Generic Secure 2PC Two prominent techniques: Yao's protocol and GMW Both evaluate Boolean circuits securely XOR gates are free AND gates cost sym. crypto / comm. Difference: round complexity Yao is constant round GMW requires interaction per AND gate Page 4

Practical Improvements Million AND gates/sec Speed of 2PC Implementations 4 3 2 24 2 2 23 25 26 Currently: 3 million ANDs/s per thread, however: We have hit a comm. lower-bound per AND for Yao [ZRE5] Run-time for GMW often is mostly network latency Page 5

Lookup Tables Page 6

Our Contributions Develop lookup table (LUT)-based protocols Tool support for generating LUT circuits Evaluation and comparison (Paper: improve building blocks & comm. for GMW) Page 7

Lookup Table Protocols Page 8

oon Oblivious Transfer Bob obliviously obtains one of N messages s.t. Alice does not learn Bob's choice Bob does not learn Alice's other messages Most efficient protocol oon OT: [KK3] Page 9

Intuition of the Protocols Use [KK3] oon OT to perform table lookups Page

LUT Protocols We develop two LUT protocols based on [KK3] OT Online Phase LUT (OP-LUT) Setup Phase LUT (SP-LUT) Pre-Computation Communication [Bytes] Communication [Bytes] OP-LUT SP-LUT 2 3 4 5 6 7 8 Online OP-LUT SP-LUT 2 3 4 5 6 7 8 Number of Inputs Number of Inputs Page

Generating LUT Circuits Page 2

Tool Support for LUTs Generating LUT circuits is difficult and error-prone Automation is required Idea: FPGAs internally operate on single output LUTs Use ABC logic syntesis to generate single output LUTs Add post-processing to improve efficiency Page 3

Combining LUTs FPGAs only support single output LUTs We combine LUTs with similar inputs to improve efficiency SP-LUT Communication: 52 bits SP-LUT Communication: 38 bits Page 4

Extracting XORs Since XORs are free, we can extract them Example Page 5

Comparison Page 6

Communication Mostly: SP-LUT < GMW < OP-LUT < Yao Boolean circuits perform better for sequential structures LUT circuits perform best for tree based structures Page 7

Communication Mostly: SP-LUT < GMW < OP-LUT < Yao Boolean circuits perform better for sequential structures LUT circuits perform best for tree based structures Page 7

Communication Mostly: SP-LUT < GMW < OP-LUT < Yao Boolean circuits perform better for sequential structures LUT circuits perform best for tree based structures Page 7

Interaction Rounds Yao is constant round Mostly: SP-LUT < OP-LUT < GMW Exception: Multiplication with Ripple-carry addition Page 8

Interaction Rounds Yao is constant round Mostly: SP-LUT < OP-LUT < GMW Exception: Multiplication with Ripple-carry addition Page 8

Empirical Evaluation AES encryption of blocks using 4 threads LAN ( GBit network,.2 ms latency) WAN (28 MBit network, 22ms latency).8.6.4.2.8.6.4.2 AES Evaluations in WAN 6 5 Yao GMW OP-LUT SP-LUT Run-Time [s] Run-Time [s] AES Evaluations in LAN Yao GMW OP-LUT SP-LUT 4 3 2 Setup Online Total Setup Online Total Page 9

Conclusion Communication is bottleneck in 2PC Developed LUT protocols based on oon OT Tool chain for compiling LUT circuits Showed that LUT protocols can improve communication Page 2

Thank you for your attention Page 2

From oo2 OT to oon OT [IKNP3] [KK3] oo2 OT oon OT

Our Results

oon OT Extension [KK3]....... Hamming distance............ Codewords with HD..

From oo2 OT to oon OT oon OT can be obtained from logn oo2 OTs Example oo4:

From oon OT to oo2 OT ( Surprising insight: reducing oon OT to single bit oo2 OT saves communication,,, ) Best for N=6: Requires 32 bits instead of 52 bits