Principles of Data Visualization. Jeffrey University of Washington

Similar documents
CSE Data Visualization. Graphical Perception. Jeffrey Heer University of Washington

Visual Encoding Design

Graphical Perception. Graphical Perception. Which best encodes quantities?

Graphical Perception. Graphical Perception. Graphical Perception. Which best encodes quantities? Jeffrey Heer Stanford University

Data Visualization (CIS 468)

CSE Data Visualization. Color. Jeffrey Heer University of Washington

The theory of data visualisation

Escaping RGBland: Selecting Colors for Statistical Graphics

Color in Information Visualization

DICOM Correction Item

Statistics for Engineers

Tradeoffs in information graphics 1. Andrew Gelman 2 and Antony Unwin Oct 2012

MODE FIELD DIAMETER AND EFFECTIVE AREA MEASUREMENT OF DISPERSION COMPENSATION OPTICAL DEVICES

Visualizing Social Networks

Why Publish in Journals? How to write a technical paper. How about Theses and Reports? Where Should I Publish? General Considerations: Tone and Style

Recap of Last (Last) Week

DATA VISUALIZATION BE A VISUAL NINJA APRIL 7, 2015

UNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BioEpi 540W - Introduction to Biostatistics Fall 2002

What do you really do in a literature review? Studying the Comparative Politics of Public. Education

S. S. Stevens papers,

6 ~ata-ink Maximization and Graphical Design

Students will understand that inferences may be supported using evidence from the text. that explicit textual evidence can be accurately cited.

Relationships Between Quantitative Variables

Case Study: Can Video Quality Testing be Scripted?

Relationships. Between Quantitative Variables. Chapter 5. Copyright 2006 Brooks/Cole, a division of Thomson Learning, Inc.

arxiv: v1 [cs.dl] 20 May 2016

Statistical Consulting Topics. RCBD with a covariate

Tutorial 0: Uncertainty in Power and Sample Size Estimation. Acknowledgements:

STUDENT: TEACHER: DATE: 2.5

Monitor and Display Adapters UNIT 4

Zombie Makeup Artist plugin Control layout

Speech Recognition and Signal Processing for Broadcast News Transcription

Somewhere over the Rainbow How to Make Effective Use of Colors in Statistical Graphics

Class 1: Motivation, Signals, Systems, Policies

E X P E R I M E N T 1

High Efficiency Video coding Master Class. Matthew Goldman Senior Vice President TV Compression Technology Ericsson

BEAMAGE 3.0 KEY FEATURES BEAM DIAGNOSTICS PRELIMINARY AVAILABLE MODEL MAIN FUNCTIONS. CMOS Beam Profiling Camera

Supplemental Material: Color Compatibility From Large Datasets

The Definition of 'db' and 'dbm'

The software concept. Try yourself and experience how your processes are significantly simplified. You need. weqube.

CASE HISTORY#3 COOLING TOWER GEARBOX BEARING FAULT. Barry T. Cease Cease Industrial Consulting

1. Structure of the paper: 2. Title

JBL f s New Differential Drive Transducers for VerTec Subwoofer Applications:

Barbara Tversky. using space to represent space and meaning

Fairfield Public Schools English Curriculum

Page I-ix / Lab Notebooks, Lab Reports, Graphs, Parts Per Thousand Information on Lab Notebooks, Lab Reports and Graphs

Richard B. Haynes Philip J. Muniz Douglas C. Smith

Measurement User Guide

Visual Arts and Language Arts. Complementary Learning

With prompting and support, ask and answer questions about key details in a text. Grade 1 Ask and answer questions about key details in a text.

Research Article. ISSN (Print) *Corresponding author Shireen Fathima

Signal Stability Analyser

Beautiful Evidence: A Journey through the Mind of Edward Tufte Stephen Few August 8, 2006

Sheffield Softworks. Copyright 2015 Sheffield Softworks

Vannevar Bush: As We May Think

Why t? TEACHER NOTES MATH NSPIRED. Math Objectives. Vocabulary. About the Lesson

colors AN INTRODUCTION TO USING COLORS FOR UNITY v1.1

STAT 113: Statistics and Society Ellen Gundlach, Purdue University. (Chapters refer to Moore and Notz, Statistics: Concepts and Controversies, 8e)

An Exploration of Great Cinema using Information Visualization Platforms (March 2008)

BPS Interim Assessments SY Grade 2 ELA

MATH 214 (NOTES) Math 214 Al Nosedal. Department of Mathematics Indiana University of Pennsylvania. MATH 214 (NOTES) p. 1/3

Short Set. The following musical variables are indicated in individual staves in the score:

Keep your broadcast clear.

Correlation to Common Core State Standards Books A-F for Grade 5

Visual Explanations: Images And Quantities, Evidence And Narrative By Edward R. Tufte

POL 572 Multivariate Political Analysis

The software concept. Try yourself and experience how your processes are significantly simplified. You need. weqube.

PRODUCT FAMILY DATASHEET LEDVANCE SPOT LED

Disney Broadway Magic. DISNEY PERFORMING ARTS WORKSHOPS WITH CORRESPONDING NATIONAL CORE ARTS STANDARDS (Click below to review the standards)

Bring out the Best in Pixels Video Pipe in Intel Processor Graphics

Understanding Human Color Vision

Characteristics of the Text Genre Informational Text: Biography Text Structure

MANOVA COM 631/731 Spring 2017 M. DANIELS. From Jeffres & Neuendorf (2015) Film and TV Usage National Survey

Machine-learning and R in plastic surgery Classification and attractiveness of facial emotions

WHAT IS GRAPHIC DESIGN? It is difficult to define something that is both a moving target and a ubiquitous part of our culture.

Analysis of Film Revenues: Saturated and Limited Films Megan Gold

NETFLIX MOVIE RATING ANALYSIS

Auditions Workshop: Musical Theatre

Cite. Infer. to determine the meaning of something by applying background knowledge to evidence found in a text.

Title characteristics and citations in economics

Computer and Machine Vision

1 Introduction Steganography and Steganalysis as Empirical Sciences Objective and Approach Outline... 4

Creating Color Combos

Congratulations to the Bureau of Labor Statistics for Creating an Excellent Graph By Jeffrey A. Shaffer 12/16/2011

semi-automated scanning

Findings from Indiana Flashing Yellow Arrow Study. Robert A. Rescot, Ph.D., P.E.

PRODUCT FAMILY DATASHEET LEDVANCE SPOT LED

The APA Style Converter: A Web-based interface for converting articles to APA style for publication

GRADE 2. NOTE: Relevant Georgia Performance Standards in Fine Arts (based on The National Standards for Arts Education) are also listed.

College and Career Readiness Anchor Standards K-12 Montana Common Core Reading Standards (CCRA.R)

Implementation and performance analysis of convolution error correcting codes with code rate=1/2.

COLOR AND COLOR SPACES ABSTRACT

Chapter 3. Averages and Variation

Achieve Accurate Critical Display Performance With Professional and Consumer Level Displays

Processes for the Intersection

Milestone Leverages Intel Processors with Intel Quick Sync Video to Create Breakthrough Capabilities for Video Surveillance and Monitoring

For the SIA. Applications of Propagation Delay & Skew tool. Introduction. Theory of Operation. Propagation Delay & Skew Tool

Curriculum Standard One: The student will use his/her senses to perceive works of art, objects in nature, events, and the environment.

TECHNICAL SUPPLEMENT FOR THE DELIVERY OF PROGRAMMES WITH HIGH DYNAMIC RANGE

ZX-44XL Liquid Fuel Analyzer. User s Manual Version 1.2

Transcription:

Principles of Data Visualization Jeffrey Heer @jeffrey_heer University of Washington

Data Analysis & Statistics, Tukey & Wilk 1966

Four major influences act on data analysis today: 1. The formal theories of statistics. 2. Accelerating developments in computers and display devices. 3. The challenge, in many fields, of more and larger bodies of data. 4. The emphasis on quantification in a wider variety of disciplines. Data Analysis & Statistics, Tukey & Wilk 1966

While some of the influences of statistical theory on data analysis have been helpful, others have not. Data Analysis & Statistics, Tukey & Wilk 1966

Exposure, the effective laying open of the data to display the unanticipated, is to us a major portion of data analysis It is not clear how the informality and flexibility appropriate to the exploratory character of exposure can be fitted into any of the structures of formal statistics so far proposed. Data Analysis & Statistics, Tukey & Wilk 1966

Set A Set B Set C Set D X Y X Y X Y X Y 10 8.04 10 9.14 10 7.46 8 6.58 8 6.95 8 8.14 8 6.77 8 5.76 13 7.58 13 8.74 13 12.74 8 7.71 9 8.81 9 8.77 9 7.11 8 8.84 11 8.33 11 9.26 11 7.81 8 8.47 14 9.96 14 8.1 14 8.84 8 7.04 6 7.24 6 6.13 6 6.08 8 5.25 4 4.26 4 3.1 4 5.39 19 12.5 12 10.84 12 9.11 12 8.15 8 5.56 7 4.82 7 7.26 7 6.42 8 7.91 5 5.68 5 4.74 5 5.73 8 6.89 Summary Statistics Linear Regression u X = 9.0 σ X = 3.317 Y 2 = 3 + 0.5 X u Y = 7.5 σ Y = 2.03 R 2 = 0.67 Anscombe 1973

Set A Set B Y Set C Set D Y X X

Wikipedia History Flow [Viégas & Wattenberg 04]

d3.js Data-Driven Documents with Mike Bostock, Vadim Ogievetsky [InfoVis 11]

d3 d3

What makes a visualization good?

Design Principles [Mackinlay 86] Expressiveness A set of facts is expressible in a visual language if the sentences (i.e. the visualizations) in the language express all the facts in the set of data, and only the facts in the data. Effectiveness A visualization is more effective than another visualization if the information conveyed by one visualization is more readily perceived than the information in the other visualization.

Design Principles [Mackinlay 86] Expressiveness A set of facts is expressible in a visual language if the sentences (i.e. the visualizations) in the language express all the facts in the set of data, and only the facts in the data. Effectiveness A visualization is more effective than another visualization if the information conveyed by one visualization is more readily perceived than the information in the other visualization.

Expresses Facts Not in the Data A length is interpreted as a quantitative value.

Design Principles [Mackinlay 86] Expressiveness A set of facts is expressible in a visual language if the sentences (i.e. the visualizations) in the language express all the facts in the set of data, and only the facts in the data. Effectiveness A visualization is more effective than another visualization if the information conveyed by one visualization is more readily perceived than the information in the other visualization.

Design Principles [Mackinlay 86] Expressiveness A set of facts is expressible in a visual language if the sentences (i.e. the visualizations) in the language express all the facts in the set of data, and only the facts in the data. Effectiveness A visualization is more effective than another visualization if the information conveyed by one visualization is more readily perceived than the information in the other visualization.

Design Principles [Tversky 02] Congruence The structure and content of the external representation should correspond to the desired structure and content of the internal representation. Apprehension The structure and content of the external representation should be readily and accurately perceived and comprehended.

Design Principles Translated Tell the truth and nothing but the truth (don t lie, and don t lie by omission) Use encodings that people decode better (where better = more accurate and/or faster)

A quick experiment

Compare area of circles

Compare length of bars

Steven s Power Law Exponent (Empirically Determined) Perceived Sensation Physical Intensity Graph from Wilkinson 99, based on Stevens 61

Graphical Perception [Cleveland & McGill 84]

Position 1 Position 2 Position 3 Length 1 Length 2 Angle Area (Circular) Area (Rect 1) Area (Rect 2) Log Absolute Estimation Error Graphical Perception Experiments Empirical estimates of encoding effectiveness

Comparing Two Quantities Most accurate Position (common) scale Position (non-aligned) scale Length Slope Angle Area Volume Least accurate Color hue-saturation-density

Effectiveness Rankings [Mackinlay 86] QUANTITATIVE ORDINAL NOMINAL Position Position Position Length Density (Value) Color Hue Angle Color Sat Texture Slope Color Hue Connection Area (Size) Texture Containment Volume Connection Density (Value) Density (Value) Containment Color Sat Color Sat Length Shape Color Hue Angle Length Texture Slope Angle Connection Area (Size) Slope Containment Volume Area Shape Shape Volume

Effectiveness Rankings [Mackinlay 86] QUANTITATIVE ORDINAL NOMINAL Position Position Position Length Density (Value) Color Hue Angle Color Sat Texture Slope Color Hue Connection Area (Size) Texture Containment Volume Connection Density (Value) Density (Value) Containment Color Sat Color Sat Length Shape Color Hue Angle Length Texture Slope Angle Connection Area (Size) Slope Containment Volume Area Shape Shape Volume

Effectiveness Rankings [Mackinlay 86] QUANTITATIVE ORDINAL NOMINAL Position Position Position Length Density (Value) Color Hue Angle Color Sat Texture Slope Color Hue Connection Area (Size) Texture Containment Volume Connection Density (Value) Density (Value) Containment Color Sat Color Sat Length Shape Color Hue Angle Length Texture Slope Angle Connection Area (Size) Slope Containment Volume Area Shape Shape Volume

Gene Expression Time-Series [Meyer et al 11] Color Encoding Position Encoding

Artery Visualization [Borkin et al 11] Rainbow Palette Diverging Palette 62% 92% 2D 39% 71% 3D

Additional Resources The Visual Display of Quantitative Information. Edward Tufte. Show Me the Numbers. Stephen Few. Visualizing Data. William S. Cleveland. Perception for Design. Colin Ware.

Principles of Data Visualization Jeffrey Heer @jeffrey_heer http://idl.cs.washington.edu