Chapter 3. Averages and Variation

Similar documents
Chapter 5. Describing Distributions Numerically. Finding the Center: The Median. Spread: Home on the Range. Finding the Center: The Median (cont.

Chapter 6. Normal Distributions

Lesson 7: Measuring Variability for Skewed Distributions (Interquartile Range)

Measuring Variability for Skewed Distributions

Box Plots. So that I can: look at large amount of data in condensed form.

Frequencies. Chapter 2. Descriptive statistics and charts

STAT 113: Statistics and Society Ellen Gundlach, Purdue University. (Chapters refer to Moore and Notz, Statistics: Concepts and Controversies, 8e)

Algebra I Module 2 Lessons 1 19

Lesson 7: Measuring Variability for Skewed Distributions (Interquartile Range)

Chapter 1 Midterm Review

9.2 Data Distributions and Outliers

NAA ENHANCING THE QUALITY OF MARKING PROJECT: THE EFFECT OF SAMPLE SIZE ON INCREASED PRECISION IN DETECTING ERRANT MARKING

MATH 214 (NOTES) Math 214 Al Nosedal. Department of Mathematics Indiana University of Pennsylvania. MATH 214 (NOTES) p. 1/3

Math 7 /Unit 07 Practice Test: Collecting, Displaying and Analyzing Data

What can you tell about these films from this box plot? Could you work out the genre of these films?

Comparing Distributions of Univariate Data

Distribution of Data and the Empirical Rule

Dot Plots and Distributions

Normalization Methods for Two-Color Microarray Data

AP Statistics Sampling. Sampling Exercise (adapted from a document from the NCSSM Leadership Institute, July 2000).

Homework Packet Week #5 All problems with answers or work are examples.

COMP Test on Psychology 320 Check on Mastery of Prerequisites

abc Mark Scheme Statistics 3311 General Certificate of Secondary Education Higher Tier 2007 examination - June series

Sample Analysis Design. Element2 - Basic Software Concepts (cont d)

Collecting Data Name:

Sampler Overview. Statistical Demonstration Software Copyright 2007 by Clifford H. Wagner

11, 6, 8, 7, 7, 6, 9, 11, 9

UNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BioEpi 540W - Introduction to Biostatistics Fall 2002

THE USE OF RESAMPLING FOR ESTIMATING CONTROL CHART LIMITS

Chapter 27. Inferences for Regression. Remembering Regression. An Example: Body Fat and Waist Size. Remembering Regression (cont.)

The One Penny Whiteboard

Estimation of inter-rater reliability

What is Statistics? 13.1 What is Statistics? Statistics

Visual Encoding Design

Statistics for Engineers

Sampling Plans. Sampling Plan - Variable Physical Unit Sample. Sampling Application. Sampling Approach. Universe and Frame Information

Bootstrap Methods in Regression Questions Have you had a chance to try any of this? Any of the review questions?

Use black ink or black ball-point pen. Pencil should only be used for drawing. *

The impact of sound technology on the distribution of shot lengths in motion pictures

1. MORTALITY AT ADVANCED AGES IN SPAIN MARIA DELS ÀNGELS FELIPE CHECA 1 COL LEGI D ACTUARIS DE CATALUNYA

Lecture 10: Release the Kraken!

Bioconductor s marray package: Plotting component

UNIVERSITY OF CAMBRIDGE INTERNATIONAL EXAMINATIONS General Certificate of Education Ordinary Level

Chapter 2 Describing Data: Frequency Tables, Frequency Distributions, and

More About Regression

STAT 250: Introduction to Biostatistics LAB 6

Agilent Feature Extraction Software (v10.7)

Good playing practice when drumming: Influence of tempo on timing and preparatory movements for healthy and dystonic players

AGAINST ALL ODDS EPISODE 22 SAMPLING DISTRIBUTIONS TRANSCRIPT

(Week 13) A05. Data Analysis Methods for CRM. Electronic Commerce Marketing

Libraries as Repositories of Popular Culture: Is Popular Culture Still Forgotten?

Sector sampling. Nick Smith, Kim Iles and Kurt Raynor

EXPLORING DISTRIBUTIONS

Chapter 4. Displaying Quantitative Data. Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley

NETFLIX MOVIE RATING ANALYSIS

MEASURING EMERGING SCIENTIFIC IMPACT AND CURRENT RESEARCH TRENDS: A COMPARISON OF ALTMETRIC AND HOT PAPERS INDICATORS

in the Howard County Public School System and Rocketship Education

Navigate to the Journal Profile page

MURDOCH RESEARCH REPOSITORY

6 th Grade Semester 2 Review 1) It cost me $18 to make a lamp, but I m selling it for $45. What was the percent of increase in price?

For the SIA. Applications of Propagation Delay & Skew tool. Introduction. Theory of Operation. Propagation Delay & Skew Tool

Graphical Displays of Univariate Data

Does the number of users rating the movie accurately predict the average user rating?

Psychoacoustic Evaluation of Fan Noise

Sitting through commercials: How commercial break timing and duration affect viewership

Sociology 7704: Regression Models for Categorical Data Instructor: Natasha Sarkisian

Blueline, Linefree, Accuracy Ratio, & Moving Absolute Mean Ratio Charts

Rhythm Rounds. Joyce Ma. January 2003

Math 81 Graphing. Cartesian Coordinate System Plotting Ordered Pairs (x, y) (x is horizontal, y is vertical) center is (0,0) Quadrants:

Supplementary Figures Supplementary Figure 1 Comparison of among-replicate variance in invasion dynamics

ANALYSING DIFFERENCES BETWEEN THE INPUT IMPEDANCES OF FIVE CLARINETS OF DIFFERENT MAKES

Why visualize data? Advanced GDA and Software: Multivariate approaches, Interactive Graphics, Mondrian, iplots and R. German Bundestagswahl 2005

A Comparison of Relative Gain Estimation Methods for High Radiometric Resolution Pushbroom Sensors

CYBERTOOLS IN RESEARCH

User Guide. S-Curve Tool

MATH 214 (NOTES) Math 214 Al Nosedal. Department of Mathematics Indiana University of Pennsylvania. MATH 214 (NOTES) p. 1/11

Why t? TEACHER NOTES MATH NSPIRED. Math Objectives. Vocabulary. About the Lesson

STAT 503 Case Study: Supervised classification of music clips

Faculty of Environmental Engineering, The University of Kitakyushu,Hibikino, Wakamatsu, Kitakyushu , Japan

Monday 15 May 2017 Afternoon Time allowed: 1 hour 30 minutes

Level 1 Mathematics and Statistics, 2011

1996 Yampi Shelf, Browse Basin Airborne Laser Fluorosensor Survey Interpretation Report [WGC Browse Survey Number ]

Modeling memory for melodies

Reviews of earlier editions

Object selectivity of local field potentials and spikes in the macaque inferior temporal cortex

Notes Unit 8: Dot Plots and Histograms

Moving on from MSTAT. March The University of Reading Statistical Services Centre Biometrics Advisory and Support Service to DFID

Sources of Error in Time Interval Measurements

Detecting Medicaid Data Anomalies Using Data Mining Techniques Shenjun Zhu, Qiling Shi, Aran Canes, AdvanceMed Corporation, Nashville, TN

6 ~ata-ink Maximization and Graphical Design

AMusical Instrument Sample Database of Isolated Notes

AP Statistics Sec 5.1: An Exercise in Sampling: The Corn Field

Statistics For Dummies PDF

Practice makes less imperfect: the effects of experience and practice on the kinetics and coordination of flutists' fingers

Open access press vs traditional university presses on Amazon

ECONOMICS 351* -- INTRODUCTORY ECONOMETRICS. Queen's University Department of Economics. ECONOMICS 351* -- Winter Term 2005 INTRODUCTORY ECONOMETRICS

Discriminant Analysis. DFs

STUDIES on visual aesthetics have gained an increasing

1.1 Common Graphs and Data Plots

Mathematics in Contemporary Society Chapter 11

Transcription:

Chapter 3 Averages and Variation Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania

Measures of Central Tendency We use the term average to indicate one number that gives a measure of center for a population or sample. This text investigates three averages : Mode Median Mean Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 2

Mode The mode is the most frequently occurring value in a data set. Example: Sixteen students are asked how many college math classes they have completed. {0, 3, 2, 2, 1, 1, 0, 5, 1, 1, 0, 2, 2, 7, 1, 3} The mode is 1 Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 3

Median Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 4

Mean Read x-bar Read mu Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 5

Trimmed Mean Order the data and remove k% of the data values from the bottom and top. 5% and 10% trimmed means are common. Then simply compute the mean with the remaining data values. Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 6

Resistant Measures of Central Tendency A resistant measure will not be affected by extreme values in the data set. The mean is not resistant to extreme values. The median is resistant to extreme values. A trimmed mean is also resistant. Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 7

Critical Thinking Four levels of data nominal, ordinal, interval, ratio Mode can be used with all four levels. Median may be used with ordinal level or above. Mean may be used with interval or ratio level Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 8

Critical Thinking Mound-shaped symmetrical data values of mean, median and mode are almost same. Skewed-left data mean < median < mode. Skewed-right data mean > median > mode. Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 9

Weighted Average At times, we may need to assign more importance to some of the data values. Weighted Average xw w x is a data value. w is the weight assigned to that value. Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 10

Measures of Variation: Range Range = Largest value smallest value Only two data values are used in the computation, so much of the information in the data is lost. Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 11

Sample Variance and Standard Deviation Sample Variance = s 2 = n i 1 ( x i n 1 x) 2 Sample Standard Deviation = s = s 2 Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 12

Population Variance and Standard Deviation Population Variance = 2 N i 1 ( xi ) N 2 Population Standard Deviation = 2 Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 13

The Coefficient of Variation For Samples For Populations s CV 100 CV 100 x Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 14

Chebyshev s Theorem Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 15

Chebyshev s Theorem Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 16

Critical Thinking Standard deviation or variance, along with the mean, gives a better picture of the data distribution. Chebyshev s theorem works for all kinds of data distribution. Data values beyond 2.5 standard deviations from the mean may be considered as outliers. Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 17

Percentiles and Quartiles For whole numbers P, 1 P 99, the P th percentile of a distribution is a value such that P% of the data fall below it, and (100-P)% of the data fall at or above it. Q 1 = 25 th Percentile Q 2 = 50 th Percentile = The Median Q 3 = 75 th Percentile Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 18

Quartiles and Interquartile Range (IQR) Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 19

Computing Quartiles Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 20

Five Number Summary A listing of the following statistics: Minimum, Q 1, Median, Q 3, Maximum Box-and-Whisder plot represents the fivenumber summary graphically. Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 21

Box-and-Whisker Plot Construction Tutorial de como hacer un Boxplot http://math.uprag.edu/boxplot/boxplot.htm Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 22

Critical Thinking Box-and-whisker plots display the spread of data about the median. If the median is centered and the whiskers are about the same length, then the data distribution is symmetric around the median. Fences may be placed on either side of the box. Values lie beyond the fences are outliers. Copyright Houghton Mifflin Harcourt Publishing Company. All rights reserved. 3 23