Resampling Statistics. Conventional Statistics. Resampling Statistics

Size: px
Start display at page:

Download "Resampling Statistics. Conventional Statistics. Resampling Statistics"

Transcription

1 Resampling Statistics Introduction to Resampling Probability Modeling Resample add-in Bootstrapping values, vectors, matrices R boot package Conclusions Conventional Statistics Assumptions of conventional statistics: - Variables are randomly sampled - Follow a normal distribution (Gaussian) Thus, the basis of conventional inference is that samples are drawn at random from a larger population and the observations in the sample are then presumed to reflect the population (e.g., mean & variance). Resampling Statistics In resampling statistics, statistical estimates are formed by taking random samples directly from the data at hand. In other words, you randomly sample your random sample!

2 Resampling Statistics - Key Features - 1. For small data sets, resampling procedures probably provide more accurate statistical answers than conventional statistics. 2. For large data sets, resampling answers and conventional answers usually agree. 3. Resampling can handle virtually any statistic, not just those for which a distribution is known. 4. Resampling typically generates accurate 95CIs. Resampling Statistics - Terminology - Resampling is a generic term which refers to a whole array of computer intensive methods for testing hypotheses based on Monte Carlo and resampling simulations. Bootstrapping and jackknifing represent the two most common forms applied to conventional statistical designs. This lecture will focus primarily on bootstrapping procedures. Resampling Statistics - References - These procedures have been around for a long time but have really only begun to be applied recently because of enhanced computer technology. Selected References: Efron, B The jackknife, the bootstrap, and other resampling plans. Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA. Simon, J.L Resampling: The new statistics, 2 nd ed. (online) Good, P.I Introduction to statistics through resampling methods and R/S-Plus. Wiley Interscience, New York, NY.

3 Probability Modeling Direct modeling of probabilities is the primary point of resampling statistics. Consider a simple coin flip example. A coin contains two outcomes: heads (1), tails (0) If you flip 100 times, the expectation is: 50:50 or half 1s and half 0s. Probability Modeling Consider a less trivial & more biological case of probabilities: In clutch sizes of 8, how often would you expect to see 3 males and 5 females (i.e., 3:5 ratio)? This can be modeled using a coin flip algorithm. Assume the probability of male vs. female is equal and independent of previous clutches. One can flip 8 coins, count the heads (males), and repeat this procedure many times.

4 Probability Modeling The only possible logistical difficulty in this is the many times part. Resampling statistical software is available in a variety of forms. A simple Excel add-in is available for $99 (academic pricing) or calculations can be done various ways in R. Let's first look at a simple using the Excel add-in to get the general idea using our clutch size data. We can mathematically flip a coin 8 times, determine how many males there are, and do this many, many times: Resampling Software Select Resample, input range A1:A2, place data in D1 in a group of 8 Resampling Software The result is 8 values of 0 or 1 placed in column D. Cell D9 contains the column sum (5 males for this one case of 8 flips). We need to do this 999 more times!

5 Resampling Software Click OK, then 2x click on this cell (will turn red when selected, then 2x Click on any empty cell), 1 score recorded. Resampling Software Next, click on RS (Repeat and Score), enter 1000 trials, click OK, go to output tab The sum (males) of 1000 groups of 8-flips are placed in A on output sheet Data are sorted high to low

6 Resampling Software Now, using the stats add-in from Excel, construct a histogram of the 1000 resamples. 3 males happens in 210 of 1000 clutches or 0.210, or ca. 1 in 5 clutches. Boot Package v SEP-11 The BOOT package is designed to provide extensive facilities for all forms of bootstrapping and resampling. One can bootstrap a simple statistic (e,g., median), a vector (e.g., regression weights), or an entire matrix. The main bootstrapping function is boot() and has the following format: Bootobject <- boot(data=, statistic=, R=,...) where, data = a vector, matrix, or dataframe statistic = a function that produces the k statistics to be bootstrapped (k=1 if bootstrapping a single statistic). The function should include an indicies parameter that the boot( ) function can use to select cases for each replication. R = the number of bootstrap replicates = additional parameters

7 Boot( ) calls the statistic function R times. Each time, it generates a set of random indices, with replacement. (Just like the resample Excel add-in.) These indices are used within the statistic function to select a sample. The statistics are calculated on the sample and the results accumulated in bootobject. The bootobject structure includes: t0 = The observed values of k statistics applied to the original data t = An R x k matrix where each row is a bootstrap replicate of the k statistics. You can access these as bootobject$t0 and bootobject$t Once the bootstrap samples have been generated, use print(bootobject) and plot(bootobject) to examine the results. boot.ci() can be used to obtain confidence intervals for the statistic(s). Let's load the library boot and use one of its datasets:...

8 We can try a standard linear model of mpg as a function of weight and displacement: > summary(reg) Call: lm(formula = mpg ~ wt + disp) Residuals: Min 1Q Median 3Q Max Coefficients: Estimate Std. Error t value Pr(> t ) (Intercept) e-16 *** wt ** disp Signif. codes: 0 *** ** 0.01 * Residual standard error: on 29 degrees of freedom Multiple R-squared: , Adjusted R-squared: F-statistic: on 2 and 29 DF, p-value: 2.744e-10

9 > results ORDINARY NONPARAMETRIC BOOTSTRAP Call: boot(data = mtcars, statistic = rsq, R = 1000, formula = mpg ~ wt + disp) Bootstrap Statistics : original bias std. error t1* > quartz(height=4,width=7) > plot(results) > boot.ci(results, type="bca") BOOTSTRAP CONFIDENCE INTERVAL CALCULATIONS Based on 1000 bootstrap replicates CALL : boot.ci(boot.out = results, type = "bca") Intervals : Level BCa 95% ( , ) Calculations and Intervals on Original Scale Some BCa intervals may be unstable

10 We can extend a single value bootstrap to an entire vector and continue with same example, but this time determine the model regression coefficients: > bsmodel <- function(formula, data, indices) { + d <- data[indices,] # allows boot to select sample + fit <- lm(formula, data=d) + return(coef(fit)) + } > results <- boot(data=mtcars, + statistic=bsmodel, + R=1000, formula=mpg~wt+disp) > results ORDINARY NONPARAMETRIC BOOTSTRAP Call: boot(data = mtcars, statistic = bs, R = 1000, formula = mpg ~ wt + disp) Bootstrap Statistics : original bias std. error t1* e t2* e t3* e > results$t [,1] [,2] [,3] [1,] e-02 [2,] e-02 [3,] e-02 [4,] e-03 [5,] e-02 [6,] e-02 [7,] e-02 [8,] e-02 [9,] e-02 [10,] e-03 [11,] e > results$t0 (Intercept) wt disp

11 > boot.ci(results, type="bca", index=1) # intercept BOOTSTRAP CONFIDENCE INTERVAL CALCULATIONS Based on 1000 bootstrap replicates CALL : boot.ci(boot.out = results, type = "bca", index = 1) Intervals : Level BCa 95% (29.83, ) Calculations and Intervals on Original Scale > boot.ci(results, type="bca", index=2) # wt > boot.ci(results, type="bca", index=3) # disp CarBoot.R Script File Resampling - Conclusions - Hopefully, by now, you can see that there is a very general principle here that can be applied to virtually any statistical design. Resampling via bootstrapping is a powerful tool in many statistical situations (cf. Chpt. 19 in W&S textbook). A nice overview of the concepts examined here can be found in: Efron, B Computer-intensive methods in statistics. Scientific American, May,

12

Fundamentals and applications of resampling methods for the analysis of speech production and perception data.

Fundamentals and applications of resampling methods for the analysis of speech production and perception data. Fundamentals and applications of resampling methods for the analysis of speech production and perception data. Olivier Crouzet 1 Laboratoire de Linguistique de Nantes (LLING UMR 6310, Université de Nantes

More information

Chapter 27. Inferences for Regression. Remembering Regression. An Example: Body Fat and Waist Size. Remembering Regression (cont.)

Chapter 27. Inferences for Regression. Remembering Regression. An Example: Body Fat and Waist Size. Remembering Regression (cont.) Chapter 27 Inferences for Regression Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 27-1 Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley An

More information

Bootstrap Methods in Regression Questions Have you had a chance to try any of this? Any of the review questions?

Bootstrap Methods in Regression Questions Have you had a chance to try any of this? Any of the review questions? ICPSR Blalock Lectures, 2003 Bootstrap Resampling Robert Stine Lecture 3 Bootstrap Methods in Regression Questions Have you had a chance to try any of this? Any of the review questions? Getting class notes

More information

DV: Liking Cartoon Comedy

DV: Liking Cartoon Comedy 1 Stepwise Multiple Regression Model Rikki Price Com 631/731 March 24, 2016 I. MODEL Block 1 Block 2 DV: Liking Cartoon Comedy 2 Block Stepwise Block 1 = Demographics: Item: Age (G2) Item: Political Philosophy

More information

Problem Points Score USE YOUR TIME WISELY USE CLOSEST DF AVAILABLE IN TABLE SHOW YOUR WORK TO RECEIVE PARTIAL CREDIT

Problem Points Score USE YOUR TIME WISELY USE CLOSEST DF AVAILABLE IN TABLE SHOW YOUR WORK TO RECEIVE PARTIAL CREDIT Stat 514 EXAM I Stat 514 Name (6 pts) Problem Points Score 1 32 2 30 3 32 USE YOUR TIME WISELY USE CLOSEST DF AVAILABLE IN TABLE SHOW YOUR WORK TO RECEIVE PARTIAL CREDIT WRITE LEGIBLY. ANYTHING UNREADABLE

More information

More About Regression

More About Regression Regression Line for the Sample Chapter 14 More About Regression is spoken as y-hat, and it is also referred to either as predicted y or estimated y. b 0 is the intercept of the straight line. The intercept

More information

STAT 113: Statistics and Society Ellen Gundlach, Purdue University. (Chapters refer to Moore and Notz, Statistics: Concepts and Controversies, 8e)

STAT 113: Statistics and Society Ellen Gundlach, Purdue University. (Chapters refer to Moore and Notz, Statistics: Concepts and Controversies, 8e) STAT 113: Statistics and Society Ellen Gundlach, Purdue University (Chapters refer to Moore and Notz, Statistics: Concepts and Controversies, 8e) Learning Objectives for Exam 1: Unit 1, Part 1: Population

More information

GLM Example: One-Way Analysis of Covariance

GLM Example: One-Way Analysis of Covariance Understanding Design and Analysis of Research Experiments An animal scientist is interested in determining the effects of four different feed plans on hogs. Twenty four hogs of a breed were chosen and

More information

Linear mixed models and when implied assumptions not appropriate

Linear mixed models and when implied assumptions not appropriate Mixed Models Lecture Notes By Dr. Hanford page 94 Generalized Linear Mixed Models (GLMM) GLMMs are based on GLM, extended to include random effects, random coefficients and covariance patterns. GLMMs are

More information

Blueline, Linefree, Accuracy Ratio, & Moving Absolute Mean Ratio Charts

Blueline, Linefree, Accuracy Ratio, & Moving Absolute Mean Ratio Charts INTRODUCTION This instruction manual describes for users of the Excel Standard Celeration Template(s) the features of each page or worksheet in the template, allowing the user to set up and generate charts

More information

ECONOMICS 351* -- INTRODUCTORY ECONOMETRICS. Queen's University Department of Economics. ECONOMICS 351* -- Winter Term 2005 INTRODUCTORY ECONOMETRICS

ECONOMICS 351* -- INTRODUCTORY ECONOMETRICS. Queen's University Department of Economics. ECONOMICS 351* -- Winter Term 2005 INTRODUCTORY ECONOMETRICS Queen's University Department of Economics ECONOMICS 351* -- Winter Term 2005 INTRODUCTORY ECONOMETRICS Winter Term 2005 Instructor: Web Site: Mike Abbott Office: Room A521 Mackintosh-Corry Hall or Room

More information

Subject-specific observed profiles of change from baseline vs week trt=10000u

Subject-specific observed profiles of change from baseline vs week trt=10000u Mean of age 1 The MEANS Procedure Analysis Variable : age N Mean Std Dev Minimum Maximum ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ 109 55.5321101 12.1255537 26.0000000 83.0000000

More information

Moving on from MSTAT. March The University of Reading Statistical Services Centre Biometrics Advisory and Support Service to DFID

Moving on from MSTAT. March The University of Reading Statistical Services Centre Biometrics Advisory and Support Service to DFID Moving on from MSTAT March 2000 The University of Reading Statistical Services Centre Biometrics Advisory and Support Service to DFID Contents 1. Introduction 3 2. Moving from MSTAT to Genstat 4 2.1 Analysis

More information

Mixed Models Lecture Notes By Dr. Hanford page 151 More Statistics& SAS Tutorial at Type 3 Tests of Fixed Effects

Mixed Models Lecture Notes By Dr. Hanford page 151 More Statistics& SAS Tutorial at  Type 3 Tests of Fixed Effects Assessing fixed effects Mixed Models Lecture Notes By Dr. Hanford page 151 In our example so far, we have been concentrating on determining the covariance pattern. Now we ll look at the treatment effects

More information

Mixed models in R using the lme4 package Part 2: Longitudinal data, modeling interactions

Mixed models in R using the lme4 package Part 2: Longitudinal data, modeling interactions Mixed models in R using the lme4 package Part 2: Longitudinal data, modeling interactions Douglas Bates 2011-03-16 Contents 1 sleepstudy 1 2 Random slopes 3 3 Conditional means 6 4 Conclusions 9 5 Other

More information

I. Model. Q29a. I love the options at my fingertips today, watching videos on my phone, texting, and streaming films. Main Effect X1: Gender

I. Model. Q29a. I love the options at my fingertips today, watching videos on my phone, texting, and streaming films. Main Effect X1: Gender 1 Hopewell, Sonoyta & Walker, Krista COM 631/731 Multivariate Statistical Methods Dr. Kim Neuendorf Film & TV National Survey dataset (2014) by Jeffres & Neuendorf MANOVA Class Presentation I. Model INDEPENDENT

More information

Lecture 10: Release the Kraken!

Lecture 10: Release the Kraken! Lecture 10: Release the Kraken! Last time We considered some simple classical probability computations, deriving the socalled binomial distribution -- We used it immediately to derive the mathematical

More information

TI-Inspire manual 1. Real old version. This version works well but is not as convenient entering letter

TI-Inspire manual 1. Real old version. This version works well but is not as convenient entering letter TI-Inspire manual 1 Newest version Older version Real old version This version works well but is not as convenient entering letter Instructions TI-Inspire manual 1 General Introduction Ti-Inspire for statistics

More information

COMP Test on Psychology 320 Check on Mastery of Prerequisites

COMP Test on Psychology 320 Check on Mastery of Prerequisites COMP Test on Psychology 320 Check on Mastery of Prerequisites This test is designed to provide you and your instructor with information on your mastery of the basic content of Psychology 320. The results

More information

STAT 250: Introduction to Biostatistics LAB 6

STAT 250: Introduction to Biostatistics LAB 6 STAT 250: Introduction to Biostatistics LAB 6 Dr. Kari Lock Morgan Sampling Distributions In this lab, we ll explore sampling distributions using StatKey: www.lock5stat.com/statkey. We ll be using StatKey,

More information

Algebra I Module 2 Lessons 1 19

Algebra I Module 2 Lessons 1 19 Eureka Math 2015 2016 Algebra I Module 2 Lessons 1 19 Eureka Math, Published by the non-profit Great Minds. Copyright 2015 Great Minds. No part of this work may be reproduced, distributed, modified, sold,

More information

Latin Square Design. Design of Experiments - Montgomery Section 4-2

Latin Square Design. Design of Experiments - Montgomery Section 4-2 Latin Square Design Design of Experiments - Montgomery Section 4-2 Latin Square Design Can be used when goal is to block on two nuisance factors Constructed so blocking factors orthogonal to treatment

More information

Hybrid resampling methods for confidence intervals: comment

Hybrid resampling methods for confidence intervals: comment Title Hybrid resampling methods for confidence intervals: comment Author(s) Lee, SMS; Young, GA Citation Statistica Sinica, 2000, v. 10 n. 1, p. 43-46 Issued Date 2000 URL http://hdl.handle.net/10722/45352

More information

GBA 327: Module 7D AVP Transcript Title: The Monte Carlo Simulation Using Risk Solver. Title Slide

GBA 327: Module 7D AVP Transcript Title: The Monte Carlo Simulation Using Risk Solver. Title Slide GBA 327: Module 7D AVP Transcript Title: The Monte Carlo Simulation Using Risk Solver Title Slide Narrator: Although the use of a data table illustrates how we can apply Monte Carlo simulation to a decision

More information

Reviews of earlier editions

Reviews of earlier editions Reviews of earlier editions Statistics in medicine ( 1997 by John Wiley & Sons, Ltd. Statist. Med., 16, 2627Ð2631 (1997) STATISTICS AT SQUARE ONE. Ninth Edition, revised by M. J. Campbell, T. D. V. Swinscow,

More information

THE USE OF RESAMPLING FOR ESTIMATING CONTROL CHART LIMITS

THE USE OF RESAMPLING FOR ESTIMATING CONTROL CHART LIMITS THE USE OF RESAMPLING FOR ESTIMATING CONTROL CHART LIMITS Draft of paper published in Journal of the Operational Research Society, 50, 651-659, 1999. Michael Wood, Michael Kaye and Nick Capon Management

More information

Reconstruction of Ca 2+ dynamics from low frame rate Ca 2+ imaging data CS229 final project. Submitted by: Limor Bursztyn

Reconstruction of Ca 2+ dynamics from low frame rate Ca 2+ imaging data CS229 final project. Submitted by: Limor Bursztyn Reconstruction of Ca 2+ dynamics from low frame rate Ca 2+ imaging data CS229 final project. Submitted by: Limor Bursztyn Introduction Active neurons communicate by action potential firing (spikes), accompanied

More information

Chapter 21. Margin of Error. Intervals. Asymmetric Boxes Interpretation Examples. Chapter 21. Margin of Error

Chapter 21. Margin of Error. Intervals. Asymmetric Boxes Interpretation Examples. Chapter 21. Margin of Error Context Part VI Sampling Accuracy of Percentages Previously, we assumed that we knew the contents of the box and argued about chances for the draws based on this knowledge. In survey work, we frequently

More information

User Guide. S-Curve Tool

User Guide. S-Curve Tool User Guide for S-Curve Tool Version 1.0 (as of 09/12/12) Sponsored by: Naval Center for Cost Analysis (NCCA) Developed by: Technomics, Inc. 201 12 th Street South, Suite 612 Arlington, VA 22202 Points

More information

Model II ANOVA: Variance Components

Model II ANOVA: Variance Components Model II ANOVA: Variance Components Model II MS A = s 2 + ns 2 A MS A MS W = ns 2 A (MS A MS W )/n = ns 2 A /n = s2 A Usually Expressed: s 2 A /(s2 A + s2 W ) x 100 Assumptions of ANOVA Random Sampling

More information

MANOVA/MANCOVA Paul and Kaila

MANOVA/MANCOVA Paul and Kaila I. Model MANOVA/MANCOVA Paul and Kaila From the Music and Film Experiment (Neuendorf et al.) Covariates (ONLY IN MANCOVA) X1 Music Condition Y1 E20 Contempt Y2 E21 Anticipation X2 Instrument Interaction

More information

RANDOMIZED COMPLETE BLOCK DESIGN (RCBD) Probably the most used and useful of the experimental designs.

RANDOMIZED COMPLETE BLOCK DESIGN (RCBD) Probably the most used and useful of the experimental designs. Description of the Design RANDOMIZED COMPLETE BLOCK DESIGN (RCBD) Probably the most used and useful of the experimental designs. Takes advantage of grouping similar experimental units into blocks or replicates.

More information

TWO-FACTOR ANOVA Kim Neuendorf 4/9/18 COM 631/731 I. MODEL

TWO-FACTOR ANOVA Kim Neuendorf 4/9/18 COM 631/731 I. MODEL 1 TWO-FACTOR ANOVA Kim Neuendorf 4/9/18 COM 631/731 I. MODEL Using the Humor and Public Opinion Data, a two-factor ANOVA was run, using the full factorial model: MAIN EFFECT: Political Philosophy (3 groups)

More information

Replicated Latin Square and Crossover Designs

Replicated Latin Square and Crossover Designs Replicated Latin Square and Crossover Designs Replicated Latin Square Latin Square Design small df E, low power If 3 treatments 2 df error If 4 treatments 6 df error Can use replication to increase df

More information

Release Year Prediction for Songs

Release Year Prediction for Songs Release Year Prediction for Songs [CSE 258 Assignment 2] Ruyu Tan University of California San Diego PID: A53099216 rut003@ucsd.edu Jiaying Liu University of California San Diego PID: A53107720 jil672@ucsd.edu

More information

UNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BioEpi 540W - Introduction to Biostatistics Fall 2002

UNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BioEpi 540W - Introduction to Biostatistics Fall 2002 1 UNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BioEpi 540W - Introduction to Biostatistics Fall 2002 Exercises Unit 2 Descriptive Statistics Tables and Graphs Due: Monday September

More information

Why t? TEACHER NOTES MATH NSPIRED. Math Objectives. Vocabulary. About the Lesson

Why t? TEACHER NOTES MATH NSPIRED. Math Objectives. Vocabulary. About the Lesson Math Objectives Students will recognize that when the population standard deviation is unknown, it must be estimated from the sample in order to calculate a standardized test statistic. Students will recognize

More information

Tutorial 0: Uncertainty in Power and Sample Size Estimation. Acknowledgements:

Tutorial 0: Uncertainty in Power and Sample Size Estimation. Acknowledgements: Tutorial 0: Uncertainty in Power and Sample Size Estimation Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements: The project was supported in large part by the National

More information

Normalization Methods for Two-Color Microarray Data

Normalization Methods for Two-Color Microarray Data Normalization Methods for Two-Color Microarray Data 1/13/2009 Copyright 2009 Dan Nettleton What is Normalization? Normalization describes the process of removing (or minimizing) non-biological variation

More information

Use black ink or black ball-point pen. Pencil should only be used for drawing. *

Use black ink or black ball-point pen. Pencil should only be used for drawing. * General Certificate of Education June 2009 Advanced Subsidiary Examination MATHEMATICS Unit Statistics 1B MS/SS1B STATISTICS Unit Statistics 1B Wednesday 20 May 2009 1.30 pm to 3.00 pm For this paper you

More information

Frequencies. Chapter 2. Descriptive statistics and charts

Frequencies. Chapter 2. Descriptive statistics and charts An analyst usually does not concentrate on each individual data values but would like to have a whole picture of how the variables distributed. In this chapter, we will introduce some tools to tabulate

More information

Reproducibility Assessment of Independent Component Analysis of Expression Ratios from DNA microarrays.

Reproducibility Assessment of Independent Component Analysis of Expression Ratios from DNA microarrays. Reproducibility Assessment of Independent Component Analysis of Expression Ratios from DNA microarrays. David Philip Kreil David J. C. MacKay Technical Report Revision 1., compiled 16th October 22 Department

More information

LAB 1: Plotting a GM Plateau and Introduction to Statistical Distribution. A. Plotting a GM Plateau. This lab will have two sections, A and B.

LAB 1: Plotting a GM Plateau and Introduction to Statistical Distribution. A. Plotting a GM Plateau. This lab will have two sections, A and B. LAB 1: Plotting a GM Plateau and Introduction to Statistical Distribution This lab will have two sections, A and B. Students are supposed to write separate lab reports on section A and B, and submit the

More information

MATH 214 (NOTES) Math 214 Al Nosedal. Department of Mathematics Indiana University of Pennsylvania. MATH 214 (NOTES) p. 1/3

MATH 214 (NOTES) Math 214 Al Nosedal. Department of Mathematics Indiana University of Pennsylvania. MATH 214 (NOTES) p. 1/3 MATH 214 (NOTES) Math 214 Al Nosedal Department of Mathematics Indiana University of Pennsylvania MATH 214 (NOTES) p. 1/3 CHAPTER 1 DATA AND STATISTICS MATH 214 (NOTES) p. 2/3 Definitions. Statistics is

More information

AP Statistics Sampling. Sampling Exercise (adapted from a document from the NCSSM Leadership Institute, July 2000).

AP Statistics Sampling. Sampling Exercise (adapted from a document from the NCSSM Leadership Institute, July 2000). AP Statistics Sampling Name Sampling Exercise (adapted from a document from the NCSSM Leadership Institute, July 2000). Problem: A farmer has just cleared a field for corn that can be divided into 100

More information

Multiple-point simulation of multiple categories Part 1. Testing against multiple truncation of a Gaussian field

Multiple-point simulation of multiple categories Part 1. Testing against multiple truncation of a Gaussian field Multiple-point simulation of multiple categories Part 1. Testing against multiple truncation of a Gaussian field Tuanfeng Zhang November, 2001 Abstract Multiple-point simulation of multiple categories

More information

International Comparison on Operational Efficiency of Terrestrial TV Operators: Based on Bootstrapped DEA and Tobit Regression

International Comparison on Operational Efficiency of Terrestrial TV Operators: Based on Bootstrapped DEA and Tobit Regression , pp.154-159 http://dx.doi.org/10.14257/astl.2015.92.32 International Comparison on Operational Efficiency of Terrestrial TV Operators: Based on Bootstrapped DEA and Tobit Regression Yonghee Kim 1,a, Jeongil

More information

Supplementary Figures Supplementary Figure 1 Comparison of among-replicate variance in invasion dynamics

Supplementary Figures Supplementary Figure 1 Comparison of among-replicate variance in invasion dynamics 1 Supplementary Figures Supplementary Figure 1 Comparison of among-replicate variance in invasion dynamics Scaled posterior probability densities for among-replicate variances in invasion speed (nine replicates

More information

Modeling memory for melodies

Modeling memory for melodies Modeling memory for melodies Daniel Müllensiefen 1 and Christian Hennig 2 1 Musikwissenschaftliches Institut, Universität Hamburg, 20354 Hamburg, Germany 2 Department of Statistical Science, University

More information

Sociology 7704: Regression Models for Categorical Data Instructor: Natasha Sarkisian

Sociology 7704: Regression Models for Categorical Data Instructor: Natasha Sarkisian OLS Regression Assumptions Sociology 7704: Regression Models for Categorical Data Instructor: Natasha Sarkisian A1. All independent variables are quantitative or dichotomous, and the dependent variable

More information

FPA (Focal Plane Array) Characterization set up (CamIRa) Standard Operating Procedure

FPA (Focal Plane Array) Characterization set up (CamIRa) Standard Operating Procedure FPA (Focal Plane Array) Characterization set up (CamIRa) Standard Operating Procedure FACULTY IN-CHARGE Prof. Subhananda Chakrabarti (IITB) SYSTEM OWNER Hemant Ghadi (ghadihemant16@gmail.com) 05 July 2013

More information

Western Statistics Teachers Conference 2000

Western Statistics Teachers Conference 2000 Teaching Using Ratios 13 Mar, 2000 Teaching Using Ratios 1 Western Statistics Teachers Conference 2000 March 13, 2000 MILO SCHIELD Augsburg College www.augsburg.edu/ppages/schield schield@augsburg.edu

More information

RCBD with Sampling Pooling Experimental and Sampling Error

RCBD with Sampling Pooling Experimental and Sampling Error RCBD with Sampling Pooling Experimental and Sampling Error As we had with the CRD with sampling, we will have a source of variation for sampling error. Calculation of the Experimental Error df is done

More information

The Effects of Study Condition Preference on Memory and Free Recall LIANA, MARISSA, JESSI AND BROOKE

The Effects of Study Condition Preference on Memory and Free Recall LIANA, MARISSA, JESSI AND BROOKE The Effects of Study Condition Preference on Memory and Free Recall LIANA, MARISSA, JESSI AND BROOKE Introduction -Salamè & Baddeley 1988 Presented nine digits on a computer screen for 750 milliseconds

More information

How Large a Sample? CHAPTER 24. Issues in determining sample size

How Large a Sample? CHAPTER 24. Issues in determining sample size 388 Resampling: The New Statistics CHAPTER 24 How Large a Sample? Issues in Determining Sample Size Some Practical Examples Step-Wise Sample-Size Determination Summary Issues in determining sample size

More information

Modelling Intervention Effects in Clustered Randomized Pretest/Posttest Studies. Ed Stanek

Modelling Intervention Effects in Clustered Randomized Pretest/Posttest Studies. Ed Stanek Modelling Intervention Effects in Clustered Randomized Pretest/Posttest Studies Introduction Ed Stanek We consider a study design similar to the design for the Well Women Project, and discuss analyses

More information

Paired plot designs experience and recommendations for in field product evaluation at Syngenta

Paired plot designs experience and recommendations for in field product evaluation at Syngenta Paired plot designs experience and recommendations for in field product evaluation at Syngenta 1. What are paired plot designs? 2. Analysis and reporting of paired plot designs 3. Case study 1 : analysis

More information

Tech Paper. HMI Display Readability During Sinusoidal Vibration

Tech Paper. HMI Display Readability During Sinusoidal Vibration Tech Paper HMI Display Readability During Sinusoidal Vibration HMI Display Readability During Sinusoidal Vibration Abhilash Marthi Somashankar, Paul Weindorf Visteon Corporation, Michigan, USA James Krier,

More information

EE373B Project Report Can we predict general public s response by studying published sales data? A Statistical and adaptive approach

EE373B Project Report Can we predict general public s response by studying published sales data? A Statistical and adaptive approach EE373B Project Report Can we predict general public s response by studying published sales data? A Statistical and adaptive approach Song Hui Chon Stanford University Everyone has different musical taste,

More information

MATH& 146 Lesson 11. Section 1.6 Categorical Data

MATH& 146 Lesson 11. Section 1.6 Categorical Data MATH& 146 Lesson 11 Section 1.6 Categorical Data 1 Frequency The first step to organizing categorical data is to count the number of data values there are in each category of interest. We can organize

More information

Creating a Feature Vector to Identify Similarity between MIDI Files

Creating a Feature Vector to Identify Similarity between MIDI Files Creating a Feature Vector to Identify Similarity between MIDI Files Joseph Stroud 2017 Honors Thesis Advised by Sergio Alvarez Computer Science Department, Boston College 1 Abstract Today there are many

More information

Import and quantification of a micro titer plate image

Import and quantification of a micro titer plate image BioNumerics Tutorial: Import and quantification of a micro titer plate image 1 Aims BioNumerics can import character type data from TIFF images. This happens by quantification of the color intensity and/or

More information

Predicting the Importance of Current Papers

Predicting the Importance of Current Papers Predicting the Importance of Current Papers Kevin W. Boyack * and Richard Klavans ** kboyack@sandia.gov * Sandia National Laboratories, P.O. Box 5800, MS-0310, Albuquerque, NM 87185, USA rklavans@mapofscience.com

More information

Programs. onevent("can", "mousedown", function(event) { var x = event.x; var y = event.y; circle( x, y, 10 ); });

Programs. onevent(can, mousedown, function(event) { var x = event.x; var y = event.y; circle( x, y, 10 ); }); Loops and Canvas Programs AP CSP Program 1. Draw something like the figure shown. There should be: a blue sky with no black outline a green field with no black outline a yellow sun with a black outline

More information

Discipline of Economics, University of Sydney, Sydney, NSW, Australia PLEASE SCROLL DOWN FOR ARTICLE

Discipline of Economics, University of Sydney, Sydney, NSW, Australia PLEASE SCROLL DOWN FOR ARTICLE This article was downloaded by: [University of Sydney] On: 30 March 2010 Access details: Access Details: [subscription number 777157963] Publisher Routledge Informa Ltd Registered in England and Wales

More information

Discriminant Analysis. DFs

Discriminant Analysis. DFs Discriminant Analysis Chichang Xiong Kelly Kinahan COM 631 March 27, 2013 I. Model Using the Humor and Public Opinion Data Set (Neuendorf & Skalski, 2010) IVs: C44 reverse coded C17 C22 C23 C27 reverse

More information

Proceedings of the 2010 Winter Simulation Conference B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan, and E. Yücesan, eds.

Proceedings of the 2010 Winter Simulation Conference B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan, and E. Yücesan, eds. Proceedings of the 2010 Winter Simulation Conference B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan, and E. Yücesan, eds. STATE ESTIMATION OF A SUPPLY CHAIN USING IMPROVED RESAMPLING RULES FOR PARTICLE

More information

Topic 11. Score-Informed Source Separation. (chroma slides adapted from Meinard Mueller)

Topic 11. Score-Informed Source Separation. (chroma slides adapted from Meinard Mueller) Topic 11 Score-Informed Source Separation (chroma slides adapted from Meinard Mueller) Why Score-informed Source Separation? Audio source separation is useful Music transcription, remixing, search Non-satisfying

More information

The following content is provided under a Creative Commons license. Your support

The following content is provided under a Creative Commons license. Your support MITOCW Lecture 17 The following content is provided under a Creative Commons license. Your support will help MIT OpenCourseWare continue to offer high quality educational resources for free. To make a

More information

LabView Exercises: Part II

LabView Exercises: Part II Physics 3100 Electronics, Fall 2008, Digital Circuits 1 LabView Exercises: Part II The working VIs should be handed in to the TA at the end of the lab. Using LabView for Calculations and Simulations LabView

More information

Package ForImp. R topics documented: February 19, Type Package. Title Imputation of Missing Values Through a Forward Imputation.

Package ForImp. R topics documented: February 19, Type Package. Title Imputation of Missing Values Through a Forward Imputation. Type Package Package ForImp February 19, 2015 Title Imputation of Missing s Through a Forward Imputation Algorithm Version 1.0.3 Date 2014-11-24 Author Alessandro Barbiero, Pier Alda Ferrari, Giancarlo

More information

Incorporation of Escorting Children to School in Individual Daily Activity Patterns of the Household Members

Incorporation of Escorting Children to School in Individual Daily Activity Patterns of the Household Members Incorporation of ing Children to School in Individual Daily Activity Patterns of the Household Members Peter Vovsha, Surabhi Gupta, Binny Paul, PB Americas Vladimir Livshits, Petya Maneva, Kyunghwi Jeon,

More information

Sector sampling. Nick Smith, Kim Iles and Kurt Raynor

Sector sampling. Nick Smith, Kim Iles and Kurt Raynor Sector sampling Nick Smith, Kim Iles and Kurt Raynor Partly funded by British Columbia Forest Science Program, Canada; Western Forest Products, Canada with support from ESRI Canada What do sector samples

More information

in the Howard County Public School System and Rocketship Education

in the Howard County Public School System and Rocketship Education Technical Appendix May 2016 DREAMBOX LEARNING ACHIEVEMENT GROWTH in the Howard County Public School System and Rocketship Education Abstract In this technical appendix, we present analyses of the relationship

More information

Statistical Consulting Topics. RCBD with a covariate

Statistical Consulting Topics. RCBD with a covariate Statistical Consulting Topics RCBD with a covariate Goal: to determine the optimal level of feed additive to maximize the average daily gain of steers. VARIABLES Y = Average Daily Gain of steers for 160

More information

Best Pat-Tricks on Model Diagnostics What are they? Why use them? What good do they do?

Best Pat-Tricks on Model Diagnostics What are they? Why use them? What good do they do? Best Pat-Tricks on Model Diagnostics What are they? Why use them? What good do they do? Before we get started feel free to download the presentation and file(s) being used for today s webinar. http://www.statease.com/webinar.html

More information

Exercises. ASReml Tutorial: B4 Bivariate Analysis p. 55

Exercises. ASReml Tutorial: B4 Bivariate Analysis p. 55 Exercises Coopworth data set - see Reference manual Five traits with varying amounts of data. No depth of pedigree (dams not linked to sires) Do univariate analyses Do bivariate analyses. Use COOP data

More information

NENS 230 Assignment #2 Data Import, Manipulation, and Basic Plotting

NENS 230 Assignment #2 Data Import, Manipulation, and Basic Plotting NENS 230 Assignment #2 Data Import, Manipulation, and Basic Plotting Compound Action Potential Due: Tuesday, October 6th, 2015 Goals Become comfortable reading data into Matlab from several common formats

More information

SECTION I. THE MODEL. Discriminant Analysis Presentation~ REVISION Marcy Saxton and Jenn Stoneking DF1 DF2 DF3

SECTION I. THE MODEL. Discriminant Analysis Presentation~ REVISION Marcy Saxton and Jenn Stoneking DF1 DF2 DF3 Discriminant Analysis Presentation~ REVISION Marcy Saxton and Jenn Stoneking COM 631/731--Multivariate Statistical Methods Instructor: Prof. Kim Neuendorf (k.neuendorf@csuohio.edu) Cleveland State University,

More information

Comparison of Mixed-Effects Model, Pattern-Mixture Model, and Selection Model in Estimating Treatment Effect Using PRO Data in Clinical Trials

Comparison of Mixed-Effects Model, Pattern-Mixture Model, and Selection Model in Estimating Treatment Effect Using PRO Data in Clinical Trials Comparison of Mixed-Effects Model, Pattern-Mixture Model, and Selection Model in Estimating Treatment Effect Using PRO Data in Clinical Trials Xiaolei Zhou, 1,2 Jianmin Wang, 1 Jessica Zhang, 1 Hongtu

More information

Factors affecting enhanced video quality preferences

Factors affecting enhanced video quality preferences Factors affecting enhanced video quality preferences PremNandhini Satgunam, Russell L Woods, P Matthew Bronstad, Eli Peli, Member, IEEE Abstract - The development of video quality metrics requires methods

More information

4.1 GENERATION OF VIGNETTE TEXTS & RANDOM VIGNETTE SAMPLES

4.1 GENERATION OF VIGNETTE TEXTS & RANDOM VIGNETTE SAMPLES 4_vignettetextspdf 41 GENERATION OF VIGNETTE TEXTS & RANDOM VIGNETTE SAMPLES [Accompanying material for: Katrin Auspurg & Thomas Hinz (2015): Factorial Survey Experiments Sage Series: Quantitative Applications

More information

K ABC Mplus CFA Model. Syntax file (kabc-mplus.inp) Data file (kabc-mplus.dat)

K ABC Mplus CFA Model. Syntax file (kabc-mplus.inp) Data file (kabc-mplus.dat) K ABC Mplus CFA Model Syntax file (kabc-mplus.inp) title: principles and practice of sem (4th ed.), rex kline two-factor model of the kabc-i, figure 9.7, table 13.1 data: file is "kabc-mplus.dat"; type

More information

Chapter 5. Describing Distributions Numerically. Finding the Center: The Median. Spread: Home on the Range. Finding the Center: The Median (cont.

Chapter 5. Describing Distributions Numerically. Finding the Center: The Median. Spread: Home on the Range. Finding the Center: The Median (cont. Chapter 5 Describing Distributions Numerically Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide

More information

Homework Packet Week #5 All problems with answers or work are examples.

Homework Packet Week #5 All problems with answers or work are examples. Lesson 8.1 Construct the graphical display for each given data set. Describe the distribution of the data. 1. Construct a box-and-whisker plot to display the number of miles from school that a number of

More information

Singer Traits Identification using Deep Neural Network

Singer Traits Identification using Deep Neural Network Singer Traits Identification using Deep Neural Network Zhengshan Shi Center for Computer Research in Music and Acoustics Stanford University kittyshi@stanford.edu Abstract The author investigates automatic

More information

STAT 503 Case Study: Supervised classification of music clips

STAT 503 Case Study: Supervised classification of music clips STAT 503 Case Study: Supervised classification of music clips 1 Data Description This data was collected by Dr Cook from her own CDs. Using a Mac she read the track into the music editing software Amadeus

More information

abc Mark Scheme Statistics 3311 General Certificate of Secondary Education Higher Tier 2007 examination - June series

abc Mark Scheme Statistics 3311 General Certificate of Secondary Education Higher Tier 2007 examination - June series abc General Certificate of Secondary Education Statistics 3311 Higher Tier Mark Scheme 2007 examination - June series Mark schemes are prepared by the Principal Examiner and considered, together with the

More information

F1000 recommendations as a new data source for research evaluation: A comparison with citations

F1000 recommendations as a new data source for research evaluation: A comparison with citations F1000 recommendations as a new data source for research evaluation: A comparison with citations Ludo Waltman and Rodrigo Costas Paper number CWTS Working Paper Series CWTS-WP-2013-003 Publication date

More information

Copyright is owned by the Author of the thesis. Permission is given for a copy to be downloaded by an individual for the purpose of research and

Copyright is owned by the Author of the thesis. Permission is given for a copy to be downloaded by an individual for the purpose of research and Copyright is owned by the Author of the thesis. Permission is given for a copy to be downloaded by an individual for the purpose of research and private study only. The thesis may not be reproduced elsewhere

More information

What is Statistics? 13.1 What is Statistics? Statistics

What is Statistics? 13.1 What is Statistics? Statistics 13.1 What is Statistics? What is Statistics? The collection of all outcomes, responses, measurements, or counts that are of interest. A portion or subset of the population. Statistics Is the science of

More information

Estimating. Proportions with Confidence. Chapter 10. Copyright 2006 Brooks/Cole, a division of Thomson Learning, Inc.

Estimating. Proportions with Confidence. Chapter 10. Copyright 2006 Brooks/Cole, a division of Thomson Learning, Inc. Estimating Chapter 10 Proportions with Confidence Copyright 2006 Brooks/Cole, a division of Thomson Learning, Inc. Principal Idea: Survey 150 randomly selected students and 41% think marijuana should be

More information

Mixed Effects Models Yan Wang, Bristol-Myers Squibb, Wallingford, CT

Mixed Effects Models Yan Wang, Bristol-Myers Squibb, Wallingford, CT PharmaSUG 2016 - Paper PO06 Mixed Effects Models Yan Wang, Bristol-Myers Squibb, Wallingford, CT ABSTRACT The MIXED procedure has been commonly used at the Bristol-Myers Squibb Company for quality of life

More information

AmbDec User Manual. Fons Adriaensen

AmbDec User Manual. Fons Adriaensen AmbDec - 0.4.2 User Manual Fons Adriaensen fons@kokkinizita.net Contents 1 Introduction 3 1.1 Computing decoder matrices............................. 3 2 Installing and running AmbDec 4 2.1 Installing

More information

Sampler Overview. Statistical Demonstration Software Copyright 2007 by Clifford H. Wagner

Sampler Overview. Statistical Demonstration Software Copyright 2007 by Clifford H. Wagner Sampler Overview Statistical Demonstration Software Copyright 2007 by Clifford H. Wagner (w44@psu.edu) Introduction The philosophy behind Sampler is that students learn mathematics and statistics more

More information

A STATISTICAL VIEW ON THE EXPRESSIVE TIMING OF PIANO ROLLED CHORDS

A STATISTICAL VIEW ON THE EXPRESSIVE TIMING OF PIANO ROLLED CHORDS A STATISTICAL VIEW ON THE EXPRESSIVE TIMING OF PIANO ROLLED CHORDS Mutian Fu 1 Guangyu Xia 2 Roger Dannenberg 2 Larry Wasserman 2 1 School of Music, Carnegie Mellon University, USA 2 School of Computer

More information

How to present your paper in correct APA style

How to present your paper in correct APA style APA STYLE (6 th edition) 1 How to present your paper in correct APA style Julie F. Pallant This document provides a brief overview of how to prepare a journal article or research paper following the guidelines

More information

Static Timing Analysis for Nanometer Designs

Static Timing Analysis for Nanometer Designs J. Bhasker Rakesh Chadha Static Timing Analysis for Nanometer Designs A Practical Approach 4y Spri ringer Contents Preface xv CHAPTER 1: Introduction / 1.1 Nanometer Designs 1 1.2 What is Static Timing

More information

PROC GLM AND PROC MIXED CODES FOR TREND ANALYSES FOR ROW-COLUMN DESIGNED EXPERIMENTS

PROC GLM AND PROC MIXED CODES FOR TREND ANALYSES FOR ROW-COLUMN DESIGNED EXPERIMENTS PROC GLM AND PROC MIXED CODES FOR TREND ANALYSES FOR ROW-COLUMN DESIGNED EXPERIMENTS BU-1491-M June,2000 Walter T. Federer Dept. of Biometrics Cornell University Ithaca, NY 14853 wtfl@cornell.edu and Russell

More information

BPS 7th Grade Pre-Algebra Revised summer 2014 Year at a Glance Unit Standards Practices Days

BPS 7th Grade Pre-Algebra Revised summer 2014 Year at a Glance Unit Standards Practices Days BPS 7th Grade Pre-Algebra Revised summer 2014 Year at a Glance Unit Standards Practices Days 1 All Operations with Integers 7.NS.1, 7.NS.2, 7.NS.3 1,4,6,8 7 2 All Operations with Rational Numbers 7.NS.1c,

More information