Soc 195B 04W EXAMPLES OF DIRECT STANDARDIZATION

Similar documents
d. Could you represent the profit for n copies in other different ways?

Frequencies. Chapter 2. Descriptive statistics and charts

MATH 214 (NOTES) Math 214 Al Nosedal. Department of Mathematics Indiana University of Pennsylvania. MATH 214 (NOTES) p. 1/3

The Relationship Between Movie theater Attendance and Streaming Behavior. Survey Findings. December 2018

Sunday Maximum All TV News Big Four Average Saturday

Using DICTION. Some Basics. Importing Files. Analyzing Texts

Measuring Variability for Skewed Distributions

DOWNLOAD PDF BOWKER ANNUAL LIBRARY AND TRADE ALMANAC 2005

Media Xpress by TAM Media Research INDEX. 1. How has a particular channel been performing over the chosen time period(quarter/month/week)

Western Statistics Teachers Conference 2000

The RTDNA/Hofstra University Annual Survey found that 2009 meant another year of TV

Task-based Activity Cover Sheet

What's New in Journal Citation Reports?

Chapter 4. Logic Design

Virtual Vibration Analyzer

Human Hair Studies: II Scale Counts

Chapter 1 Midterm Review

Citation analysis: Web of science, scopus. Masoud Mohammadi Golestan University of Medical Sciences Information Management and Research Network

Cited Publications 1 (ISI Indexed) (6 Apr 2012)

Some features of children s composing in a computer-based environment. Figure 1. Screenshot of the MelodyMaker application

48 TV Caller ID TV CALLER ID

Concise Papers. Comprehensive Citation Index for Research Networks 1 INTRODUCTION 2 COMPREHENSIVE CITATION INDEX

Before the Federal Communications Commission Washington, D.C ) ) ) ) ) ) ) ) ) REPORT ON CABLE INDUSTRY PRICES

Information Networks

Distribution of Data and the Empirical Rule

Lesson 7: Measuring Variability for Skewed Distributions (Interquartile Range)

Telephone calls and the Brontosaurus Adam Atkinson

What is Statistics? 13.1 What is Statistics? Statistics

technical note flicker measurement display & lighting measurement

EDL8 Race Dash Manual Engine Management Systems

Improvised Duet Interaction: Learning Improvisation Techniques for Automatic Accompaniment

First Year Evaluation Report for PDAE Grant Accentuating Music, Language and Cultural Literacy through Kodály Inspired Instruction

Supplementary Note. Supplementary Table 1. Coverage in patent families with a granted. all patent. Nature Biotechnology: doi: /nbt.

THE JOURNAL OF POULTRY SCIENCE: AN ANALYSIS OF CITATION PATTERN

What is bibliometrics?

Training Note TR-06RD. Schedules. Schedule types

CPSC 121: Models of Computation. Module 1: Propositional Logic

Case Study: Can Video Quality Testing be Scripted?

Patchmaster. Elektronik. The Pulse generator. February 2013

DMX Channel Index. Revision F Covers firmware version

*On-Line appendix for non-tables, by Margo Schlanger

A variable bandwidth broadcasting protocol for video-on-demand

Nielsen Examines TV Viewers to the Political Conventions. September 2008

Authority Control -- Key Takeaways & Reminders

CURRICULUM CHECK SHEET COLLEGE OF THE ARTS ARIZONA STATE UNIVERSITY

Semi-supervised Musical Instrument Recognition

The Relationship Between Movie Theatre Attendance and Streaming Behavior. Survey insights. April 24, 2018

Citation Analysis of Doctoral Theses in the field of Sociology submitted to Panjab University, Chandigarh (India) during

Static Timing Analysis for Nanometer Designs

Box Plots. So that I can: look at large amount of data in condensed form.

homework solutions for: Homework #4: Signal-to-Noise Ratio Estimation submitted to: Dr. Joseph Picone ECE 8993 Fundamentals of Speech Recognition

Gain/Attenuation Settings in RTSA P, 418 and 427

Upper Iowa University-Academic Extension and Lakeshore Technical College (WI) Course-to-Course Articulation. October 2009

Cambridge International Examinations Cambridge International General Certificate of Secondary Education. Paper 1 May/June hours 30 minutes

Proofed Paper: ntp Mon Jan 30 23:05:28 EST 2017

CALIFORNIA GERMANY TRAVEL TRADE BAROMETER

AMTR Report For Destiny Users

LSVT LOUD Assessment Packet Initial Interview Evaluation Protocol Stimulability Protocol Follow-up Questions Perceptual Scales

EE201: Transmission Line Protection

University of California at Berkeley College of Engineering Department of Electrical Engineering and Computer Science

University of California at Berkeley College of Engineering Department of Electrical Engineering and Computer Science. EECS 150 Spring 2000

Swept-tuned spectrum analyzer. Gianfranco Miele, Ph.D

Chapter 40: MIDI Tool

1. MORTALITY AT ADVANCED AGES IN SPAIN MARIA DELS ÀNGELS FELIPE CHECA 1 COL LEGI D ACTUARIS DE CATALUNYA

RF Safety Surveys At Broadcast Sites: A Basic Guide

Older People and Television Viewing in Japan

University of La Verne and Glendale Community College General Education Transfer Agreement Plan Track I

Technical Theater Certificate

Configuration options allow the trader to select up to 12 different ratios, the color, style, width and label contents can be configured too.

Lab experience 1: Introduction to LabView

Syddansk Universitet. The data sharing advantage in astrophysics Dorch, Bertil F.; Drachen, Thea Marie; Ellegaard, Ole

E X P E R I M E N T 1

Phone-based Plosive Detection

2013 Environmental Monitoring, Evaluation, and Protection (EMEP) Citation Analysis

This document is a reference document that shows the menus in the 5500sc, 9610sc and 9611sc analyzers. There are 3 top-level menus:

Publish or Perish in the Internet Age

Character Users Guide

The Public and Its Problems

2018 RTDNA/Hofstra University Newsroom Survey

Bibliometric Rankings of Journals Based on the Thomson Reuters Citations Database

The Power of Terrestrial Radio in Puerto Rico. Presented by: Brad LaRock Arbitron June 2012

Blueline, Linefree, Accuracy Ratio, & Moving Absolute Mean Ratio Charts

SIDRA INTERSECTION 8.0 UPDATE HISTORY

The Urbana Free Library Patron Survey. Final Report

Program Outcomes and Assessment

Complementary bibliometric analysis of the Health and Welfare (HV) research specialisation

VIRTUAL NETWORKING AND CITATION ANALYSIS

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

THE LOCAL TV NEWS HOUSEHOLD AUDIENCE THE LOCAL TV

ThinkTV FACT PACK NEW ZEALAND JAN TO DEC 2017

Composer Style Attribution

GRADE 6: Performance Task

DESI WULANDARI A

DPhil SR953 RESEARCH FOR 21ST CENTURY SCHOLARSHIP. Supplement Joy Montgomery and David Ward, Oxford Graduate School (Thanks to Allison Shade)

The DataView PowerPad III Control Panel

CHEMICAL TECHNOLOGY LITERATURE: AN OBSOLESCENCE STUDY.

LOW POWER AND HIGH PERFORMANCE SHIFT REGISTERS USING PULSED LATCH TECHNIQUE

Keywords: Publications, Citation Impact, Scholarly Productivity, Scopus, Web of Science, Iran.

Dana Hunt Second-year undergraduate Millikin University, USA

Impact Factor COMMUN ANAL GEOM >10.0 >10.0. Cited Journal Citing Journal Source Data Journal Self Cites

Transcription:

Sociology Department William M. Mason UCLA Winter, 2004 Soc 195B 04W EXAMPLES OF DIRECT STANDARDIZATION This annotated Chip output file illustrates the use of the Standardize command using a file named status98_freq.chp that has the same information as status98.chp. (The file named status98.chp is formatted in such a way that the standardization command does not work with it, but other commands do work with it.) I have placed a self-extracting file named status98_freq.sfx on the course web site. It may be found in the Index of Course Materials. 1. To begin this session I opened a log file named status98_21oct02.log. 2. I next opened the file status98_freq.chp. N = 2195 3. Here is the result of requesting the tabulation of respondent s income by father s occupational prestige. Clearly there is an association between the two variables. Income/Papres $35K+ 32.8 34.0 44.2 37.3 $17.5K-$ 43.1 42.6 39.8 41.7 <$17.5K 24.1 23.4 16.1 21.0 100%= 677 721 797 N = 2195 4. I next issued commands to percentage respondent s income by father s occupational prestige, controlling for respondent s education. This makes for a very large table, consisting of four conditional subtables: Ed = <12yrs $35K+ 20.3 14.7 7.7 16.0 $17.5K-$ 40.5 41.3 42.3 41.1 <$17.5K 39.2 44.0 50.0 42.9 100%= 74 75 26 N = 175 D:\courses\soc195b_04w\handout\standardization_status98_2feb04.doc Page 1 of 8

Ed = 12yrs $35K+ 23.5 21.7 29.3 24.3 $17.5K-$ 47.8 49.3 45.9 47.8 <$17.5K 28.7 29.0 24.8 27.9 100%= 268 221 157 N = 646 Ed = 13-15yrs $35K+ 32.3 26.4 38.6 32.6 $17.5K-$ 45.2 49.1 42.6 45.5 <$17.5K 22.6 24.5 18.8 21.9 100%= 155 163 176 N = 494 Ed = 16+yrs $35K+ 52.2 54.6 53.9 53.8 $17.5K-$ 35.6 33.2 36.3 35.2 <$17.5K 12.2 12.2 9.8 11.0 100%= 180 262 438 N = 880 5. It is difficult to summarize, much less interpret such a complex table, yet it is highly desirable to do so. Fortunately, there are tools to deal with this situation. The simplest of these is known as direct standardization. We will henceforth refer to this tool as standardization, since we will not be using variants such as indirect standardization. We will use standardization to form a particular type of weighted average over conditional subtables, so that we can summarization controlled associations between pairs of variables. Before explaining the calculations, I want to illustrate how you can use Chip to produce standardized results. Although you can t see it in this transcript file, after extracting the income by father s occupational prestige table within each educational level (i.e., controlling for education), I issued the standardize command. That produced the following output: File Causal order: Ed -> Inco* -> Papr* -> Regi* -> Race* -> Sex* -> Age* -> Marr* -> Sibs* 4 x 3 x 3 x 4 x 2 x 2 x 3 x 3 x 3 N = 2197 6. The above output should be read as telling us that until we tell it otherwise, Chip will standardize all associations for educations (note the lack of an asterisk next to Ed ). (Notice that, unfortunately, the N is 2,197 D:\courses\soc195b_04w\handout\standardization_status98_2feb04.doc Page 2 of 8

and not 2,195. This is due to internal rounding errors in Chip. In the mathematics of standardization, the N stays exactly the same.) Still, Chip will do nothing until you give it the next command. The command is not printed in the log file, but it was simply frequency, which produced the following table of frequencies standardized for education: Low Medium High 100%= $35K+ 254 248 317 820 $17.5K-$ 282 303 331 917 <$17.5K 147 163 149 461 100%= 684 715 798 N = 2197 7. I presented these frequencies so that you could see that the number of respondents remains the same under standardization. The number of individuals in each father s occupational prestige category also remains the same. Next I asked for percent down : $35K+ 37.2 34.7 39.8 37.3 $17.5K-$ 41.2 42.4 41.5 41.7 <$17.5K 21.6 22.9 18.7 21.0 100%= 684 715 798 N = 2197 8. The above percentage table summarizes the association between respondent s income and father s occupational prestige, controlling respondent s education. It shows that there is virtually no association. 9. Now let s consider what happens to the income by education association, controlling father s occupational prestige. First, let s remind ourselves of the uncontrolled association. To obtain that, I need to mouse through standard restore. Then I issue the commands for the uncontrolled table. Income/Ed $35K+ 16.0 24.3 32.6 53.8 37.3 $17.5K-$ 41.1 47.8 45.5 35.2 41.7 <$17.5K 42.9 27.9 21.9 11.0 21.0 100%= 175 646 494 880 N = 2195 D:\courses\soc195b_04w\handout\standardization_status98_2feb04.doc Page 3 of 8

10. Here comes the table controlling for father s occupational prestige: Papres = Low $35K+ 20.3 23.5 32.3 52.2 32.8 $17.5K-$ 40.5 47.8 45.2 35.6 43.1 <$17.5K 39.2 28.7 22.6 12.2 24.1 100%= 74 268 155 180 N = 677 Papres = Medium $35K+ 14.7 21.7 26.4 54.6 34.0 $17.5K-$ 41.3 49.3 49.1 33.2 42.6 <$17.5K 44.0 29.0 24.5 12.2 23.4 100%= 75 221 163 262 N = 721 Papres = High $35K+ 7.7 29.3 38.6 53.9 44.2 $17.5K-$ 42.3 45.9 42.6 36.3 39.8 <$17.5K 50.0 24.8 18.8 9.8 16.1 100%= 26 157 176 438 N = 797 11. Again, such a table is difficult to summarize and interpret. We standardize on father s occupational prestige. File Causal order: Papre -> Inco* -> Ed* -> Regi* -> Race* -> Sex* -> Age* -> Marr* -> Sibs* 3 x 3 x 4 x 4 x 2 x 2 x 3 x 3 x 3 N = 2195 12. Next, we percentage the table that Chip has been holding in memory for us: $35K+ 17.8 25.3 32.6 52.6 37.3 $17.5K-$ 40.9 47.5 45.4 35.6 41.7 <$17.5K 41.3 27.2 22.0 11.8 21.0 100%= 173 645 496 881 N = 2195 D:\courses\soc195b_04w\handout\standardization_status98_2feb04.doc Page 4 of 8

13. The standardized table can and should be compared to the original, uncontrolled table: Income/Ed $35K+ 16.0 24.3 32.6 53.8 37.3 $17.5K-$ 41.1 47.8 45.5 35.2 41.7 <$17.5K 42.9 27.9 21.9 11.0 21.0 100%= 175 646 494 880 N = 2195 14. Standardization becomes even more helpful when we want to control for more than one variable. Consider the income by education association, controlling for race and region. To obtain the 4-way tabulation, I need to restore the data to their pre-standardized state. Then I issue the appropriate table, control, and percentaging commands: Race = White Region = Northeast $35K+ 21.2 29.0 43.2 53.1 41.2 $17.5K-$ 39.4 50.0 45.9 35.2 42.0 <$17.5K 39.4 21.0 10.8 11.7 16.8 100%= 33 100 74 145 N = 352 Race = White Region = Midwest $35K+ 12.5 22.7 33.6 55.6 37.2 $17.5K-$ 45.8 49.7 45.7 34.3 42.8 <$17.5K 41.7 27.6 20.7 10.1 20.0 100%= 24 181 116 198 N = 519 Race = White Region = South $35K+ 13.4 24.1 25.3 52.4 33.9 $17.5K-$ 46.3 49.2 47.9 37.8 44.4 <$17.5K 40.3 26.6 26.7 9.8 21.7 100%= 67 199 146 246 N = 658 D:\courses\soc195b_04w\handout\standardization_status98_2feb04.doc Page 5 of 8

Race = White Region = West $35K+ 18.2 29.8 42.2 55.9 45.7 $17.5K-$ 36.4 44.0 38.9 32.4 36.4 <$17.5K 45.5 26.2 18.9 11.7 17.9 100%= 22 84 90 222 N = 418 Race = Black Region = Northeast $35K+ 33.3 18.8 27.3 16.7 21.4 $17.5K-$ 33.3 50.0 45.5 66.7 52.4 <$17.5K 33.3 31.3 27.3 16.7 26.2 100%= 3 16 11 12 N = 42 Race = Black Region = Midwest $35K+ 33.3 11.1 22.7 66.7 32.8 $17.5K-$ 16.7 50.0 45.5 22.2 37.5 <$17.5K 50.0 38.9 31.8 11.1 29.7 100%= 6 18 22 18 N = 64 Race = Black Region = South $35K+ 11.1 21.4 22.6 42.4 25.8 $17.5K-$ 27.8 33.3 48.4 42.4 38.7 <$17.5K 61.1 45.2 29.0 15.2 35.5 100%= 18 42 31 33 N = 124 Race = Black Region = West $35K+.0.0.0 83.3 27.8 $17.5K-$ 100.0 50.0 75.0.0 44.4 <$17.5K.0 50.0 25.0 16.7 27.8 100%= 2 6 4 6 N = 18 D:\courses\soc195b_04w\handout\standardization_status98_2feb04.doc Page 6 of 8

15. Next I standardize on race and region. File Causal order: Race -> Regio -> Inco* -> Ed* -> Sex* -> Age* -> Marr* -> Papr* -> Sibs* 2 x 4 x 3 x 4 x 2 x 3 x 3 x 3 x 3 N = 2195 16. And then percentage the standardized table. $35K+ 16.9 25.0 32.7 53.0 37.3 $17.5K-$ 40.8 47.5 45.8 35.4 41.7 <$17.5K 42.3 27.5 21.5 11.6 21.0 100%= 174 645 492 883 N = 2195 17. Now let s look at the income/race association uncontrolled: Income/Race $35K+ 38.6 27.0 37.3 $17.5K-$ 41.8 41.1 41.7 <$17.5K 19.6 31.9 21.0 100%= 1947 248 N = 2195 18. Controlling for education: Ed = <12yrs $35K+ 15.8 17.2 16.0 $17.5K-$ 43.2 31.0 41.1 <$17.5K 41.1 51.7 42.9 100%= 146 29 N = 175 Ed = 12yrs $35K+ 25.4 17.1 24.3 $17.5K-$ 48.8 41.5 47.8 <$17.5K 25.9 41.5 27.9 100%= 564 82 N = 646 D:\courses\soc195b_04w\handout\standardization_status98_2feb04.doc Page 7 of 8

Ed = 13-15yrs $35K+ 34.3 22.1 32.6 $17.5K-$ 45.1 48.5 45.5 <$17.5K 20.7 29.4 21.9 100%= 426 68 N = 494 Ed = 16+yrs $35K+ 54.3 47.8 53.8 $17.5K-$ 35.0 37.7 35.2 <$17.5K 10.7 14.5 11.0 100%= 811 69 N = 880 19. Standardizing on education: File Causal order: Ed -> Inco* -> Race* -> Regi* -> Sex* -> Age* -> Marr* -> Papr* -> Sibs* 4 x 3 x 2 x 4 x 2 x 3 x 3 x 3 x 3 N = 2195 20. Percentaging the standardized association: $35K+ 38.2 30.0 37.3 $17.5K-$ 41.8 40.9 41.7 <$17.5K 19.9 29.1 21.0 100%= 1951 244 N = 2195 D:\courses\soc195b_04w\handout\standardization_status98_2feb04.doc Page 8 of 8