Lecture 1: Course logistics, homework 0

Similar documents
15-415: Database Applications. Project 1: Querying the MovieLens Database

NETFLIX MOVIE RATING ANALYSIS

Description of Variables

Toronto Alliance for the Performing Arts

IMDB Movie Review Analysis

1) New Paths to New Machine Learning Science. 2) How an Unruly Mob Almost Stole. Jeff Howbert University of Washington

Class 1: Motivation, Signals, Systems, Policies

A combination of approaches to solve Task How Many Ratings? of the KDD CUP 2007

Music 4 - Exploring Music Fall 2016

S E A S O N. Proudly Supporting FSU College of Music Students

SPONSORSHIP OPPORTUNITIES OPENING SEASON

APPENDIX I. MARKETING OF LIBRARY AND INFORMATION PRODUCTS AND SERVICES IN ACADEMIC LIBRARIES OF UTTARAKHAND: A STUDY (Questionnaire for Librarian)

LIBRARY. Preble County District Library Annual Report. Preble County District

PERCUSSION CAMP HANDBOOK July 9-13, 2018

MEASURING EMERGING SCIENTIFIC IMPACT AND CURRENT RESEARCH TRENDS: A COMPARISON OF ALTMETRIC AND HOT PAPERS INDICATORS

Netflix & American Latinos: An Integrated Marketing Communications Plan. Anthony Morelle Jose Velez Borbon Melissa Greco Lopes

Full Page Ads. Against the Grain. Volume 28 Issue 3 Article 2

Theatre Arts I-IV. Course Overview & Syllabus Course Information:

COLLECTION DEVELOPMENT

Research Evaluation Metrics. Gali Halevi, MLS, PhD Chief Director Mount Sinai Health System Libraries Assistant Professor Department of Medicine

THE SPOTLIGHT Main Productions Lunchbox Series

CSE 166: Image Processing. Overview. Representing an image. What is an image? History. What is image processing? Today. Image Processing CSE 166

Measuring Your Research Impact: Citation and Altmetrics Tools

Music 4 - Exploring Music Fall 2015

STRATEGIC PARTNERSHIPS. Audiences at the 38th San Francisco Jewish Film Festival Opening Night screening at the Castro Theatre

Proudly supporting students in the College of Music S E A S O N F L O R I D A S T A T E U N I V E R S I T Y

WORLD ASSOCIATION FOR SYMPHONIC BANDS AND ENSEMBLES GUIDELINES FOR PERFORMING ENSEMBLES AT WASBE INTERNATIONAL CONFERENCES

photo: GretjenHelene.com Serving and supporting early music professionals and enthusiasts since 1985.

SHREK the Musical: Information, Audition Requirements, and Rehearsal Schedule

2nd MICHELANGELO INTERNATIONAL MUSIC FESTIVAL 17th - 19th April 2018 FLORENCE, ITALY

Event Services & Fees

VBM683 Machine Learning

Publishing your paper

AGENDA. Mendeley Content. What are the advantages of Mendeley? How to use Mendeley? Mendeley Institutional Edition

Approaches to E-Book Acquisition in Bavaria

ANALYZING CERTAIN TEMPORAL DEPENDENCES IN NETFLIX DATA

Speakeasy By Suzey Ingold READ ONLINE

1 Category 1: Instrumental Performance (Individual)

Strategic Partnerships 2018

Theatre By David Mamet

RESPONSE OF THE NATIONAL ASSOCIATION OF THEATRE OWNERS (NATO) To the report and recommendations of The Federal Trade Commission

STRATEGIC PARTNERSHIPS

Open Access Journals: Quantity vs Quality Ruchareka Wittayawuttikul

administration access control A security feature that determines who can edit the configuration settings for a given Transmitter.

Keyboard Area Handbook for Undergraduate and Graduate Students in Applied Keyboard Courses

Vincenzo Terenzio Prize

The Sinner (The Return Of The Highlanders) By Margaret Mallory

Professional Orchestra Player

Pay TV channels distribution in Latam and North America

2017 The 5 th GOCAA YOUTH PIANIST COMPETITION SYLLABUS AND APPLICATION

Texas Bandmasters Association 2016 Convention/Clinic

On the Citation Advantage of linking to data

INTERIM RESULTS SKY NETWORK TELEVISION LIMITED INTERIM RESULTS DECEMBER 2018

Information for Authors and Editors

STAT 250: Introduction to Biostatistics LAB 6

Geoscience Librarianship 101 Geoscience Information Society (GSIS) Denver, CO September 24, 2016

Objective Content or process student will be able to know and do

- Primo Central (PCI) is a database of citations a mega-aggregator, approaching 1 billion items contained in 1700 collections

Magic Lantern Slide Heritage As Artefacts in the Common European History of Learning

PHILHARMONIA BAROQUE ORCHESTRA PRESENTS DECEMBER PERFORMANCES OF BACH S MASS IN B MINOR AND HANDEL S MESSIAH IN BAY AREA AND LOS ANGELES

Supplementary Note. Supplementary Table 1. Coverage in patent families with a granted. all patent. Nature Biotechnology: doi: /nbt.

ORCHESTRA ASSISTANT AND MUSIC LIBRARIAN

Australian Chamber Choir Regional Performance and Relationship Model

Chair s Corner. Special points of interest: May 1st Honors Convocation. May 14th Last day of classes. May 15 16th Study Period

MPO Patrons Concert Season

Patent Universe. Databases: USPTO and NBER. EXTRACTION of GOVERNING RULES of FORMATION First Patent (1975)

Pulling the plug: Three-in-ten Canadians are forgoing home TV service in favour of online streaming

Intermediate Piano Syllabus and Course Outline

American Popular Music: Course Syllabus

PHILIP C. CHANG

2019 JUNIOR AND INTERMEDIATE COMPETITION RULES AND REGULATIONS

Simplified Distribution Rules

Outline Traditional collection development Use studies Interlibrary loan Post transaction analysis Book purchase model Early implementers

RENAISSANCE THEATRE RENTAL GUIDE

Classical Pianist Nikolay Khozyainov to Appear in Silicon Valley

EXPERIENCE MUSIC IN COMMUNITY

MUSIC INTRODUCTION TO MUSICAL EXPERIENCES FIRST SUMMER SESSION 2012 SYLLABUS

State of the Library Report May Stephenville High School Library

Archiving Your Research: the UNM Institutional Repository

Homework 2 Key-finding algorithm

GMTA AUDITIONS INFORMATION & REQUIREMENTS Woodwinds and Brass

Palomar Pacific Chapter Barbershop Harmony Society Mission, Goals, Policies and Procedures

Wind Ensemble.

From Storehouse to Clubhouse Collection Management and the Library as Place. Indiana Library Federation Conference Fort Wayne, Indiana October 2009

7 th Grade. Drum Intermediate School Band Handbook

The role of publishers

A quarterly review of population trends and changes in how people can watch television

Hartt School Community Division Oboe Audition Teacher Resource Packet

Reading Habits Across Disciplines: A Study of Student E-book Use

Subscribe Now and receive these benefits:

Determinants of Cable Program Diversity [Slides]

SCHOOL (BEGINNING) BAND

Lessons from the Netflix Prize: Going beyond the algorithms

NORTH CAROLINA THEATRE PLAYBILL

ForeWord Reviews Announces IndieFab Awards. Interview with Victoria Champagne Sutherland Howard Lovy Matthew Sutherland

Can Song Lyrics Predict Genre? Danny Diekroeger Stanford University

Cracking the PubMed Linkout System

Matthew Gill & Jordan Laird Band Directors. David Lord Co-Teacher. Greer Middle School 3032 East Gap Creek Road Greer, SC 29651

Cutting the Cable. Mark Schulman

Finding a Home for Your Publication. Michael Ladisch Pacific Libraries

Transcription:

Lecture 1: Course logistics, homework 0 STATS 202: Data mining and analysis Jonathan Taylor, 9/24 Slide credits: Sergio Bacallado September 19, 2018 1 / 6

Syllabus Videos: Every lecture will be recorded by SCPD. 2 / 6

Syllabus Videos: Every lecture will be recorded by SCPD. Email policy: Please use the Piazza site for most questions. For administrative issues that only concern you, email the course staff mailing list: stats202-aut1819-staff@lists.stanford.edu 2 / 6

Syllabus Videos: Every lecture will be recorded by SCPD. Email policy: Please use the Piazza site for most questions. For administrative issues that only concern you, email the course staff mailing list: stats202-aut1819-staff@lists.stanford.edu Class website: stats202.stanford.edu. If you are auditing the class (not registered on Axess), email us your SUNet ID in order to gain access to the lectures and homework. 2 / 6

Prediction challenges The MNIST dataset is a library of handwritten digits. 3 / 6

Prediction challenges The MNIST dataset is a library of handwritten digits. In a prediction challenge, you are given a training set of images of handwritten digits, which are labeled from 0 to 9. 3 / 6

Prediction challenges The MNIST dataset is a library of handwritten digits. In a prediction challenge, you are given a training set of images of handwritten digits, which are labeled from 0 to 9. You are also given a test set of handwritten digits, which are not identified. 3 / 6

Prediction challenges The MNIST dataset is a library of handwritten digits. In a prediction challenge, you are given a training set of images of handwritten digits, which are labeled from 0 to 9. You are also given a test set of handwritten digits, which are not identified. Your job is to assign a digit to each image in the test set. 3 / 6

The Netflix prize Netflix popularized prediction challenges by organizing an open, blind contest to improve its recommendation system. The prize was $1 million. Users Rankings (1 to 5 stars) Movies 4 / 6

The Netflix prize Netflix popularized prediction challenges by organizing an open, blind contest to improve its recommendation system. The prize was $1 million. Users Some rankings were hidden in the training data Movies 4 / 6

The Netflix prize Netflix popularized prediction challenges by organizing an open, blind contest to improve its recommendation system. The prize was $1 million. Users The challenge was to predict those rankings Movies 4 / 6

Kaggle Company founded in 2010. Business model: Organize prediction competitions hosted online. Offer companies consulting services from Kaggle stars. 5 / 6

Kaggle Company founded in 2010. Business model: Organize prediction competitions hosted online. Offer companies consulting services from Kaggle stars. Kaggle-in-class is a competition engine offered to degree-granting institutions for free. Stats 202 was the first class to use it! 5 / 6

This quarter s Kaggle challenge Help out San Francisco s foremost Baroque ensemble bring in subscriptions! 6 / 6

This quarter s Kaggle challenge Help out San Francisco s foremost Baroque ensemble bring in subscriptions! Option 1: Using Philharmonia s database of subscriptions and single ticket sales, including information about concerts, and patrons, predict who will subscribe for the 2014-2015 season. 6 / 6

This quarter s Kaggle challenge Help out San Francisco s foremost Baroque ensemble bring in subscriptions! Option 1: Using Philharmonia s database of subscriptions and single ticket sales, including information about concerts, and patrons, predict who will subscribe for the 2014-2015 season. Option 2: Create an interactive visualization of Philharmonia s database using the R package Shiny. 6 / 6

This quarter s Kaggle challenge Help out San Francisco s foremost Baroque ensemble bring in subscriptions! Option 1: Using Philharmonia s database of subscriptions and single ticket sales, including information about concerts, and patrons, predict who will subscribe for the 2014-2015 season. Option 2: Create an interactive visualization of Philharmonia s database using the R package Shiny. Invitations to the competition go out at the end of the week! 6 / 6