Week 5 Video 4. Relationship Mining Sequential Pattern Mining

Association Rule Mining
Try to automatically find if-then rules within the data set

Sequential Pattern Mining
Try to automatically find temporal patterns within the data set

ARM Example
If person X buys diapers, person X buys beer
Purchases occur at the same time

SPM Example
If person X takes Intro Stats now, person X takes Advanced Data Mining in a later semester
Conclusion: recommend Advanced Data Mining to students who have previously taken Intro Stats
Doesn't matter if they take other courses in between

SPM Example
Learners in virtual environments have different sequences of behavior depending on their degree of self-regulated learning
High self-regulated learning: Tend to gather information and then immediately record it carefully
Low self-regulated learning: Tend to gather more information without pausing to record it
(Sabourin, Mott, & Lester, 2011)

Different Constraints than ARM
If-then elements do not need to occur in the same data point
Instead:
If-then elements should involve the same student (or other organizing variable, like teacher or school)
"If" elements can be within a certain time window of each other
"Then" element time should be within a certain window after "if" times
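
The time-window constraint can be illustrated with a small check on one student's timestamped events. This is only a sketch under assumptions: the function name `follows_within`, the event labels, and the use of seconds are hypothetical, and it handles a single "if" element rather than several.

```python
def follows_within(events, if_item, then_item, max_gap):
    """events: list of (time_in_seconds, item) for ONE student.

    Returns True if some `then_item` occurs after an `if_item`
    and no more than `max_gap` seconds later.
    """
    if_times = [t for t, item in events if item == if_item]
    then_times = [t for t, item in events if item == then_item]
    return any(0 < t2 - t1 <= max_gap for t1 in if_times for t2 in then_times)


# Hypothetical events for one student: gather information, then record it.
events = [(0, "GATHER"), (20, "RECORD"), (300, "GATHER")]

print(follows_within(events, "GATHER", "RECORD", 60))  # True: RECORD is 20 s after GATHER
print(follows_within(events, "GATHER", "RECORD", 10))  # False: 20 s exceeds the 10 s window
```

Because the check runs on one student's events at a time, the "same student" constraint is enforced by how the data is grouped before calling it.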

Sequential Pattern Mining
Find all subsequences in data with high support
Support calculated as number of sequences that contain subsequence, divided by total number of sequences
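
The support calculation can be sketched in a few lines. The example assumes a subsequence only has to appear in order, with other items allowed in between; the function names and the toy data are illustrative, not from the lecture.

```python
def contains_subsequence(sequence, pattern):
    """True if the items of `pattern` appear in `sequence` in order (gaps allowed)."""
    remaining = iter(sequence)
    # `item in remaining` advances the iterator, so order is respected.
    return all(item in remaining for item in pattern)


def support(sequences, pattern):
    """Number of sequences containing `pattern`, divided by the total number of sequences."""
    matches = sum(contains_subsequence(s, pattern) for s in sequences)
    return matches / len(sequences)


# Toy data: four sequences, one per student.
sequences = [
    ["a", "b", "c"],
    ["a", "c"],
    ["c", "a"],
    ["b", "d"],
]

print(support(sequences, ["a", "c"]))  # 0.5: two of the four sequences contain a, then c
```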

GSP (Generalized Sequential Pattern)
Classic algorithm for SPM (Srikant & Agrawal, 1996)

Data pre-processing
Data transformed from individual actions to sequences by user
Bob: {GAMING and BORED, OFF-TASK and BORED, ON-TASK and BORED, GAMING and BORED, GAMING and FRUSTRATED, ON-TASK and BORED}

Data pre-processing
In some cases, time also included
Bob: {GAMING and BORED 5:05:20, OFF-TASK and BORED 5:05:40, ON-TASK and BORED 5:06:00, GAMING and BORED 5:06:20, GAMING and FRUSTRATED 5:06:40, ON-TASK and BORED 5:07:00}
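
As a rough illustration of this pre-processing step, the sketch below groups a flat action log into one time-ordered sequence per user. The log layout (user, timestamp, state), the helper names, and the user "Ann" are assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical flat log: (user, timestamp, observed state), not necessarily in time order.
log = [
    ("Bob", "5:05:40", "OFF-TASK and BORED"),
    ("Bob", "5:05:20", "GAMING and BORED"),
    ("Ann", "5:05:30", "ON-TASK and BORED"),
    ("Bob", "5:06:00", "ON-TASK and BORED"),
]


def to_seconds(hms):
    """Convert an 'h:mm:ss' string into seconds so timestamps sort numerically."""
    hours, minutes, seconds = (int(part) for part in hms.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def build_sequences(rows):
    """Group rows by user and return each user's states ordered by time."""
    by_user = defaultdict(list)
    for user, timestamp, state in rows:
        by_user[user].append((to_seconds(timestamp), state))
    return {
        user: [state for _, state in sorted(events, key=lambda e: e[0])]
        for user, events in by_user.items()
    }


print(build_sequences(log)["Bob"])
# ['GAMING and BORED', 'OFF-TASK and BORED', 'ON-TASK and BORED']
```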

Algorithm
Take the whole set of sequences of length 1 (may include ANDed combinations at same time)
Find which sequences of length 1 have support over pre-chosen threshold
Compose potential sequences out of pairs of sequences of length 1 with acceptable support
Find which sequences of length 2 have support over pre-chosen threshold
Compose potential sequences out of triplets of sequences of length 1 and 2 with acceptable support
Continue until no new sequences found
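
The level-wise search above can be sketched as follows. This is a simplified illustration in the spirit of GSP, not the full published algorithm: it treats every element as a single item (no ANDed combinations), ignores time constraints, and builds longer candidates by joining frequent sequences that overlap, which is one way to realize the "compose" steps. All names and the toy data are assumptions.

```python
def contains_subsequence(sequence, pattern):
    """True if the items of `pattern` appear in `sequence` in order (gaps allowed)."""
    remaining = iter(sequence)
    return all(item in remaining for item in pattern)


def support(sequences, pattern):
    """Fraction of sequences that contain `pattern` as a subsequence."""
    return sum(contains_subsequence(s, pattern) for s in sequences) / len(sequences)


def frequent_sequences(sequences, min_support):
    """Return all patterns whose support is at least `min_support`, shortest first."""
    items = sorted({item for s in sequences for item in s})
    # Level 1: single items with acceptable support.
    level = [(i,) for i in items if support(sequences, (i,)) >= min_support]
    found = list(level)
    while level:
        # Join p and q when p without its first item equals q without its last item.
        candidates = {p + q[-1:] for p in level for q in level if p[1:] == q[:-1]}
        level = sorted(c for c in candidates if support(sequences, c) >= min_support)
        found.extend(level)
    return found


# Toy data: four short sequences.
sequences = [("a", "c", "d"), ("a", "d"), ("a", "c"), ("b",)]

print(frequent_sequences(sequences, 0.5))
# [('a',), ('c',), ('d',), ('a', 'c'), ('a', 'd')]
```

The loop stops when a level produces no sequence above the threshold, which corresponds to "continue until no new sequences found".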

Example: frequent sequences found as the search proceeds (the slide's sequence data is not included in the transcript)
a, b, c, d, e, f
a, b, c, d, e, f, ac (14/40 = 35%)
a, b, c, d, e, f, ac, ad, ae
a, b, c, d, e, f, ac, ad, ae, aad
a, b, c, d, e, f, ac, ad, ae, aad, aae, ade

From ac, ad, ae, aad, aae, ade
To a → c, a → d, a → e, a → ad, a → ae, ad → e
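
One way to picture this step is to enumerate every split of a frequent sequence into an "if" part and a "then" part. The sketch below lists all such splits; the lecture keeps one rule per sequence, and how that rule is chosen is not stated in the transcript, so the function `candidate_rules` is purely illustrative.

```python
def candidate_rules(sequence):
    """All ways to split `sequence` into a non-empty 'if' prefix and a non-empty 'then' suffix."""
    return [(sequence[:i], sequence[i:]) for i in range(1, len(sequence))]


for seq in ["ac", "aad", "ade"]:
    rules = [f"{lhs} -> {rhs}" for lhs, rhs in candidate_rules(seq)]
    print(seq, rules)
# ac ['a -> c']
# aad ['a -> ad', 'aa -> d']
# ade ['a -> de', 'ad -> e']
```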

Other algorithms
FreeSpan
PrefixSpan
Select sub-sets of data to search within
Faster, but same basic idea as in GSP

Differential Sequence Mining (Kinnebrew et al., 2013)
Compares the support for sequential patterns between two groups, such as high-performing and low-performing students
To find the patterns that are much more common in one group than the other
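
The core comparison can be sketched by computing a pattern's support separately in each group and taking the difference. This is a minimal illustration only: the group data and action labels are invented, and the published method may apply further statistical testing that is not shown here.

```python
def contains_subsequence(sequence, pattern):
    """True if the items of `pattern` appear in `sequence` in order (gaps allowed)."""
    remaining = iter(sequence)
    return all(item in remaining for item in pattern)


def support(sequences, pattern):
    """Fraction of sequences that contain `pattern` as a subsequence."""
    return sum(contains_subsequence(s, pattern) for s in sequences) / len(sequences)


def support_difference(group_a, group_b, pattern):
    """Support of `pattern` in group_a minus its support in group_b."""
    return support(group_a, pattern) - support(group_b, pattern)


# Hypothetical action sequences for two groups of students.
high_performing = [
    ("READ", "NOTE"),
    ("READ", "QUIZ", "NOTE"),
    ("READ", "NOTE", "QUIZ"),
    ("QUIZ",),
]
low_performing = [
    ("READ", "QUIZ"),
    ("QUIZ", "READ"),
    ("NOTE", "READ"),
    ("READ", "NOTE"),
]

print(support_difference(high_performing, low_performing, ("READ", "NOTE")))
# 0.5: support is 0.75 in the first group and 0.25 in the second
```

Patterns with a large positive or negative difference are the ones that are much more common in one group than the other.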

Process Mining
Related algorithm
Rather than just finding small, local patterns, tries to find overarching processes that occur over the course of a set of events, or tries to find discrepancies in approved processes
For example, do students' self-regulatory processes over time match theoretical models? (Bannert et al., 2014)

Next lecture: Network Analysis