Contact Lens Data. spectacle prescription astigmatism

Similar documents
Machine Learning: finding patterns

Data Mining Part 1. Tony C. Smith WEKA Machine Learning Group Department of Computer Science University of Waikato

Introduction to Knowledge Systems

Simple applications of neural nets. Character recognition. CIS 412 Artificial Intelligence, Dr. Iren Valova, UMASS Dartmouth

Integrated Precision Harvesting System: A Promising Technology to Improve Berry Yield and Quality. Precision Agriculture Research Team

Singer Traits Identification using Deep Neural Network

Deep Neural Networks Scanning for patterns (aka convolutional networks) Bhiksha Raj

Frequencies. Chapter 2. Descriptive statistics and charts

Sonic's Third Quarter Results Reflect Current Challenges

MAG LED TETRIS LINEAR. LEDs for everyone...

Product Number : VZ570 Product Name : As of August Specifications and appearance are subject to change without notice.

N12/5/MATSD/SP2/ENG/TZ0/XX. mathematical STUDIES. Wednesday 7 November 2012 (morning) 1 hour 30 minutes. instructions to candidates

Product Number : VX600 Product Name : As of August Specifications and appearance are subject to change without notice.

Answers. Chapter 9 A Puzzle Time MUSSELS. 9.1 Practice A. Technology Connection. 9.1 Start Thinking! 9.1 Warm Up. 9.1 Start Thinking!

Chapter 4. Displaying Quantitative Data. Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley

7.5ft Outdoor Double-Sided Interactive LCD Sign, Transit Center Kiosk Model: NISDK-550PCT, 55 Double-Sided

Iva Petrinska, Dilyan Ivanov, Valchan Georgiev TU Sofia, Bulgaria. September 25, 2017

Human Number Line. Introduction to Negative Numbers

eclipse HYBRID POWER & SIGNAL THE SCIENCE OF CERTAINTY M06U-NC For use in power supplies, server equipment and related hardware

PT-LB385 S P E C F I L E. LCD Projectors. As of December Specifications and appearance are subject to change without notice.

SHOWLINE SL BAR 640 LINEAR WASH LUMINAIRE SPECIFICATIONS.

PT-LB382 S P E C F I L E. LCD Projectors. As of July Specifications and appearance are subject to change without notice. 1/9.

GCSE Mathematics Practice Tests: Set 4

Barry County 4-H. Name: Address: 4-H Club: 4-H Leader: 4-H Age: Years in 4-H Llama/Alpaca Project:

base calling: PHRED...

TECHNICAL SPECIFICATION Multi-beam S-band Klystron type BT267

UNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BioEpi 540W - Introduction to Biostatistics Fall 2002

Product Number : LW373 Product Name : As of October Specifications and appearance are subject to change without notice.

Portland Public Schools Content Standards Science Scientific Inquiry Grade 3

EdiLine III Series Datasheet

WHAT'S HOT: LINEAR POPULARITY PREDICTION FROM TV AND SOCIAL USAGE DATA Jan Neumann, Xiaodong Yu, and Mohamad Ali Torkamani Comcast Labs

High intensity led illuminator ILED42A FEATURES: DESCRIPTION: ELECTRICAL CHARACTERISTICS:

Guidelines for Specification of LED Lighting Products 2010

Data Pattern Generator

Modeling memory for melodies

download instant at

User s Manual. Log Scale (/LG) GX10/GP10/GX20/GP20 IM 04L51B01-06EN. 1st Edition

Fortnightly withholding table with no and half Medicare levy

SHOWLINE SL NITRO 510 LED STROBE LUMINAIRE SPECIFICATIONS.

Self-publishing. March 19 April 9, Ray Bonnell

HOW WE KNOW WHAT ISN T SO: COGNITIVE SCIENCE AND MIND TRAPS

In this Issue: AMS News Letter. Software Updates. Display Phase Out. Topdressing. Spring Inside this issue:

A G REAT P L A C E T O W OR K

X-ray inspection system Dymond

SignalTap Plus System Analyzer

PRODUCT FAMILY DATASHEET LEDVANCE SPOT LED

Connected Industry and Enterprise Role of AI, IoT and Geospatial Technology. Vijay Kumar, CTO ESRI India

Algebra I Module 2 Lessons 1 19

FOM-9010, FOM-9011 and FOM-9012

NOXLITE LED SPOT. Product family datasheet. Outdoor LED wall luminaires. Areas of application _ Outdoor _ Garages _ Patios, balconies

In-process inspection: Inspector technology and concept

Analysis of data from the pilot exercise to develop bibliometric indicators for the REF

E2 Full-sized Event Master processor

PT-LB300 S P E C F I L E. LCD Projectors. As of April Specifications and appearance are subject to change without notice. 1/8.

HAVE GOT WAS WERE CAN. Koalatext.com TO BE GRAMMAR CONDITIONAL 0

TOUCH PANEL SPECIFICATION

Spring Themed. Writing Tools for Intervention. created by: The Curriculum Corner

BUREAU OF ENERGY EFFICIENCY

Efficient Computer-Aided Pitch Track and Note Estimation for Scientific Applications. Matthias Mauch Chris Cannam György Fazekas

User Manual CC DC 24 V 5A. Universal Control Unit UC-1-E. General Information SET. Universal Control Unit UC-1 Of Central Lubrication PAUSE CONTACT

Relationships Between Quantitative Variables

PRODUCT FAMILY DATASHEET LEDVANCE SPOT LED

Product Number : TW350 Product Name : As of October Specifications and appearance are subject to change without notice.

Table of Contents. iii

Printed Documentation

Eagle Business Software

LC026B - Gen3. High Power LED Series Chip on Board. High efficacy COB LED package, well-suited for use in spotlight applications. Features & Benefits

Secto Design Oy Kauppalantie Kauniainen Finland tel fax

THE COMPACT, ALL-IN-ONE HD PROJECTOR bringing the big screen to your home

15D05A Broadband Solid State Power Amplifier

Hot Spot Interview Results Joshua Gutwill 7/25/99

PT-LW330 S P E C F I L E. LCD Projectors. As of April Specifications and appearance are subject to change without notice. 1/9.

The Art and Science of Depiction. Color. Fredo Durand MIT- Lab for Computer Science

SHOWLINE SL epar 180 LUMINAIRE SPECIFICATIONS.

PLAXIS 8.x Professional version A3-2 REPORT. agosto 05, Università Politecnica delle Marche. Molo foraneo di Testata 2/51

Music Genre Classification

Family of Christ. Child Development Center. Goals & Objectives for Kindergarten

Relationships. Between Quantitative Variables. Chapter 5. Copyright 2006 Brooks/Cole, a division of Thomson Learning, Inc.

SUPER LED F4.7 VW Vari-White INSTRUCTION MANUAL

Multifunction Digital Timer

Dot Plots and Distributions

LCD MODULE DEM B SYH

Detecting Medicaid Data Anomalies Using Data Mining Techniques Shenjun Zhu, Qiling Shi, Aran Canes, AdvanceMed Corporation, Nashville, TN

ET-5050x-BF1W Datasheet

Size B INDUSTRIAL STANDARD

Year 6 Questionnaire

Fundamentals of LED Video

SPEED LABOLIGHT. Instruction Manual

Qualifying Criteria Versions 4.1 and 5.1

Automatic Construction of Synthetic Musical Instruments and Performers

11 GHz MDD FIBER OPTIC LINK FEATURES TYPICAL APPLICATIONS

Job Name. Housings are constructed of extruded aluminum. Clear polycarbonate lens standard. c o n s t r u c t i o n

COMMUNICATION SKILLS: K-2

Math 7 /Unit 07 Practice Test: Collecting, Displaying and Analyzing Data

PILOT COVE INTERIOR - INTEGRAL POWER SUPPLY WHITE LIGHT

Musicians, Singers, and Related Workers

Stenberg, Shari J. Composition Studies Through a Feminist Lens. Anderson: Parlor Press, Print. 120 pages.

Product Number : LB303 Product Name : As of October Specifications and appearance are subject to change without notice.

TIME SURFING. Let s go

G-SPOT MOVING HEAD PROFILE

Transcription:

Example Data Sets Contact Lens (symbolic) Weather (symbolic data) Weather ( numeric +symbolic) Iris (numeric; outcome:symbolic) CPU Perf.(numeric; outcome:numeric) Labor Negotiations (missing values) Soybean 1

Contact Lens Data age spectacle prescription astigmatism tear production rate recommendation lenses young myope no reduced none young myope no normal soft young myope yes reduced none young myope yes normal hard young hypermetrope no reduced none young hypermetrope no normal soft young hypermetrope yes reduced none young hypermetrope yes normal hard pre-presbyopic myope no reduced none pre-presbyopic myope no normal soft pre-presbyopic myope yes reduced none pre-presbyopic myope yes normal hard pre-presbyopic hypermetrope no reduced none pre-presbyopic hypermetrope no normal soft pre-presbyopic hypermetrope yes reduced none pre-presbyopic hypermetrope yes normal none presbyopic myope no reduced none presbyopic myope no normal none presbyopic myope yes reduced none presbyopic myope yes normal hard presbyopic hypermetrope no reduced none presbyopic hypermetrope no normal soft presbyopic hypermetrope yes reduced none presbyopic hypermetrope yes normal none 2

Structural Patterns Part of structural description If tear production rate = reduced Otherwise, if age = young and astigmatic = no then recommendation = soft then recommendation = none Example is simplistic because all combinations of possible values are represented in table 3

Structural Patterns In most learning situations, the set of examples given as input is far from complete Part of the job is to generalize to other, new examples 4

Weather Data outlook temperature humidity windy play sunny hot high false no sunny hot high true no overcast hot high false yes rainy mild high false yes rainy cool normal false yes rainy cool normal true no overcast cool normal true yes sunny mild high false no sunny cool normal false yes rainy mild normal false yes sunny mild normal true yes overcast mild high true yes overcast hot normal false yes rainy mild high true no 5

Weather Problem This creates 36 possible combinations (3 X 3 X 2 X 2 = 36), of which 14 are present in the set of examples If outlook = sunny and humidity = high If outlook = rainy and windy = true If outlook = overcast If humidity = normal If none of the above then play = no then play = no then play = yes then play = yes then play = yes 6

Weather Data with Some Numeric Attributes outlook temperature humidity windy play sunny 85 85 false no sunny 80 90 true no overcast 83 86 false yes rainy 70 96 false yes rainy 68 80 false yes rainy 65 70 true no overcast 64 65 true yes sunny 72 95 false no sunny 69 70 false yes rainy 75 80 false yes sunny 75 70 true yes overcast 72 90 true yes overcast 81 75 false yes rainy 71 91 true no 7

Classification and Association Rules Classification Rules: rules which predict the classification of the example in terms of whether to play or not If outlook = sunny and humidity = >83, then play = no 8

Classification and Association Rules Association Rules: rules which strongly associate different attribute values Association rules which derive from weather table If temperature = cool If humidity = normal and windy = false If outlook = sunny and play = no If windy = false and play = no and humidity = high then humidity = normal then play = yes then humidity = high then outlook = sunny 9

Rules for Contact Lens Data If tear production rate = reduced then recommendation = none If age = young and astigmatic = no and tear production rate = normal then recommendation = soft If age = pre-presbyopic and astigmatic = no and tear production rate = normal then recommendation = soft If age = presbyopic and spectacle prescription = myope and astigmatic = no then recommendation = none If spectacle prescription = hypermetrope and astigmatic = no and tear production rate = normal then recommendation = soft If spectacle prescription = myope and astigmatic = yes and tear production rate = normal then recommendation = hard If age = young and astigmatic = yes and tear production rate = normal then recommendation = hard If age = pre-presbyopic and spectacle prescription = hypermetrope and astigmatic = yes then recommendation = none If age = presbyopic and spectacle prescription = hypermetrope and astigmatic = yes then recommendation = none 10

Decision Tree for Contact Lens Data tear production rate none astigmatism soft spectacle prescription hard none 11

Iris Data sepal length sepal width pedal lenth pedal width type 1 5.1 3.5 1.4 0.2 Iris setosa 2 4.9 3.0 1.4 0.2 Iris setosa 3 4.7 3.2 1.3 0.2 Iris setosa 4 4.6 3.1 1.5 0.2 Iris setosa 5 5.0 3.6 1.4 0.2 Iris setosa 51 7.0 3.2 4.7 1.4 Iris 52 6.4 3.2 4.5 1.5 Iris 53 6.9 3.1 4.9 1.5 Iris 54 5.5 2.3 4.0 1.3 Iris 55 6.5 2.8 4.6 1.5 Iris 101 6.3 3.3 6.0 2.5 Iris virginica 102 5.8 2.7 5.1 1.9 Iris virginica 103 7.1 3.0 5.9 2.1 Iris virginica 104 6.3 2.9 5.6 1.8 Iris virginica 105 6.5 3.0 5.8 2.2 Iris virginica 12

Iris Rules Learned If petal-length <2.45 then Iris-setosa If sepal-width <2.10 then Iris-versicolor If sepal-width < 2.45 and petal-length <4.55 then Irisversicolor... 13

CPU Performance Data cycle main memory (Kb) cache channels performance time (ns) min max (Kb) min max MYCT MMIN MMAX CACH CHMIN CHMAX PRP 1 125 256 6000 256 16 128 198 2 29 8000 32000 32 8 32 269 3 29 8000 32000 32 8 32 220 4 29 8000 32000 32 8 32 172 5 29 8000 16000 32 8 16 132 207 125 2000 8000 0 2 14 52 208 480 512 8000 32 0 0 67 209 480 1000 4000 0 0 0 45 14

CPU Performance Numerical Prediction: outcome as linear sum of weighted attributes Regression equation: PRP=-55.9+.049MYCT+.+1.48CHMAX Regression can discover linear relationships, not non-linear ones 15

Labor Negotiations Data attribute type 1 2 3 40 duration (number of years) 1 2 3 2 wage increase first year persentage 2% 4% 4.3% 4.5 wage increase second year persentage? 5% 4.4% 4.0 wage increase third year persentage???? cost of living adjustment {none, tcf, tc} none tcf? none working hours per week (number of hours 28 35 38 40 pension {none, ret-allw, none??? standby pay persentage? 13%?? shift-work supplement persentage? 5% 4% 4 education allowance {yes, no} yes??? statutory holidays (number of days) 11 15 12 12 vacation {below-avg, avg, avg gen gen avg long-term disablity {yes, no} no?? yes dental plan contribution {none, half, full} none? full full bereavement assistance {yes, no} no?? yes health plan contribution {none, half, full} none? full half acceptablity of contract {good, bad} bad good good good 16

Classification Debt No loan Loan Income A simple linear classification boundary for the loan data set; shaded region denotes class no loan 17

Linear Regression Regression Line Debt Income A simple linear regression for the loan data set 18

Clustering Debt Cluster 1 Cluster 2 Cluster 3 Income A simple clustering of the loan data set into 3 clusters; note that the original labels are replaced by + s 19

Non-Linear Classification Debt No Loan Loan Income An example of classification boundaries learned by a non-linear classifier (such as a neural network) for the loan data set 20

Nearest Neighbor Classifier Debt No Loan Loan Income Classification boundaries for a nearest neighbor classifier for the loan data set 21

Decision Trees for... Wage increase first year 2.5 Bad > 2.5 Statutory holidays > 10 10 Good Wage increase first year 4 < 4 Bad Good 22

Labor Negotiations Data Wage increase first year 2.5 > 2.5 Working hours per week Statutory holidays > 36 36 > 10 10 Bad Health plan contribution Good Wage increase first year none half full 4 < 4 Bad Good Bad Bad Good 23

Soy Bean Data Attribute Number of Values Sample Value Environment time of occurrence 7 July precipitation 3 above normal temperature 3 normal Seed condition 2 normal mold growth 2 absent discoloration 2 absent Fruit condition of fruit pods 4 normal Leaves condition 2 abnormal yellow leaf spot halo 3 absent leaf spot margins 3 no data Stem condition 2 abnormal stem lodging 2 yes stem cankers 4 above the soil line Roots condition 3 normal Diagnosis 19 diaporthe stem canker 24

Two Example Rules If then If then [leaf condition is normal and stem condition is abnormal and stem cankers is below soil line and canker lesion color is brown] diagnosis is rhizoctonia root rot [leaf malformation is absent and stem condition is abnormal and stem cankers is below soil line and canker lesion color is brown] diagnosis is rhizoctonia root rot 25

Iris Data Clustering Problem Sepal Length Sepal Width Petal Length Petal Width 1 5.1 3.5 1.4 0.2 2 4.9 3 1.4 0.2 3 4.7 3.2 1.3 0.2 4 4.6 3.1 1.5 0.2 5 5 3.6 1.4 0.2 51 7 3.2 4.7 1.4 52 6.4 3.2 4.5 1.5 53 6.9 3.1 4.9 1.5 54 5.5 2.3 4 1.3 55 6.5 2.8 4.6 1.5 101 6.3 3.3 6 2.5 102 5.8 2.7 5.1 1.9 103 7.1 3 5.9 2.1 104 6.3 2.9 5.6 1.8 105 6.5 3 5.8 2.2 26

Weather Data Numeric Class Outlook Temperature Humidity Windy Play-time sunny 85 85 false 5 sunny 80 90 true 0 overcast 83 86 false 55 rainy 70 96 false 40 rainy 68 80 false 65 rainy 65 70 true 45 overcast 64 65 true 60 sunny 72 95 false 0 sunny 69 70 false 70 rainy 75 80 false 45 sunny 75 70 true 50 overcast 72 90 true 55 overcast 81 75 false 75 rainy 71 91 true 10 27

Family Tree Peter M = Peggy F Grace F = Ray M Steven M M Pa F = Ian M Pippa F Brian M Anna F Nikki F 28

Family Tree First Person Second Person Sister-of? Peter Peggy no Peter Steven no Steven Peter no Steven Graham no Steven Pam yes Steven Grace no Ian Pippa yes Anna Nikki yes Nikki Anna yes First Person Second Person Steven Pam yes Graham Pam yes Ian Pippa yes Brian Pippa yes Anna Nikki yes Nikki Anna yes All the rest no Sister-of? 29

Family Tree As Table Name Gender Parent1 Parent2 Peter male?? Peggy female?? Steven male Peter Peggy Graham male Peter Peggy Pam female Peter Peggy Ian male Grace Ray 30

Sister-of As Table First Person Second Person Sister of? Name Gender Parent1 Parent2 Name Gender Parent 1 Parent2 Steven male Peter Peggy Pam female Peter Peggy yes Graham male Peter Peggy Pam female Peter Peggy yes Ian male Grace Ray Pippa female Grace Ray yes Ian male Grace Ray Pippa female Grace Ray yes Annna female Pam Ian Nikki female Pam Ian yes Nikki female Pam Ian Anna female Pam Ian yes All the rest no 31

Another Relationship As Table First Person Second Person Ancestor of? Name Gender Parent1 Parent2 Name Gender Parent 1 Parent2 Peter male?? Steven male Peter Peggy yes Peter male?? Pam female Peter Peggy yes Peter male?? Anna female Pam Ian yes Peter male?? Nikki female Pam Ian yes Pam female Peter Peggy Nikki female Pam Ian yes Grace female?? Ian male Grace Ray yes Grace female?? Nikki female Pam Ian yes Other examples here All the rest yes no 32

ARFF File for Weather Data % ARFF file for the weather data with some numeric features % @relation weather @attribute outlook {sunny, overcast, rainy} @attribute temperature numeric @attribute humidity numeric @attribute windy {true, false} @attribute play? {yes, no} rainy, 70, 96, false, yes rainy, 68, 80, false, yes rainy, 65, 70, true, no overcast, 64, 65, true, yes sunny, 72, 95, false, no sunny, 69, 70, false, yes rainy, 75, 80, false, yes sunny, 75, 70, true, yes overcast, 72, 90, true, yes overcast, 81, 75, false, yes rainy, 71, 91, true, no @data % %14 instances % sunny, 85, 85, false, no sunny, 80, 90, true, no overcast, 83, 86, false, yes 33

Simple Disjunction a y n b c y n y n x c d y n y n d x y n x 34

Exclusive-Or Problem 1 0 a b b a 0 1 Y =1? no yes X =1? no yes Y =1? no yes If x = 1 and y = 0 then class = a If x = 0 and y = 1 then class = a If x = 0 and y = 0 then class = b If x = 1 and y = 1 then class = b b a a b 35

Replicated Subtree If x = 1 and y = 1 then class = a If z = 0 and w = 1 then class = a Otherwise class = b y X 1 2 3 1 3 a 2 z 1 2 3 w b b 1 2 3 a b b 36

New Iris Flower Sepal Length Sepal Width Petal Length Petal Width Type 5.1 3.5 2.6 0.2? 37

Rules for Iris Data Default: Iris-setosa 1 except if petal-length 2.45 and petal-length < 5.355 2 and petal-width < 1.75 3 then Iris-versicolor 4 except if petal-length 4.95 and petal-width < 1.55 5 then Iris-virginica 6 else if sepal-length < 4.95 and sepal-width 2.45 7 then Iris-virginica 8 else if petal-length 3.35 9 then Iris-virginica 10 except if petal-length < 4.85 and sepal-length < 5.95 11 then Iris-versicolor 12 38

The Shapes Problem Shaded: Standing Unshaded: Lying 39

Training Data for Shapes Problem Width Height Sides Class 2 4 4 standing 3 6 4 standing 4 3 4 lying 7 8 3 standing 7 6 3 lying 2 9 4 standing 9 1 4 lying 10 2 3 lying 40

CPU Performance Data PRP = -56.1 +0.049 MYCT +0.015 MMIN +0.006MMAX +0.630CACH -0.270CHMIN +1.46 CHMAX (a) linear regression MMAX 8.5 64.6 (24/19.2%) CACH CHMIN 7.5 >7.5 MMAX (8.5, >28 28000 >28000 28] 157 MMAX CHMAX (21/73.7%) 19.3 (28/8.7%) (2500, 2500 29.8 (37/8.18%) 4250] >4250 1000 >10000 58 >58 CACH 75.7 (10/24.6%) 133 (16/28.8%) MMIN 783 (5/359%) MYCT 0.5 (0.5,8.5] 59.3 (24/16.9%) 281 (11/56%) 12000 >12000 492 (7/53.9%) 37.3 (19/11.3%) 550 >550 18.3 (7/3.83%) (b) regression tree 41

CPU Performance Data CHMIN 7.5 >7.5 MMAX 4250 LM1 (65/7.32%) LM2 (26/6.37%) 8.5 CACH LM4 (50/22.17%) >4250 CACH 0.5 >8.5 (0.5,8.5] LM3 (24/14.5%) MMAX LM5 (21/45.5%) 28000 >28000 LM6 (23/63.5%) LM1 PRP = 8.29 + 0.004 MMAX + 2.77 CHMIN LM2 PRP = 20.3 + 0.004 MMIN - 3.99 CHMIN + 0.946 CHMAX LM3 PRP = 38.1 + 0.012 MMIN LM4 PRP = 10.5 + 0.002 MMAX + 0.698 CACH +0.969 CHMAX LM5 PRP = 285-1.46 MYCT + 1.02 CACH - 9.39 CHMIN LM6 PRP = -65.8 + 0.03 MMIN - 2.94 CHMIN = 4.98 CHMAX (c) model 42

Partitioning Instance Space 43

Ways to Represent Clusters 44