ECS 189G: Intro to Computer Vision March 31 st, Yong Jae Lee Assistant Professor CS, UC Davis

Similar documents
CS 1699: Intro to Computer Vision. Introduction. Prof. Adriana Kovashka University of Pittsburgh September 1, 2015

CS 2770: Computer Vision. Introduction. Prof. Adriana Kovashka University of Pittsburgh January 5, 2017

Generic object recognition

Indexing local features. Wed March 30 Prof. Kristen Grauman UT-Austin

CS 1674: Intro to Computer Vision. Intro to Recognition. Prof. Adriana Kovashka University of Pittsburgh October 24, 2016

CSE 166: Image Processing. Overview. Representing an image. What is an image? History. What is image processing? Today. Image Processing CSE 166

Indexing local features and instance recognition

Lecture 5: Clustering and Segmentation Part 1

CPSC 425: Computer Vision

BBM 413 Fundamentals of Image Processing Dec. 11, Erkut Erdem Dept. of Computer Engineering Hacettepe University. Segmentation Part 1

CS 1674: Intro to Computer Vision. Face Detection. Prof. Adriana Kovashka University of Pittsburgh November 7, 2016

Lecture 5: Clustering and Segmenta4on Part 1

Announcements. Project Turn-In Process. Project 1A: Project 1B. and URL for project on a Word doc Upload to Catalyst Collect It

Announcements. Project Turn-In Process. and URL for project on a Word doc Upload to Catalyst Collect It

Smart Traffic Control System Using Image Processing

Exhibits. Open House. NHK STRL Open House Entrance. Smart Production. Open House 2018 Exhibits

This project will work with two different areas in digital signal processing: Image Processing Sound Processing

1/29/2008. Announcements. Announcements. Announcements. Announcements. Announcements. Announcements. Project Turn-In Process. Quiz 2.

Concept of ELFi Educational program. Android + LEGO

Image Processing Using MATLAB (Summer Training Program) 6 Weeks/ 45 Days PRESENTED BY

Fundamentals of Telecommunications and Computer Networks

Instance Recognition. Jia-Bin Huang Virginia Tech ECE 6554 Advanced Computer Vision

AN INTEGRATED MATLAB SUITE FOR INTRODUCTORY DSP EDUCATION. Richard Radke and Sanjeev Kulkarni

MATLAB & Image Processing (Summer Training Program) 4 Weeks/ 30 Days

NENS 230 Assignment #2 Data Import, Manipulation, and Basic Plotting

CS 7643: Deep Learning

ECE302H1S Probability and Applications (Updated January 10, 2017)

CURIE Day 3: Frequency Domain Images

Bar Codes to the Rescue!

Lab 6: Edge Detection in Image and Video

SMART VEHICLE SCREENING SYSTEM USING ARTIFICIAL INTELLIGENCE METHODS

HISTORY 3800 (The Historian s Craft), Spring :00 MWF, Haley 2196

2. Problem formulation

Digital Signal Processing

Music Appreciation Course Syllabus Fall 2016

Music Appreciation Course Syllabus Fall 2014

Music Understanding and the Future of Music

Design and implementation (in VHDL) of a VGA Display and Light Sensor to run on the Nexys4DDR board Report and Signoff due Week 6 (October 4)

Outline. Why do we classify? Audio Classification

COURSE SYLLABUS Fall 2018

General Course information for. Primate Biology

Summarizing Long First-Person Videos

RTD 470 Electronic News Field Production

MUS 210: SONGWRITING MICHIGAN STATE UNIVERSITY FALL 2014

ECE 4/517 MIXED SIGNAL IC DESIGN LECTURE 1 SLIDES. Vishal Saxena (vsaxena AT uidaho DOT edu) AMPIC Laboratory University of Idaho

Music Business and Industry MUS Fall 2017 M-W-F 8:30 9:20 CB1, Rm. 0308

DEPARTMENT OF FINE ARTS COURSE OUTLINE FALL 2015 MU2550 A2 MUSIC THEORY III MW 10:00-11:20AM, L228

Introduction to GRIP. The GRIP user interface consists of 4 parts:

You will be first asked to demonstrate regular operation with default values. You will be asked to reprogram your time values and continue operation

RTD 470 Electronic News Field Production

Just plug and go. Practical Features. Valuable Benefits

Fundamentals of DSP Chap. 1: Introduction

DEPARTMENT OF FINE ARTS COURSE OUTLINE WINTER 2016 TR 14:30-15:50, L123

Viewer-Adaptive Control of Displayed Content for Digital Signage

Journal of Field Robotics. Instructions to Authors

ORANGE COAST COLLEGE MUSIC 241 Piano 3 Course Syllabus Fall 2018

Kristen Baldridge. This year I am teaching. Principles of AV Audio/Video Production Advanced A/V Production Practicum of AV

CS 7643: Deep Learning

VBM683 Machine Learning

Board Meeting Broadcast Project Preliminary Report August 02, 2017

August version Syllabus Duke University Fall 2014 Economics 555 International Trade Professor Edward Tower

CTP 431 Music and Audio Computing. Course Introduction. Graduate School of Culture Technology (GSCT) Juhan Nam

AAAS 382R KOREAN POLITICS THROUGH CINEMA Binghamton University, Fall 2011

2 Preface. some familiarity with ordinary differential equations,

Grande Prairie Regional College. EN 3650 A3 Credit 3 (3-0-0) UT 45 Hours Early Twentieth Century British Novel

ELEC 310 Digital Signal Processing

Written Progress Report. Automated High Beam System

Multicore Design Considerations

ANNALS OF OTOLOGY, RHINOLOGY & LARYNGOLOGY

SYRACUSE UNIVERSITY 3-week On-Camera TV/Film Unit Mondays, September 17, September 24, October 1, :45 PM 10:00 PM

EE 330 Spring 2018 Integrated Electronics

Introduction to Western Music

Table of content. Table of content Introduction Concepts Hardware setup...4

Part III Conclusion Paper Checklist Use this checklist to ensure that your paper is submitted your Conclusion Paper correctly

Lecture 18: Exam Review

ENG 221 Children s Literature Winter 2018 Tentative syllabus

Thesis & Dissertation Formatting. Presented by: The Graduate School

Computer Graphics NV1 (1DT383) Computer Graphics (1TT180) Cary Laxer, Ph.D. Visiting Lecturer

Rodin Maroufi. December 17, Dr. Andrew Rawicz School of Engineering Science Simon Fraser University Burnaby, British Columbia V5A 1S6

Syllabus: PHYS 1300 Introduction to Musical Acoustics Fall 20XX

A COMPARATIVE STUDY ALGORITHM FOR NOISY IMAGE RESTORATION IN THE FIELD OF MEDICAL IMAGING

Automatic Construction of Synthetic Musical Instruments and Performers

Homework 2 Key-finding algorithm

Understanding Compression Technologies for HD and Megapixel Surveillance

A Beginner's Guide to Digital 3-D Projection: A Guide for the Not-Too-Technically Inclined by David Starkman -

Guidelines for Paper 3: Choose Your Own Adventure

MUSI 1306 Music Appreciation 3 Creative Arts MUSI 1306

Doubletalk Detection

New York University A Private University in the Public Service

THE INTERNET OF VISION ENABLED THINGS. Tom Brennan Artemis Vision

AI FOR BETTER STORYTELLING IN LIVE FOOTBALL

POLS 3045: Humor and American Politics SPRING 2017, Dr. Baumgartner Meets Tues. & Thur., 9:30-10:45, in Brewster, D-202

Course Syllabus Art Appreciation ARTS (787) /

Lab 1 Introduction to the Software Development Environment and Signal Sampling

Monday, October 29 [8:00 9:15 / 9:30 10:45] today homework due Monday, November 5

ILLINOIS VALLEY COMMUNITY COLLEGE Course Syllabus for Music 1000

Lab Assignment 2 Simulation and Image Processing

UCSC Summer Session MUSIC 11D Introduction to World Music. Class Times: TTH 1:00 4:30 pm Class Location: Music Center 138 (DARC 340 July10 21)

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval

POLS Introduction to Urban Politics

Transcription:

ECS 189G: Intro to Computer Vision March 31 st, 2015 Yong Jae Lee Assistant Professor CS, UC Davis

Plan for today Topic overview Introductions Course overview: Logistics and requirements 2

What is Computer Vision? 3

Computer Vision Enable machines to see the visual world as we do

Computer Vision Automatic understanding of images and video 1. Computing properties of the 3D world from visual data (measurement) Slide credit: Kristen Grauman 5

1. Vision for measurement Real-time stereo Structure from motion Tracking NASA Mars Rover Snavely et al. Demirdjian et al. Wang et al. Slide credit: Kristen Grauman 6

Computer Vision Automatic understanding of images and video 1. Computing properties of the 3D world from visual data (measurement) 2. Algorithms and representations to allow a machine to recognize objects, people, scenes, and activities (perception and interpretation) Slide credit: Kristen Grauman 7

2. Vision for perception, interpretation amusement park sky The Wicked Twister Cedar Point Ferris wheel ride Lake Erie ride 12 E water ride tree tree Objects Activities Scenes Locations Text / writing Faces Gestures Motions Emotions people waiting in line people sitting on ride umbrellas tree deck bench Slide credit: Kristen Grauman carousel tree pedestrians maxair 8

Computer Vision Automatic understanding of images and video 1. Computing properties of the 3D world from visual data (measurement) 2. Algorithms and representations to allow a machine to recognize objects, people, scenes, and activities. (perception and interpretation) 3. Algorithms to mine, search, and interact with visual data (search and organization) Slide credit: Kristen Grauman 9

3. Visual search, organization Query Image or video archives Relevant content Slide credit: Kristen Grauman 10

Related disciplines Graphics Image processing Artificial intelligence Computer vision Algorithms Machine learning Cognitive science Slide credit: Kristen Grauman 11

Vision and graphics Images Vision Model Graphics Inverse problems: analysis and synthesis Slide credit: Kristen Grauman 12

Why is vision difficult? 13

What humans see 14

What computers see 243 239 240 225 206 185 188 218 211 206 216 225 242 239 218 110 67 31 34 152 213 206 208 221 243 242 123 58 94 82 132 77 108 208 208 215 235 217 115 212 243 236 247 139 91 209 208 211 233 208 131 222 219 226 196 114 74 208 213 214 232 217 131 116 77 150 69 56 52 201 228 223 232 232 182 186 184 179 159 123 93 232 235 235 232 236 201 154 216 133 129 81 175 252 241 240 235 238 230 128 172 138 65 63 234 249 241 245 237 236 247 143 59 78 10 94 255 248 247 251 234 237 245 193 55 33 115 144 213 255 253 251 248 245 161 128 149 109 138 65 47 156 239 255 190 107 39 102 94 73 114 58 17 7 51 137 23 32 33 148 168 203 179 43 27 17 12 8 17 26 12 160 255 255 109 22 26 19 35 24 Slide credit: Larry Zitnick 15

Why is vision difficult? Ill-posed problem: real world much more complex than what we can measure in images 3D 2D Impossible to literally invert image formation process Slide credit: Kristen Grauman 16

Challenges: ambiguity Many different 3D scenes could have given rise to a particular 2D picture Slide credit: Svetlana Lazebnik

Challenges: many nuisance parameters Illumination Object pose Clutter Occlusions Intra-class appearance Viewpoint Slide credit: Kristen Grauman 18

Challenges: scale slide credit: Fei-Fei, Fergus, Torralba

Challenges: Motion slide credit: Svetlana Lazebnik

Challenges: occlusion, clutter Image source: National Geograph slide credit: Svetlana Lazebnik

Challenges: object intra-class variation slide credit: Fei-Fei, Fergus, Torralba

Challenges: context and human experience Slide credit: Fei-Fei, Fergus, Torralba 23

Challenges: context and human experience Fei Fei Li, Rob Fergus, Antonio Torralba

Challenges: context and human experience Fei Fei Li, Rob Fergus, Antonio Torralba

Challenges: complexity How many object categories are there? Slide credit: Fei-Fei, Fergus, Torralba Biederman 1987 26

Challenges: complexity 6 billion images 70 billion images 1 billion images served daily 10 billion images 100 hours uploaded per minute From Almost 90% of web traffic is visual! : 27

Challenges: complexity Thousands to millions of pixels in an image 30+ degrees of freedom in the pose of articulated objects (humans) About half of the cerebral cortex in primates is devoted to processing visual information [Felleman and van Essen 1991] Slide credit: Kristen Grauman 28

What works well today? 29

Optical character recognition (OCR) Digit recognition yann.lecun.com License plate readers http://en.wikipedia.org/wiki/automatic_number_plate_recognition Sudoku grabber http://sudokugrab.blogspot.com/ Automatic check processing Source: S. Seitz, N. Snavely

Biometrics Fingerprint scanners Face recognition systems

Face detection Many consumer digital cameras now detect faces Source: S. Seitz

Face detection for privacy protection slide credit: Svetlana Lazebnik

Technology gone wild slide credit: Svetlana Lazebnik

Face recognition Slide credit: Devi Parikh 35

Shotton et al. Interactive systems

Instance recognition Slide credit: Devi Parikh 37

Pedestrian detection Slide credit: Devi Parikh 38

Autonomous agents Mars rover Google self-driving car

3D reconstruction from photo collections Q. Shan, R. Adams, B. Curless, Y. Furukawa, and S. Seitz, The Visual Turing Test for Scene Reconstruction, 3DV 2013 YouTube Video slide credit: Svetlana Lazebnik

Special effects: shape capture The Matrix movies, ESC Entertainment, XYZRGB, NRC Source: S. Seitz

Special effects: motion capture Pirates of the Carribean, Industrial Light and Magic Source: S. Seitz

Medical imaging 3D imaging MRI, CT Image guided surgery Grimson et al., MIT Source: S. Seitz

Visual data in 1963 L. G. Roberts, Machine Perception of Three Dimensional Solids, Ph.D. thesis, MIT Department of Electrical Engineering, 1963. Slide credit: Kristen Grauman 44

Visual data today Understand and organize and Personal photo albums Movies, news, sports index all this data!! Surveillance and security Svetlana Lazebnik Medical and scientific images

Why vision? As image sources multiply, so do applications Relieve humans of boring, easy tasks Enhance human abilities Advance human-computer interaction, visualization Perception for robotics / autonomous agents Organize and give access to visual content Slide credit: Kristen Grauman 46

Applications Law enforcement / Surveillance Robotics Autonomous driving Medical imaging Photo organization Image search E-commerce cell phone cameras, social media, Google Glass, etc. Slide adapted from Devi Parikh 47

Summary Computer Vision is useful, interesting, and difficult A growing and exciting field Lots of cool and important applications New teams in existing companies, startups, etc. Slide adapted from Devi Parikh 48

Introductions Instructor Yong Jae Lee yjlee@cs.ucdavis.edu Assistant Professor in CS, UC Davis since July 2014 Ph.D. from UT Austin in 2012 Post-doc at CMU and UC Berkeley for 2 years Research area: Computer Vision Visual Recognition Graphics Applications 49

Introductions TAs: Vivek Dubey dubey@ucdavis.edu MS student in ECE Ahsan Abdullah abdullah@ucdavis.edu PhD student in CS 50

ECS 189G (4-units) This course Lecture: Tues & Thurs 6:10-7:30 pm, Everson Hall 176 Discussion section: Mon 2:10-3pm, Wellman Hall 2 Office hours: Academic Surge 1044 Yong Jae: Fri 4-6 pm Vivek: Mon & Wed 6-8 pm Ahsan: Tues & Thurs 4-6 pm 51

This course Course webpage https://sites.google.com/a/ucdavis.edu/ecs-189g-intro-tocomputer-vision/ SmartSite (assignment submission, grades) https://smartsite.ucdavis.edu/portal/site/ecs189g-sp2015 Piazza https://piazza.com/uc_davis/spring2015/ecs189g 52

Goals of this course Introduction to primary topics in Computer Vision Basics and fundamentals Practical experience through assignments Views of computer vision as a research area 53

Prerequisites Upper-division undergrad course Basic knowledge of probability and linear algebra Data structures, algorithms Programming experience Experience with image processing or Matlab will help but is not necessary

Topics overview Features and filters Grouping and fitting Recognition and learning Focus is on algorithms, rather than specific systems 55

Features and filters Transforming and describing images; textures, colors, edges Slide credit: Kristen Grauman 56

Grouping and fitting Clustering, segmentation, fitting; what parts belong together? Slide credit: Kristen Grauman [fig from Shi et al] 57

Recognition and learning Recognizing objects and categories, learning techniques Slide credit: Kristen Grauman 58

Additional topic (time permitting) Deep learning 59

Not covered: Multiple views and motion Multi-view geometry, stereo vision Lowe Hartley and Zisserman Fei-Fei Li Slide credit: Kristen Grauman 60

Not covered: Video processing Tracking objects, video analysis, low level motion, optical flow Tomas Izo Slide credit: Kristen Grauman 61

Textbooks By Rick Szeliski http://szeliski.org/book/ By Kristen Grauman, Bastian Leibe Visual Object Recognition 62

Requirements / Grading Problem sets (70%) Final exam (25%) comprehensive (cover all topics learned in class) Class and Piazza participation, including attendance (5%) Piazza: participation points for posting (sensible) questions and answers 63

Problem sets Some short answer concept questions Matlab programming problems Implementation Explanation, results Follow instructions; points will be deducted if we can t run your code out of the box Ask questions on Piazza first Submit to SmartSite The assignments will take significant time to do Start early TAs will go over problem set during first discussion section after release (others will be used as extra office hours) Slide adapted from Kristen Grauman 64

Matlab Built-in toolboxes for lowlevel image processing, visualization Compact programs Intuitive interactive debugging Widely used in engineering Slide credit: Kristen Grauman 65

Matlab CSIF labs 67, 71, 75 (pc33-pc60) Academic Surge 1044 and 1116 Lab schedule (reservations) and remote access info found on class website Matlab (Simulink Student Suite) can be purchased for $99 66

Problem Set 0 Matlab warmup Basic image manipulation Out Thursday, due 4/10 67

Images as matrices Digital images Slide credit: Kristen Grauman 68

Intensity : [0,255] Digital images j=1 width 520 i=1 500 height im[176][201] has value 164 im[194][203] has value 37 Slide credit: Kristen Grauman 69

Color images, RGB color space R G B Slide credit: Kristen Grauman 70

Preview of some problem sets resize: castle squished crop: castle cropped content aware resizing: seam carving Slide credit: Devi Parikh 71

Preview of some problem sets Grouping Slide credit: Kristen Grauman 72

Preview of some problem sets Object search and recognition Slide credit: Kristen Grauman 73

Problem set deadlines Problem sets due 11:59 PM Follow submission instructions given in assignment Submit to SmartSite; no hard copy submissions Deadlines are firm. We ll use SmartSite timestamp. Even 1 minute late is late. 3 total free late days for the semester Use them wisely: first couple assignments are easier than others If your program doesn t work, clean up the code, comment it well, explain what you have, and still submit. Draw our attention to this in your answer sheet. Slide adapted from Kristen Grauman, Devi Parikh 74

Collaboration policy Can discuss problem sets with peers, but all responses and code must be written individually Students submitting answers or code found to be identical or substantially similar (due to inappropriate collaboration) risk failing the course Read and follow UC Davis code of conduct Slide adapted from Kristen Grauman, Devi Parikh 75

Miscellaneous Check class website regularly for assignment files, notes, announcements, etc. Come to lecture on time No laptops, phones, tablets, etc. in class please Please interrupt with questions at any time 76

Coming up Read the class webpage carefully Next class (Thurs): lecture on linear filters PS0 out Thursday, due 4/10 77

Questions? See you Thursday!