Impact of Deep Learning
- Victor Barrett
- 5 years ago
Transcription
1 Impact of Deep Learning: Speech Recognition, Computer Vision, Recommender Systems, Language Understanding, Drug Discovery and Medical Image Analysis [Courtesy of R. Salakhutdinov]
2 Deep Belief Networks: Training [Hinton & Salakhutdinov, 2006]
3 Very Large Scale Use of DBNs [Quoc Le, et al., ICML, 2012]. Data: 10 million 200x200 unlabeled images, sampled from YouTube. Training: use 1,000 machines (16,000 cores) for 1 week. Learned network: 3 multi-stage layers, 1.15 billion parameters. Achieves 15.8% (was 9.5%) accuracy classifying 1 of 20k ImageNet items. [Figures: real images that most excite the feature; image synthesized to most excite the feature.]
4 Restricted Boltzmann Machines. Graphical Models: Powerful framework for representing dependency structure between random variables. [Figure labels: hidden variables, feature detectors; visible variables, image; pair-wise and unary terms.] RBM is a Markov Random Field with: stochastic binary visible variables, stochastic binary hidden variables, bipartite connections. Related models: Markov random fields, Boltzmann machines, log-linear models.
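The slide's formulas did not survive transcription, so the following is only a minimal sketch of the standard binary RBM, not a reproduction of the slide. It assumes the usual energy with a pair-wise term and two unary (bias) terms. The names `W`, `b`, `c`, `energy`, `p_h_given_v` and `p_v_given_h` are illustrative choices, and plain Python lists are used so the example runs without dependencies.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def energy(v, h, W, b, c):
    """Assumed standard form: E(v, h) = -sum_ij v_i W_ij h_j - sum_i b_i v_i - sum_j c_j h_j."""
    pairwise = sum(v[i] * W[i][j] * h[j]
                   for i in range(len(v)) for j in range(len(h)))
    unary = (sum(bi * vi for bi, vi in zip(b, v))
             + sum(cj * hj for cj, hj in zip(c, h)))
    return -pairwise - unary


def p_h_given_v(v, W, c):
    """P(h_j = 1 | v) for each hidden unit; factorizes because connections are bipartite."""
    return [sigmoid(c[j] + sum(v[i] * W[i][j] for i in range(len(v))))
            for j in range(len(c))]


def p_v_given_h(h, W, b):
    """P(v_i = 1 | h) for each visible unit, by the same bipartite argument."""
    return [sigmoid(b[i] + sum(W[i][j] * h[j] for j in range(len(h))))
            for i in range(len(b))]
```

Because there are no hidden-to-hidden or visible-to-visible connections, each conditional is a product of independent logistic units, which is what makes block Gibbs sampling in an RBM cheap.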
5 Model Learning. Given a set of i.i.d. training examples, we want to learn model parameters. Maximize log-likelihood objective: [equation]. Derivative of the log-likelihood: [equation]. [Figure labels: hidden units; image, visible units.]
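For a binary RBM, the gradient of the log-likelihood with respect to a weight is the data expectation of v_i h_j minus the model expectation of v_i h_j. The model expectation is intractable to compute exactly, so the sketch below swaps in one-step contrastive divergence (CD-1), a common approximation; the slide itself does not say which approximation was used. The function name `cd1_update`, the learning rate `lr`, and the list-based parameter layout are assumptions made for illustration.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def sample(probs, rng):
    """Draw independent binary states from Bernoulli probabilities."""
    return [1 if rng.random() < p else 0 for p in probs]


def hidden_probs(v, W, c):
    return [sigmoid(c[j] + sum(v[i] * W[i][j] for i in range(len(v))))
            for j in range(len(c))]


def visible_probs(h, W, b):
    return [sigmoid(b[i] + sum(W[i][j] * h[j] for j in range(len(h))))
            for i in range(len(b))]


def cd1_update(v0, W, b, c, lr, rng):
    """One CD-1 step on a single binary example; updates W, b, c in place."""
    ph0 = hidden_probs(v0, W, c)          # positive phase (data)
    h0 = sample(ph0, rng)
    v1 = sample(visible_probs(h0, W, b), rng)  # one Gibbs step
    ph1 = hidden_probs(v1, W, c)          # negative phase (reconstruction)
    for i in range(len(b)):
        for j in range(len(c)):
            W[i][j] += lr * (v0[i] * ph0[j] - v1[i] * ph1[j])
        b[i] += lr * (v0[i] - v1[i])
    for j in range(len(c)):
        c[j] += lr * (ph0[j] - ph1[j])
    return v1
```

A typical call would loop over training vectors, for example `cd1_update([1, 1, 0, 0], W, b, c, 0.1, random.Random(0))` with `W` initialized to small values and the biases to zero.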
6 Deep Boltzmann Machines. Low-level features: Edges. Built from unlabeled inputs. Input: Pixels (image). (Salakhutdinov & Hinton, Neural Computation 2012)
7 Deep Boltzmann Machines. Learn simpler representations, then compose more complex ones. Higher-level features: Combination of edges. Low-level features: Edges. Built from unlabeled inputs. Input: Pixels (image). (Salakhutdinov 2008, Salakhutdinov & Hinton 2012)
8 Model Formulation. [Figure: input layer v and hidden layers h1, h2, h3, connected by weights W1, W2, W3.] Same as RBMs: requires approximate inference to train, but it can be done and scales to millions of examples.
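The slide does not say which approximate inference scheme is meant. A common choice for DBMs is mean-field, where each hidden layer's activation probabilities are updated from the layers directly below and above it until they settle. The sketch below assumes that choice, omits bias terms for brevity, and uses hypothetical names (`mean_field`, `vecmat`, `matvec`); `W1`, `W2`, `W3` follow the figure labels, with `W1` of shape visible x h1, `W2` of shape h1 x h2, and `W3` of shape h2 x h3.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def vecmat(x, W):
    """Row vector times matrix: input from the layer below."""
    return [sum(x[i] * W[i][j] for i in range(len(x)))
            for j in range(len(W[0]))]


def matvec(W, x):
    """Matrix times column vector: input from the layer above."""
    return [sum(wij * xj for wij, xj in zip(row, x)) for row in W]


def mean_field(v, W1, W2, W3, n_iters=20):
    """Approximate P(h1, h2, h3 | v) by fully factorized probabilities mu1, mu2, mu3."""
    mu1 = [0.5] * len(W1[0])
    mu2 = [0.5] * len(W2[0])
    mu3 = [0.5] * len(W3[0])
    for _ in range(n_iters):
        # h1 receives bottom-up input from v and top-down input from h2
        mu1 = [sigmoid(lo + hi)
               for lo, hi in zip(vecmat(v, W1), matvec(W2, mu2))]
        # h2 receives bottom-up input from h1 and top-down input from h3
        mu2 = [sigmoid(lo + hi)
               for lo, hi in zip(vecmat(mu1, W2), matvec(W3, mu3))]
        # h3 is the top layer and only receives bottom-up input
        mu3 = [sigmoid(lo) for lo in vecmat(mu2, W3)]
    return mu1, mu2, mu3
```

The difference from an RBM is visible in the middle layers: their updates combine bottom-up and top-down input, so the posterior over hidden units no longer factorizes exactly and has to be iterated.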
9 Samples Generated by the Model. [Figure: training data shown alongside model-generated samples.]
10 Handwriting Recognition. Permutation-invariant version.
MNIST Dataset: 60,000 examples of 10 digits
  Learning Algorithm                          Error
  Logistic regression                         12.0%
  K-NN                                        3.09%
  Neural Net (Platt 2005)                     1.53%
  SVM (Decoste et al. 2002)                   1.40%
  Deep Autoencoder (Bengio et al. 2007)       1.40%
  Deep Belief Net (Hinton et al. 2006)        1.20%
  DBM                                         0.95%
Optical Character Recognition: 42,152 examples of 26 English letters
  Learning Algorithm                          Error
  Logistic regression                         22.14%
  K-NN                                        18.92%
  Neural Net                                  14.62%
  SVM (Larochelle et al. 2009)                9.70%
  Deep Autoencoder (Bengio et al. 2007)       10.05%
  Deep Belief Net (Larochelle et al. 2009)    9.68%
  DBM                                         8.40%
11 3-D Object Recognition. NORB Dataset: 24,000 examples
  Learning Algorithm                          Error
  Logistic regression                         22.5%
  K-NN (LeCun 2004)                           18.92%
  SVM (Bengio & LeCun 2007)                   11.6%
  Deep Belief Net (Nair & Hinton 2009)        9.0%
  DBM                                         7.2%
Pattern Completion [figure]
12 Learning Shared Representations Across Sensory Modalities. Concept: sunset, pacific ocean, baker beach, seashore, ocean
13-15 Multimodal DBM. Image pathway: Gaussian model over dense, real-valued image features. Text pathway: Replicated Softmax over word counts. (Srivastava & Salakhutdinov, NIPS 2012, JMLR 2014)
16-17 Multimodal DBM. Bottom-up + Top-down. Image pathway: Gaussian model over dense, real-valued image features. Text pathway: Replicated Softmax over word counts. (Srivastava & Salakhutdinov, NIPS 2012, JMLR 2014)
18 Text Generated from Images. For each given image, the generated tags are:
  dog, cat, pet, kitten, puppy, ginger, tongue, kitty, dogs, furry
  insect, butterfly, insects, bug, butterflies, lepidoptera
  sea, france, boat, mer, beach, river, bretagne, plage, brittany
  graffiti, streetart, stencil, sticker, urbanart, graff, sanfrancisco
  portrait, child, kid, ritratto, kids, children, boy, cute, boys, italy
  canada, nature, sunrise, ontario, fog, mist, bc, morning
19 Text Generated from Images. For each given image, the generated tags are:
  portrait, women, army, soldier, mother, postcard, soldiers
  obama, barackobama, election, politics, president, hope, change, sanfrancisco, convention, rally
  water, glass, beer, bottle, drink, wine, bubbles, splash, drops, drop
20 Images Selected from Text. For each given text query, images are retrieved:
  water, red, sunset
  nature, flower, red, green
  blue, green, yellow, colors
  chocolate, cake
21 Summary. Efficient learning algorithms for Deep Learning Models. Learning more adaptive, robust, and structured representations. [Figure panels: Learning a Category Hierarchy; Image Tagging; Text & image retrieval / Object recognition; Speech Recognition (HMM decoder); Multimodal Data; Caption Generation. Example tags: mosque, tower, building, cathedral, dome, castle; sunset, pacific ocean, beach, seashore.] Deep models improve the current state of the art in many application domains: object recognition and detection, text and image retrieval, handwritten character and speech recognition, and others. [Courtesy, R. Salakhutdinov]