Satoshi Iizuka* Edgar Simo-Serra* Hiroshi Ishikawa Waseda University. (*equal contribution)
|
|
- Stuart Hart
- 5 years ago
- Views:
Transcription
1 Satoshi Iizuka* Edgar Simo-Serra* Hiroshi Ishikawa Waseda University (*equal contribution)
2 Colorization of Black-and-white Pictures 2
3 Our Goal: Fully-automatic colorization 3
4 Colorization of Old Films 4
5 Related Work Scribble-based [Levin+ 2004; Yatziv+ 2004; An+ 2009; Xu+ 2013; Endo+ 2016] Specify colors with scribbles Require manual inputs [Levin+ 2004] Reference image-based [Chia+ 2011; Gupta+ 2012] Transfer colors of reference images Require very similar images Input Reference Output [Gupta+ 2012] 5
6 Related Work Automatic colorization with hand-crafted features [Cheng+ 2015] Uses existing multiple image features Computes chrominance via a shallow neural network Depends on the performance of semantic segmentation Only handles simple outdoor scenes Image features Input Neural Chroma Output 6
7 Contributions Novel end-to-end network that jointly learns global and local features for automatic image colorization New fusion layer that elegantly merges the global and local features Exploit classification labels for learning 7
8 Layers of Our Model Fully-connected layer All neurons are connected between layers Convolutional layer Takes into account underlying spatial structure No. of feature maps Neuron y x Fully-connected layer Convolutional layer 8
9 Our Model Mid-Level Features Fusion Layer Colorization Luminance Scaling Chrominance Upsampling Low-Level Features Global Features Two branches: local features and global features Composed of four networks 9
10 Low-Level Features Mid-Level Features Fusion Layer Colorization Luminance Scaling Shared weights Chrominance Upsampling Low-Level Features Global Features Extract low-level features such as edges and corners Lower resolution for efficient processing 10
11 Global Features Mid-Level Features Fusion Layer Colorization Luminance Scaling Shared weights Chrominance Upsampling Low-Level Features Global Features Compute a global 256-dimensional vector representation of the image 11
12 Mid-Level Features Mid-Level Features Fusion Layer Colorization Luminance Scaling Shared weights Chrominance Upsampling Low-Level Features Global Features Extract mid-level features such as texture 12
13 Fusion Layer Mid-Level Features Fusion Layer Colorization Luminance Scaling Shared weights Chrominance Upsampling Low-Level Features Global Features 13
14 Fusion Layer Combine the global features with the mid-level features The resulting features are independent of any resolution Mid-Level Features Fusion Layer y fusion u,v = σ b + W yglobal ymid u,v Global Features 14
15 Colorization Mid-Level Features Fusion Layer Colorization Luminance Scaling Shared weights Chrominance Upsampling Low-Level Features Global Features Compute chrominance from the fused features Restore the image to the input resolution 15
16 Training of Colors Mean Squared Error (MSE) as loss function Optimization using ADADELTA [Zeiler 2012] Adaptively sets a learning rate Forward Model MSE Backward Input Output Ground truth 16
17 Joint Training Mid-Level Features Fusion Layer Colorization Luminance Scaling Shared weights Chrominance Upsampling Low-Level Features Global Features Classification Training for classification jointly with the colorization Classification network connected to the global features 20.60% Formal Garden 16.13% Arch 13.50% Abbey 7.07% Botanical Garden 6.53% Golf Course Predicted labels 17
18 Dataset MIT Places Scene Dataset [Zhou+ 2014] 2.3 million training images with 205 scene labels pixels Abbey Airport terminal Aquarium Baseball field Dining room Forest road Gas station Gift shop 18
19
20 Computational Time Colorize within a few seconds 80ms 20
21 Colorization of MIT Places Dataset 21
22 Comparisons Input [Cheng+ 2015] Ours Ours (w/o global features) (w/ global features) 22
23 Effectiveness of Global Features Input w/o global features w/ global features 23
24 User Study 10 users participated We show 500 images of each type: total 1,500 images per user 90% of our results are considered natural Natural Unnatural 24
25 Colorization of Historical Photographs Mount Moran, 1941 Scott's Run, 1937 Youngsters, 1912 Burns Basement,
26 Style Transfer Low-Level Features 26
27 Style Transfer Low-Level Features 27
28 Style Transfer Adapting the colorization of one image to the style of another Local Global Local Global Local Global Inputs Output 28
29 Limitations Difficult to output colorful images Cannot restore exact colors Input Ground truth Output Input Ground truth Output 29
30 Conclusion Novel approach for image colorization by fusing global and local information Fusion layer Joint training of colorization and classification Style transfer Farm Land, 1933 California National Park, 1936 Homes, 1936 Spinners, 1910 Doffer Boys,
31 Thank you! Project Page Code on GitHub! Community Center, 1936 North Dome, 1936 Norris Dam, 1933 Miner,
arxiv: v1 [cs.cv] 27 Jan 2018
INTERACTIVE DEEP COLORIZATION WITH SIMULTANEOUS GLOBAL AND LOCAL INPUTS Yi Xiao 1, Peiyao Zhou 1, Yan Zheng 2 arxiv:1801.09083v1 [cs.cv] 27 Jan 2018 1 College of Computer Science and Electronic Engineering
More informationAn Introduction to Deep Image Aesthetics
Seminar in Laboratory of Visual Intelligence and Pattern Analysis (VIPA) An Introduction to Deep Image Aesthetics Yongcheng Jing College of Computer Science and Technology Zhejiang University Zhenchuan
More informationModule 1: Digital Video Signal Processing Lecture 5: Color coordinates and chromonance subsampling. The Lecture Contains:
The Lecture Contains: ITU-R BT.601 Digital Video Standard Chrominance (Chroma) Subsampling Video Quality Measures file:///d /...rse%20(ganesh%20rana)/my%20course_ganesh%20rana/prof.%20sumana%20gupta/final%20dvsp/lecture5/5_1.htm[12/30/2015
More informationResearch Article. ISSN (Print) *Corresponding author Shireen Fathima
Scholars Journal of Engineering and Technology (SJET) Sch. J. Eng. Tech., 2014; 2(4C):613-620 Scholars Academic and Scientific Publisher (An International Publisher for Academic and Scientific Resources)
More informationLEARNING AUDIO SHEET MUSIC CORRESPONDENCES. Matthias Dorfer Department of Computational Perception
LEARNING AUDIO SHEET MUSIC CORRESPONDENCES Matthias Dorfer Department of Computational Perception Short Introduction... I am a PhD Candidate in the Department of Computational Perception at Johannes Kepler
More informationDeep Neural Networks Scanning for patterns (aka convolutional networks) Bhiksha Raj
Deep Neural Networks Scanning for patterns (aka convolutional networks) Bhiksha Raj 1 Story so far MLPs are universal function approximators Boolean functions, classifiers, and regressions MLPs can be
More informationAdvanced Computer Networks
Advanced Computer Networks Video Basics Jianping Pan Spring 2017 3/10/17 csc466/579 1 Video is a sequence of images Recorded/displayed at a certain rate Types of video signals component video separate
More informationAutomatic Music Genre Classification
Automatic Music Genre Classification Nathan YongHoon Kwon, SUNY Binghamton Ingrid Tchakoua, Jackson State University Matthew Pietrosanu, University of Alberta Freya Fu, Colorado State University Yue Wang,
More informationOptimized Color Based Compression
Optimized Color Based Compression 1 K.P.SONIA FENCY, 2 C.FELSY 1 PG Student, Department Of Computer Science Ponjesly College Of Engineering Nagercoil,Tamilnadu, India 2 Asst. Professor, Department Of Computer
More informationFirst Step Towards Enhancing Word Embeddings with Pitch Accents for DNN-based Slot Filling on Recognized Text
First Step Towards Enhancing Word Embeddings with Pitch Accents for DNN-based Slot Filling on Recognized Text Sabrina Stehwien, Ngoc Thang Vu IMS, University of Stuttgart March 16, 2017 Slot Filling sequential
More informationDigital Media. Lecture 10: Video & Compression. Georgia Gwinnett College School of Science and Technology Modified from those of Dr.
Digital Media Lecture 10: Video & Compression Georgia Gwinnett College School of Science and Technology Modified from those of Dr. Jim Rowan Coping with Video Size Consider human vision limitations 1)
More informationAn Image Compression Technique Based on the Novel Approach of Colorization Based Coding
An Image Compression Technique Based on the Novel Approach of Colorization Based Coding Shireen Fathima 1, E Kavitha 2 PG Student [M.Tech in Electronics], Dept. of ECE, HKBK College of Engineering, Bangalore,
More informationSUMMIT LAW GROUP PLLC 315 FIFTH AVENUE SOUTH, SUITE 1000 SEATTLE, WASHINGTON Telephone: (206) Fax: (206)
Case 2:10-cv-01823-JLR Document 154 Filed 01/06/12 Page 1 of 153 1 The Honorable James L. Robart 2 3 4 5 6 7 UNITED STATES DISTRICT COURT FOR THE WESTERN DISTRICT OF WASHINGTON AT SEATTLE 8 9 10 11 12
More informationITU-T Video Coding Standards
An Overview of H.263 and H.263+ Thanks that Some slides come from Sharp Labs of America, Dr. Shawmin Lei January 1999 1 ITU-T Video Coding Standards H.261: for ISDN H.263: for PSTN (very low bit rate video)
More informationSupplementary Material for Video Propagation Networks
Supplementary Material for Video Propagation Networks Varun Jampani 1, Raghudeep Gadde 1,2 and Peter V. Gehler 1,2 1 Max Planck Institute for Intelligent Systems, Tübingen, Germany 2 Bernstein Center for
More informationInto the Depths: The Technical Details Behind AV1. Nathan Egge Mile High Video Workshop 2018 July 31, 2018
Into the Depths: The Technical Details Behind AV1 Nathan Egge Mile High Video Workshop 2018 July 31, 2018 North America Internet Traffic 82% of Internet traffic by 2021 Cisco Study
More informationIntra-frame JPEG-2000 vs. Inter-frame Compression Comparison: The benefits and trade-offs for very high quality, high resolution sequences
Intra-frame JPEG-2000 vs. Inter-frame Compression Comparison: The benefits and trade-offs for very high quality, high resolution sequences Michael Smith and John Villasenor For the past several decades,
More informationThe H.26L Video Coding Project
The H.26L Video Coding Project New ITU-T Q.6/SG16 (VCEG - Video Coding Experts Group) standardization activity for video compression August 1999: 1 st test model (TML-1) December 2001: 10 th test model
More informationAUDIOVISUAL COMMUNICATION
AUDIOVISUAL COMMUNICATION Laboratory Session: Recommendation ITU-T H.261 Fernando Pereira The objective of this lab session about Recommendation ITU-T H.261 is to get the students familiar with many aspects
More informationLSTM Neural Style Transfer in Music Using Computational Musicology
LSTM Neural Style Transfer in Music Using Computational Musicology Jett Oristaglio Dartmouth College, June 4 2017 1. Introduction In the 2016 paper A Neural Algorithm of Artistic Style, Gatys et al. discovered
More informationhttps://mediasolutions.ericsson.com/cms/wpcontent/uploads/2017/10/ibc pdf Why CbCr?
Disclaimers: Credit for images is given where possible, apologies for any omissions The optical demonstrations slides may not work on the target monitor / projector The HDR images have been tonemapped
More informationVideo 1 Video October 16, 2001
Video Video October 6, Video Event-based programs read() is blocking server only works with single socket audio, network input need I/O multiplexing event-based programming also need to handle time-outs,
More informationDeepID: Deep Learning for Face Recognition. Department of Electronic Engineering,
DeepID: Deep Learning for Face Recognition Xiaogang Wang Department of Electronic Engineering, The Chinese University i of Hong Kong Machine Learning with Big Data Machine learning with small data: overfitting,
More informationModule 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur
Module 8 VIDEO CODING STANDARDS Lesson 27 H.264 standard Lesson Objectives At the end of this lesson, the students should be able to: 1. State the broad objectives of the H.264 standard. 2. List the improved
More informationJoint Image and Text Representation for Aesthetics Analysis
Joint Image and Text Representation for Aesthetics Analysis Ye Zhou 1, Xin Lu 2, Junping Zhang 1, James Z. Wang 3 1 Fudan University, China 2 Adobe Systems Inc., USA 3 The Pennsylvania State University,
More informationConvolutional Neural Network-Based Block Up-sampling for Intra Frame Coding
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 1 Convolutional Neural Network-Based Block Up-sampling for Intra Frame Coding Yue Li, Dong Liu, Member, IEEE, Houqiang Li, Senior Member,
More informationCS 7643: Deep Learning
CS 7643: Deep Learning Topics: Stride, padding Pooling layers Fully-connected layers as convolutions Backprop in conv layers Dhruv Batra Georgia Tech Invited Talks Sumit Chopra on CNNs for Pixel Labeling
More informationMUSI-6201 Computational Music Analysis
MUSI-6201 Computational Music Analysis Part 9.1: Genre Classification alexander lerch November 4, 2015 temporal analysis overview text book Chapter 8: Musical Genre, Similarity, and Mood (pp. 151 155)
More informationTelevision History. Date / Place E. Nemer - 1
Television History Television to see from a distance Earlier Selenium photosensitive cells were used for converting light from pictures into electrical signals Real breakthrough invention of CRT AT&T Bell
More informationExperimenting with Musically Motivated Convolutional Neural Networks
Experimenting with Musically Motivated Convolutional Neural Networks Jordi Pons 1, Thomas Lidy 2 and Xavier Serra 1 1 Music Technology Group, Universitat Pompeu Fabra, Barcelona 2 Institute of Software
More informationChapter 10 Basic Video Compression Techniques
Chapter 10 Basic Video Compression Techniques 10.1 Introduction to Video compression 10.2 Video Compression with Motion Compensation 10.3 Video compression standard H.261 10.4 Video compression standard
More informationA Novel Macroblock-Level Filtering Upsampling Architecture for H.264/AVC Scalable Extension
05-Silva-AF:05-Silva-AF 8/19/11 6:18 AM Page 43 A Novel Macroblock-Level Filtering Upsampling Architecture for H.264/AVC Scalable Extension T. L. da Silva 1, L. A. S. Cruz 2, and L. V. Agostini 3 1 Telecommunications
More informationReconfigurable Neural Net Chip with 32K Connections
Reconfigurable Neural Net Chip with 32K Connections H.P. Graf, R. Janow, D. Henderson, and R. Lee AT&T Bell Laboratories, Room 4G320, Holmdel, NJ 07733 Abstract We describe a CMOS neural net chip with
More informationDeep Jammer: A Music Generation Model
Deep Jammer: A Music Generation Model Justin Svegliato and Sam Witty College of Information and Computer Sciences University of Massachusetts Amherst, MA 01003, USA {jsvegliato,switty}@cs.umass.edu Abstract
More informationImage Steganalysis: Challenges
Image Steganalysis: Challenges Jiwu Huang,China BUCHAREST 2017 Acknowledgement Members in my team Dr. Weiqi Luo and Dr. Fangjun Huang Sun Yat-sen Univ., China Dr. Bin Li and Dr. Shunquan Tan, Mr. Jishen
More informationImageNet Auto-Annotation with Segmentation Propagation
ImageNet Auto-Annotation with Segmentation Propagation Matthieu Guillaumin Daniel Küttel Vittorio Ferrari Bryan Anenberg & Michela Meister Outline Goal & Motivation System Overview Segmentation Transfer
More informationTRAFFIC SURVEILLANCE VIDEO MANAGEMENT SYSTEM
TRAFFIC SURVEILLANCE VIDEO MANAGEMENT SYSTEM K.Ganesan*, Kavitha.C, Kriti Tandon, Lakshmipriya.R TIFAC-Centre of Relevance and Excellence in Automotive Infotronics*, School of Information Technology and
More informationMPEG-2. ISO/IEC (or ITU-T H.262)
1 ISO/IEC 13818-2 (or ITU-T H.262) High quality encoding of interlaced video at 4-15 Mbps for digital video broadcast TV and digital storage media Applications Broadcast TV, Satellite TV, CATV, HDTV, video
More informationEfficient Implementation of Neural Network Deinterlacing
Efficient Implementation of Neural Network Deinterlacing Guiwon Seo, Hyunsoo Choi and Chulhee Lee Dept. Electrical and Electronic Engineering, Yonsei University 34 Shinchon-dong Seodeamun-gu, Seoul -749,
More information(12) Patent Application Publication (10) Pub. No.: US 2006/ A1
(19) United States US 20060222067A1 (12) Patent Application Publication (10) Pub. No.: US 2006/0222067 A1 Park et al. (43) Pub. Date: (54) METHOD FOR SCALABLY ENCODING AND DECODNG VIDEO SIGNAL (75) Inventors:
More informationChapter 2 Introduction to
Chapter 2 Introduction to H.264/AVC H.264/AVC [1] is the newest video coding standard of the ITU-T Video Coding Experts Group (VCEG) and the ISO/IEC Moving Picture Experts Group (MPEG). The main improvements
More informationReconstruction of Ca 2+ dynamics from low frame rate Ca 2+ imaging data CS229 final project. Submitted by: Limor Bursztyn
Reconstruction of Ca 2+ dynamics from low frame rate Ca 2+ imaging data CS229 final project. Submitted by: Limor Bursztyn Introduction Active neurons communicate by action potential firing (spikes), accompanied
More informationColor Image Compression Using Colorization Based On Coding Technique
Color Image Compression Using Colorization Based On Coding Technique D.P.Kawade 1, Prof. S.N.Rawat 2 1,2 Department of Electronics and Telecommunication, Bhivarabai Sawant Institute of Technology and Research
More informationDetecting Musical Key with Supervised Learning
Detecting Musical Key with Supervised Learning Robert Mahieu Department of Electrical Engineering Stanford University rmahieu@stanford.edu Abstract This paper proposes and tests performance of two different
More informationTable of content. Table of content Introduction Concepts Hardware setup...4
Table of content Table of content... 1 Introduction... 2 1. Concepts...3 2. Hardware setup...4 2.1. ArtNet, Nodes and Switches...4 2.2. e:cue butlers...5 2.3. Computer...5 3. Installation...6 4. LED Mapper
More informationPrinciples of Video Compression
Principles of Video Compression Topics today Introduction Temporal Redundancy Reduction Coding for Video Conferencing (H.261, H.263) (CSIT 410) 2 Introduction Reduce video bit rates while maintaining an
More informationThe H.263+ Video Coding Standard: Complexity and Performance
The H.263+ Video Coding Standard: Complexity and Performance Berna Erol (bernae@ee.ubc.ca), Michael Gallant (mikeg@ee.ubc.ca), Guy C t (guyc@ee.ubc.ca), and Faouzi Kossentini (faouzi@ee.ubc.ca) Department
More informationAudio Structure Analysis
Lecture Music Processing Audio Structure Analysis Meinard Müller International Audio Laboratories Erlangen meinard.mueller@audiolabs-erlangen.de Music Structure Analysis Music segmentation pitch content
More informationMultimedia Communications. Image and Video compression
Multimedia Communications Image and Video compression JPEG2000 JPEG2000: is based on wavelet decomposition two types of wavelet filters one similar to what discussed in Chapter 14 and the other one generates
More informationDeep learning for music data processing
Deep learning for music data processing A personal (re)view of the state-of-the-art Jordi Pons www.jordipons.me Music Technology Group, DTIC, Universitat Pompeu Fabra, Barcelona. 31st January 2017 Jordi
More informationChapter 2. Advanced Telecommunications and Signal Processing Program. E. Galarza, Raynard O. Hinds, Eric C. Reed, Lon E. Sun-
Chapter 2. Advanced Telecommunications and Signal Processing Program Academic and Research Staff Professor Jae S. Lim Visiting Scientists and Research Affiliates M. Carlos Kennedy Graduate Students John
More informationVideo Compression. Representations. Multimedia Systems and Applications. Analog Video Representations. Digitizing. Digital Video Block Structure
Representations Multimedia Systems and Applications Video Compression Composite NTSC - 6MHz (4.2MHz video), 29.97 frames/second PAL - 6-8MHz (4.2-6MHz video), 50 frames/second Component Separation video
More informationGlobal Trade Medical Supplies
Features: Achieve Clarity True to the Source Data A medical monitor needs to be capable of high brightness in order to meet performance standards. However, in order to achieve high brightness in an LCD
More informationarxiv: v1 [cs.sd] 5 Apr 2017
REVISITING THE PROBLEM OF AUDIO-BASED HIT SONG PREDICTION USING CONVOLUTIONAL NEURAL NETWORKS Li-Chia Yang, Szu-Yu Chou, Jen-Yu Liu, Yi-Hsuan Yang, Yi-An Chen Research Center for Information Technology
More informationLecture 1: Introduction & Image and Video Coding Techniques (I)
Lecture 1: Introduction & Image and Video Coding Techniques (I) Dr. Reji Mathew Reji@unsw.edu.au School of EE&T UNSW A/Prof. Jian Zhang NICTA & CSE UNSW jzhang@cse.unsw.edu.au COMP9519 Multimedia Systems
More informationVisual Communication at Limited Colour Display Capability
Visual Communication at Limited Colour Display Capability Yan Lu, Wen Gao and Feng Wu Abstract: A novel scheme for visual communication by means of mobile devices with limited colour display capability
More informationImproved Performance For Color To Gray And Back Using Walsh, Hartley And Kekre Wavelet Transform With Various Color Spaces
International Journal Of Engineering Research And Development e-issn: 2278-067X, p-issn: 2278-800X, www.ijerd.com Volume 13, Issue 11 (November 2017), PP.22-34 Improved Performance For Color To Gray And
More informationDELTA MODULATION AND DPCM CODING OF COLOR SIGNALS
DELTA MODULATION AND DPCM CODING OF COLOR SIGNALS Item Type text; Proceedings Authors Habibi, A. Publisher International Foundation for Telemetering Journal International Telemetering Conference Proceedings
More informationChrominance Subsampling in Digital Images
Chrominance Subsampling in Digital Images Douglas A. Kerr Issue 2 December 3, 2009 ABSTRACT The JPEG and TIFF digital still image formats, along with various digital video formats, have provision for recording
More informationInformation Transmission Chapter 3, image and video
Information Transmission Chapter 3, image and video FREDRIK TUFVESSON ELECTRICAL AND INFORMATION TECHNOLOGY Images An image is a two-dimensional array of light values. Make it 1D by scanning Smallest element
More informationAnalysing Musical Pieces Using harmony-analyser.org Tools
Analysing Musical Pieces Using harmony-analyser.org Tools Ladislav Maršík Dept. of Software Engineering, Faculty of Mathematics and Physics Charles University, Malostranské nám. 25, 118 00 Prague 1, Czech
More informationVideo Processing Applications Image and Video Processing Dr. Anil Kokaram
Video Processing Applications Image and Video Processing Dr. Anil Kokaram anil.kokaram@tcd.ie This section covers applications of video processing as follows Motion Adaptive video processing for noise
More information(12) United States Patent (10) Patent No.: US 6,867,549 B2. Cok et al. (45) Date of Patent: Mar. 15, 2005
USOO6867549B2 (12) United States Patent (10) Patent No.: Cok et al. (45) Date of Patent: Mar. 15, 2005 (54) COLOR OLED DISPLAY HAVING 2003/O128225 A1 7/2003 Credelle et al.... 345/694 REPEATED PATTERNS
More informationOptimizing Digital Transfer of U-matic Video Recordings Leo Backman/DigiOmmel & Co.
Optimizing Digital Transfer of U-matic Video Recordings Leo Backman/DigiOmmel & Co. In this paper, a method of U-matic VCR video output signal extraction is investigated with the aim of improving the replay
More informationOverview E248W-19203R. LED Panel. Features
E248W-19203R Overview Whether you are a gamer or a movie buff, the Sceptre E248W-19203R monitor is the way to go. 1080P resolution (1920 x 1080 Pixels) delivers stunning color and picture detail on a 22
More informationMotion Video Compression
7 Motion Video Compression 7.1 Motion video Motion video contains massive amounts of redundant information. This is because each image has redundant information and also because there are very few changes
More informationError concealment techniques in H.264 video transmission over wireless networks
Error concealment techniques in H.264 video transmission over wireless networks M U L T I M E D I A P R O C E S S I N G ( E E 5 3 5 9 ) S P R I N G 2 0 1 1 D R. K. R. R A O F I N A L R E P O R T Murtaza
More informationImprovement of MPEG-2 Compression by Position-Dependent Encoding
Improvement of MPEG-2 Compression by Position-Dependent Encoding by Eric Reed B.S., Electrical Engineering Drexel University, 1994 Submitted to the Department of Electrical Engineering and Computer Science
More informationCompressed-Sensing-Enabled Video Streaming for Wireless Multimedia Sensor Networks Abstract:
Compressed-Sensing-Enabled Video Streaming for Wireless Multimedia Sensor Networks Abstract: This article1 presents the design of a networked system for joint compression, rate control and error correction
More informationAbout Giovanni De Poli. What is Model. Introduction. di Poli: Methodologies for Expressive Modeling of/for Music Performance
Methodologies for Expressiveness Modeling of and for Music Performance by Giovanni De Poli Center of Computational Sonology, Department of Information Engineering, University of Padova, Padova, Italy About
More informationChord Classification of an Audio Signal using Artificial Neural Network
Chord Classification of an Audio Signal using Artificial Neural Network Ronesh Shrestha Student, Department of Electrical and Electronic Engineering, Kathmandu University, Dhulikhel, Nepal ---------------------------------------------------------------------***---------------------------------------------------------------------
More informationOverview: Video Coding Standards
Overview: Video Coding Standards Video coding standards: applications and common structure ITU-T Rec. H.261 ISO/IEC MPEG-1 ISO/IEC MPEG-2 State-of-the-art: H.264/AVC Video Coding Standards no. 1 Applications
More informationVERY low bit-rate video coding has triggered intensive. Significance-Linked Connected Component Analysis for Very Low Bit-Rate Wavelet Video Coding
630 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 9, NO. 4, JUNE 1999 Significance-Linked Connected Component Analysis for Very Low Bit-Rate Wavelet Video Coding Jozsef Vass, Student
More information17 October About H.265/HEVC. Things you should know about the new encoding.
17 October 2014 About H.265/HEVC. Things you should know about the new encoding Axis view on H.265/HEVC > Axis wants to see appropriate performance improvement in the H.265 technology before start rolling
More informationCS229 Project Report Polyphonic Piano Transcription
CS229 Project Report Polyphonic Piano Transcription Mohammad Sadegh Ebrahimi Stanford University Jean-Baptiste Boin Stanford University sadegh@stanford.edu jbboin@stanford.edu 1. Introduction In this project
More informationAn Overview of Video Coding Algorithms
An Overview of Video Coding Algorithms Prof. Ja-Ling Wu Department of Computer Science and Information Engineering National Taiwan University Video coding can be viewed as image compression with a temporal
More informationModule 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur
Module 8 VIDEO CODING STANDARDS Lesson 24 MPEG-2 Standards Lesson Objectives At the end of this lesson, the students should be able to: 1. State the basic objectives of MPEG-2 standard. 2. Enlist the profiles
More informationDeep Aesthetic Quality Assessment with Semantic Information
1 Deep Aesthetic Quality Assessment with Semantic Information Yueying Kao, Ran He, Kaiqi Huang arxiv:1604.04970v3 [cs.cv] 21 Oct 2016 Abstract Human beings often assess the aesthetic quality of an image
More informationPredicting Aesthetic Radar Map Using a Hierarchical Multi-task Network
Predicting Aesthetic Radar Map Using a Hierarchical Multi-task Network Xin Jin 1,2,LeWu 1, Xinghui Zhou 1, Geng Zhao 1, Xiaokun Zhang 1, Xiaodong Li 1, and Shiming Ge 3(B) 1 Department of Cyber Security,
More informationMPEG has been established as an international standard
1100 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 9, NO. 7, OCTOBER 1999 Fast Extraction of Spatially Reduced Image Sequences from MPEG-2 Compressed Video Junehwa Song, Member,
More informationProcessing. Electrical Engineering, Department. IIT Kanpur. NPTEL Online - IIT Kanpur
NPTEL Online - IIT Kanpur Course Name Department Instructor : Digital Video Signal Processing Electrical Engineering, : IIT Kanpur : Prof. Sumana Gupta file:///d /...e%20(ganesh%20rana)/my%20course_ganesh%20rana/prof.%20sumana%20gupta/final%20dvsp/lecture1/main.htm[12/31/2015
More informationCOMP 249 Advanced Distributed Systems Multimedia Networking. Video Compression Standards
COMP 9 Advanced Distributed Systems Multimedia Networking Video Compression Standards Kevin Jeffay Department of Computer Science University of North Carolina at Chapel Hill jeffay@cs.unc.edu September,
More informationMultimedia Communications. Video compression
Multimedia Communications Video compression Video compression Of all the different sources of data, video produces the largest amount of data There are some differences in our perception with regard to
More informationNoise (Music) Composition Using Classification Algorithms Peter Wang (pwang01) December 15, 2017
Noise (Music) Composition Using Classification Algorithms Peter Wang (pwang01) December 15, 2017 Background Abstract I attempted a solution at using machine learning to compose music given a large corpus
More informationTechnical Specifications
1 Contents INTRODUCTION...3 ABOUT THIS LAB...3 IMPORTANCE OF THE MODULE...3 APPLYING IMAGE ENHANCEMENTS...4 Adjusting Toolbar Enhancement...4 EDITING A LOOKUP TABLE...5 Trace-editing the LUT...6 Comparing
More informationA CLASSIFICATION-BASED POLYPHONIC PIANO TRANSCRIPTION APPROACH USING LEARNED FEATURE REPRESENTATIONS
12th International Society for Music Information Retrieval Conference (ISMIR 2011) A CLASSIFICATION-BASED POLYPHONIC PIANO TRANSCRIPTION APPROACH USING LEARNED FEATURE REPRESENTATIONS Juhan Nam Stanford
More informationResearch on Color Reproduction Characteristics of Mobile Terminals
Applied Mechanics and Materials Submitted: 2014-09-14 ISSN: 1662-7482, Vol. 731, pp 80-86 Accepted: 2014-11-19 doi:10.4028/www.scientific.net/amm.731.80 Online: 2015-01-29 2015 Trans Tech Publications,
More informationFrame Processing Time Deviations in Video Processors
Tensilica White Paper Frame Processing Time Deviations in Video Processors May, 2008 1 Executive Summary Chips are increasingly made with processor designs licensed as semiconductor IP (intellectual property).
More informationMicro-DCI 53ML5100 Manual Loader
Micro-DCI 53ML5100 Manual Loader Two process variable inputs Two manually controlled current outputs Multiple Display Formats: Dual Channel Manual Loader, Single Channel Manual Loader, Manual Loader with
More informationIEEE Santa Clara ComSoc/CAS Weekend Workshop Event-based analog sensing
IEEE Santa Clara ComSoc/CAS Weekend Workshop Event-based analog sensing Theodore Yu theodore.yu@ti.com Texas Instruments Kilby Labs, Silicon Valley Labs September 29, 2012 1 Living in an analog world The
More informationAutomatic Piano Music Transcription
Automatic Piano Music Transcription Jianyu Fan Qiuhan Wang Xin Li Jianyu.Fan.Gr@dartmouth.edu Qiuhan.Wang.Gr@dartmouth.edu Xi.Li.Gr@dartmouth.edu 1. Introduction Writing down the score while listening
More informationStereo Super-resolution via a Deep Convolutional Network
Stereo Super-resolution via a Deep Convolutional Network Junxuan Li 1 Shaodi You 1,2 Antonio Robles-Kelly 1,2 1 College of Eng. and Comp. Sci., The Australian National University, Canberra ACT 0200, Australia
More informationOVE EDFORS ELECTRICAL AND INFORMATION TECHNOLOGY
Information Transmission Chapter 3, image and video OVE EDFORS ELECTRICAL AND INFORMATION TECHNOLOGY Learning outcomes Understanding raster image formats and what determines quality, video formats and
More informationChapter 2 Video Coding Standards and Video Formats
Chapter 2 Video Coding Standards and Video Formats Abstract Video formats, conversions among RGB, Y, Cb, Cr, and YUV are presented. These are basically continuation from Chap. 1 and thus complement the
More informationLecture 2 Video Formation and Representation
2013 Spring Term 1 Lecture 2 Video Formation and Representation Wen-Hsiao Peng ( 彭文孝 ) Multimedia Architecture and Processing Lab (MAPL) Department of Computer Science National Chiao Tung University 1
More informationRECOMMENDATION ITU-R BT.1203 *
Rec. TU-R BT.1203 1 RECOMMENDATON TU-R BT.1203 * User requirements for generic bit-rate reduction coding of digital TV signals (, and ) for an end-to-end television system (1995) The TU Radiocommunication
More informationReal-Time Spectrogram (RTS tm )
Real-Time Spectrogram (RTS tm ) View, edit and measure digital sound files The Real-Time Spectrogram (RTS tm ) displays the time-aligned spectrogram and waveform of a continuous sound file. The RTS can
More informationSupplementary material for Inverting Visual Representations with Convolutional Networks
Supplementary material for Inverting Visual Representations with Convolutional Networks Alexey Dosovitskiy Thomas Brox University of Freiburg Freiburg im Breisgau, Germany {dosovits,brox}@cs.uni-freiburg.de
More informationImage-to-Markup Generation with Coarse-to-Fine Attention
Image-to-Markup Generation with Coarse-to-Fine Attention Presenter: Ceyer Wakilpoor Yuntian Deng 1 Anssi Kanervisto 2 Alexander M. Rush 1 Harvard University 3 University of Eastern Finland ICML, 2017 Yuntian
More informationDCI Requirements Image - Dynamics
DCI Requirements Image - Dynamics Matt Cowan Entertainment Technology Consultants www.etconsult.com Gamma 2.6 12 bit Luminance Coding Black level coding Post Production Implications Measurement Processes
More information