Vadim V. Romanuke * (Professor, Polish Naval Academy, Gdynia, Poland)


Electrical, Control and Communication Engineering, 2018, vol. 14, no. 1, pp. 51–57
doi: 10.2478/ecce

An Attempt of Finding an Appropriate Number of Convolutional Layers in CNNs Based on Benchmarks of Heterogeneous Datasets

Vadim V. Romanuke* (Professor, Polish Naval Academy, Gdynia, Poland)

Abstract. An attempt of finding an appropriate number of convolutional layers in convolutional neural networks is made. The benchmark datasets are CIFAR-10, NORB, and EEACL26, whose diversity and heterogeneousness must serve for a general applicability of a rule presumed to yield that number. The rule is drawn from the best performances of convolutional neural networks built with 2 to 12 convolutional layers. It is not an exact best number of convolutional layers but the result of a short process of trying a few versions of such numbers. For small images (like those in CIFAR-10), the initial number is 4. For datasets that have a few tens of image categories and more, initially setting five to eight convolutional layers is recommended depending on the complexity of the dataset. The fuzziness in the rule is not removable because of the required diversity and heterogeneousness.

Keywords: Convolutional neural networks; Convolutional layers; Error rate; Hyperparameters; Performance.

I. THE PROBLEM OF AN APPROPRIATE NUMBER OF CONVOLUTIONAL LAYERS

In machine learning for image recognition, the convolutional layer (ConvL) is the core building block of a convolutional neural network (CNN). A ConvL is a set of learnable filters which actually are three-dimensional matrices, to which a bias vector is attached [], [2]. The parameters of a ConvL, called hyperparameters, are as follows [2], [3]:
1. Height F_height of the filter (size along the vertical axis). Integer F_height must be positive.
2. Width F_width of the filter (the horizontal axis). Integer F_width must be positive, and commonly F_width = F_height [4], [5].
3. Depth K of the filter. The depth of the filter of the first ConvL is equal to the number of color channels in the input image.
The depth of the filter of a subsequent ConvL is equal to the number of filters of the antecedent ConvL [6].
4. Stride s. Integer s must be positive for controlling how depth columns are allocated around the spatial dimensions (width and height). Often s = 1, so then a new depth column of neurons is allocated to spatial positions only one spatial unit apart [7].
5. Zero-padding p. Integer p must be non-negative for preserving exactly the spatial size of the output volumes [2], [5], [].

All these hyperparameters are set by rules of thumb [2], [7]. Moreover, when CNN architecture is built, the number of ConvLs N (a positive integer) is set just by experience. Thus, setting the integer N appropriately is an open issue. Answering this question can significantly improve performance.

II. BACKGROUND AND MOTIVATION

It is believed that the complexity of an image recognition problem is associated with the number of ConvLs. The complexity of image recognition problems issues from the number of image categories, the number of features (dimensionality), the influence of color, the influence of chrominance, and diversities in images labelled as belonging to the same category [9], [10]. The more complex problems may naïvely need a greater N. This has, however, not been proved yet. Moreover, it is unknown whether this is provable or not [].

Unlike its hyperparameters, the number of ConvLs is not limited from above [], [2], [6], [7]. If the hyperparameters are selected appropriately, N should be varied starting from 2 up to some integer at which the effectiveness of CNNs becomes less than at the preceding integer. The effectiveness means performance and operation speed (computational rate) [], [2], [5], [10], [2], [3]. Obviously, the computational rate decreases at least slightly as N increases, so this is a constraint preventing the assigning of a great N [6], [7], [9]. For instance, the position of the runner-up in ILSVRC 2014 was taken by the CNN that became known as VGGNet [], [4] containing 16 ConvLs.
A downside of VGGNet is that it is very expensive to evaluate and uses much more memory and parameters (a MATLAB .mat file of VGGNet has the size of about […] GB). But if some ConvLs nearest to the VGGNet output layer are removed, the performance is still the same and the number of necessary parameters is significantly reduced [], [5], [6].

III. A GOAL FOR FINDING A RULE OF APPROPRIATELY SETTING THE NUMBER OF CONVLS

The goal is to find a rule for appropriately setting the integer N regarding the number of image categories and the dimensionality of an image recognition problem. In other words, once a problem is given with its number of image categories and image size, the rule must yield a certain integer N or a few versions of this number. In the worst case, an integer interval for an appropriate number of ConvLs should be formed. For stating the rule, four tasks need to be accomplished:
1. To form a variety of image recognition problems for benchmarking.
2. To test the problems on an admissible interval of integers N.
3. To establish the correspondence of the best performance to N.
4. To formalise the correspondence as a rule.

The rule will allow rationally constructing a pivot of CNNs which is a sequence of ConvLs. Having the pivot, the remaining parts of the CNNs (pooling layers, ReLUs, DropOut layers, normalisation layers) are allocated easier. This would be a profound contribution to the theory of CNNs for making image recognition more effective.

* romanukevadimv@gmail.com
© 2018 Vadim V. Romanuke. This is an open access article licensed under the Creative Commons Attribution License, in the manner agreed with Sciendo.

IV. IMAGE RECOGNITION PROBLEMS FOR BENCHMARKING

The rule is expected to be generally acceptable for a wide range of image recognition problems. That is, it must be generalisable. To prevent overfitting (this is a meta-overfitting to a group of problems, an extension of the common overfitting to training sets), the benchmark problems should be dissimilar. Thus, the datasets with their entries should satisfy a requirement of dissimilarity in the following: 1) the number of image categories; 2) the number of color channels; 3) the initial image size; 4) the origination of the image content; 5) the types of objects to be recognised. These five dissimilarities ensure diversity and heterogeneousness of the problems.

However, this is not sufficient for benchmarking, since, for instance, the ImageNet dataset is too huge for statistical research. Therefore, an additional requirement is that the size of the benchmark should be moderate. This implies a medium image size (not larger than 128 pixels) as well as a fairly small number of image categories (a few tens at the most). There are three datasets that completely satisfy these requirements: CIFAR-10 (Fig. 1), NORB (Fig. 2), EEACL26 (Fig. 3).

Although CIFAR-10 has only 10 image categories, the diversity of its entries is the highest. The image categories labelled as airplane, automobile, bird, cat, deer, dog, frog, horse, ship, truck are diverse themselves. CIFAR-10 consists of 60000 images, where each category is represented with 6000 entries.

Fig. 1. A subset of the CIFAR-10 dataset consisting of color images whose original size is 32 × 32 in each of the three color channels [6], [9], [10]. The diversity of its entries is highest as the dataset is heterogeneous itself.

The NORB dataset consists of images (a part of which served for training) representing fifty toys belonging to five generic categories (four-legged animals, human figures, airplanes, trucks and cars). Although NORB has only six image categories, including one image background category, the diversity of its entries is rather high. The objects were originally imaged by two cameras at six sets of lighting conditions, nine elevations, and eighteen azimuths. Then they were jittered and cluttered by random perturbation of position, scaling, varying brightness and contrast. The disparities were adjusted and randomly picked so that the objects appeared placed on highly textured horizontal surfaces at a small random distance from those surfaces. In addition, a randomly picked distracting object was placed at the periphery of the image.

Fig. 2. A subset of the NORB dataset consisting of 108 × 108 8-bit greyscale images [6]. The diversity here is high but NORB has only six image categories.

A far lighter and easier dataset is EEACL26, which represents images of enlarged capital letters of the English alphabet. It has 26 categories, and it is a completely artificial dataset. Hence it is scalable: as many images can be generated as needed, and their size is adjustable. There are three types of distortion: scaling, rotation, shifting. The intensity of these distortions is regulated with their magnitudes. Fig. 3 shows a moderate intensity of the distortions. At such intensity, 52000 entries (2000 entries per letter) are enough for training and validating [3], [7], [].

V. ADMISSIBILITY OF INTEGERS N

Admissibility here implies rationality and reasonability, i.e. testing the problems on an admissible interval of integers N must expose the best performance as well as a moderate one, while the worse performance is expected closer to the endpoints of the interval. Setting a single ConvL is obviously inappropriate, so let N = 2 be the left endpoint of the interval for the worst-case reference. The maximum integer N depends on the problem and its image size. The entries of […] are recognised successfully by four to six ConvLs for any image size between […] and […]. The same goes for […]. For successful training on the […] dataset, some versions of CNNs have only three ConvLs [3].

Eventually, the number of ConvLs is also adjusted with the number of pooling layers which follow the ConvLs. Hence, let N ≤ 8 for 32 × 32 images by applying no resizing for CIFAR-10 and downsampling the NORB entries. Then let N ≤ 9 for 48 × 48 images and N ≤ 10 for 64 × 64 images by upsampling the CIFAR-10 entries and downsampling the NORB entries. It is appropriate to set N ≤ 11 for 96 × 96 images. Separately, N ≤ 12 for the original 108 × 108 NORB images. All the versions of CNN architecture to be tested are shown as binary combinations in Table I, where the pooling (2 × 2 subsampling) is indicated with ones, and zeros indicate that a ConvL is not followed by a pooling layer [9], [19], [20].
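The way filter size, stride, zero-padding and 2 × 2 pooling together limit how many ConvLs an image of a given size can carry may be checked with simple arithmetic. The sketch below is only an illustration: it uses the standard output-size relation floor((W − F + 2p) / s) + 1, which the paper does not state explicitly, and the function names, the filter sizes and the pooling pattern in the usage note are hypothetical rather than taken from Table I.

```python
def conv_output_size(size, f, stride=1, padding=0):
    """Spatial size after one ConvL with an f x f filter (standard formula)."""
    if f <= 0 or stride <= 0 or padding < 0:
        raise ValueError("f and stride must be positive, padding non-negative")
    if size + 2 * padding < f:
        raise ValueError("filter is larger than the padded input")
    return (size - f + 2 * padding) // stride + 1


def trace_sizes(image_size, filter_sizes, pooling):
    """Return the spatial size after each ConvL (and its optional pooling).

    filter_sizes: filter side length of every ConvL, from the CNN input.
    pooling: binary combination, 1 if the ConvL is followed by 2 x 2
             subsampling and 0 otherwise.
    Assumes stride 1 and no zero-padding (an assumption of this sketch).
    """
    if len(filter_sizes) != len(pooling):
        raise ValueError("one pooling flag is needed per ConvL")
    sizes = []
    size = image_size
    for f, pooled in zip(filter_sizes, pooling):
        size = conv_output_size(size, f)
        if pooled:
            size //= 2  # 2 x 2 subsampling halves the spatial size
        if size < 1:
            raise ValueError("the architecture is too deep for this image size")
        sizes.append(size)
    return sizes
```

For a 32 × 32 image, `trace_sizes(32, [5, 3, 3, 3], [1, 1, 0, 0])` yields the sizes 14, 6, 4 and 2, which shows how quickly the spatial size is exhausted and why small images admit fewer ConvLs or fewer pooling layers.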

Fig. 3. A subset of the EEACL26 dataset consisting of 8-bit greyscale images created from originally monochrome 60 × 80 images [9]. Unlike CIFAR-10 or NORB, EEACL26 images are extremely simple; however, they fall into 26 classes.

TABLE I
VERSIONS OF CNN ARCHITECTURE TO BE TESTED ON THE DATASETS
Columns: # (version number); CNN architecture (N); size of ConvLs' filters (in order of ConvLs numbering from the CNN input); image size (dimension); datasets.

The listed architectures are close to being quasi-optimal for the corresponding N. For accelerating the training processes, a single ReLU before the last ConvL is inserted, without DropOut layers [2], [22]. Although it would impair generalisation, our task is to obtain consistent statistics on performance. The performance consistency implies a good enough differentiation of error rate over various versions of CNN architecture (see Table I), which must help in finding the most appropriate integer(s) N.

VI. EXTRACTION OF INTEGERS N CORRESPONDING TO THE BEST PERFORMANCE

It takes a few epochs to obtain a sufficiently discriminated performance. Let v_p be the error rate for the dataset with image size W × W for a CNN architecture version (numbered in the first column of Table I) after the p-th epoch. Then, for comparing among datasets, the performance is normalised over Q(W) to either the normalised error rate (1) [9] or the final-epoch normalised error rate (2), where Q(W) is the set of the versions for the given dataset and the given image size. For instance, two such sets are formed for researching the minimum and maximum size of […] images. The sets are the same for […]. The NORB dataset is researched in a wider range, from Q(32) = {1, 6, 11, 16, 21, 26, 31} to Q(108) = {5, 10, 15, 20, 25, 30, 35, 39, 42, 44, 45}.

Figures 4–6 show the normalised error rates (1) polylined for fulfilling trend comparisons along the axis, where a set Q̂(W) corresponds to Q(W) for each of the three datasets. The final-epoch normalised error rates (2) are polylined in Figures 7–9 by the same axes. A similarity between a dataset's polylines holds. However, the polylines of final-epoch performance (2) look more scattered.

Fig. 4. The normalised error rates (1) for CIFAR-10. The best performance is observed at four ConvLs, except for the largest image size, for which the best performance corresponds to five ConvLs.

Fig. 5. The normalised error rates (1) for NORB. The best performance is observed at five ConvLs, except for the smallest image size, where the best performance is provided by four ConvLs.

Fig. 6. The normalised error rates (1) for EEACL26. The best performance is observed at five ConvLs, except for the smallest images, where the best performance is provided by three or four ConvLs.

Fig. 7. The final-epoch normalised error rates (2) for CIFAR-10. The best N for W = 32 is 4, the best N for W = 96 is 6, N = 5 fits for the rest of the cases.

Fig. 8. The final-epoch normalised error rates (2) for NORB. Unexpectedly, N = 5 fits for W ∈ {64, 96, 108} whereas the smaller images prefer N = 5.

Fig. 9. The final-epoch normalised error rates (2) for EEACL26. For W = 48, two minima exist, so the appropriateness of ConvLs is similar to that in Fig. […].
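The figure captions above report, for each polyline, the number of ConvLs at which the error rate is minimal. A minimal sketch of that extraction follows. The exact normalisations (1) and (2) are not legible in this text, so dividing by the largest error rate over the set Q(W) is an assumption made purely for illustration; the function names and the numbers in the usage note are hypothetical as well.

```python
def normalise(error_rates):
    """Scale error rates so that the worst version in the set maps to 1.

    error_rates: dict mapping a version number from Q(W) to its error rate.
    The division by the maximum is an assumed normalisation, not the
    paper's formula.
    """
    worst = max(error_rates.values())
    if worst <= 0:
        raise ValueError("error rates must contain a positive value")
    return {version: rate / worst for version, rate in error_rates.items()}


def best_num_convls(error_rates, num_convls):
    """Return N of the version whose normalised error rate is minimal.

    num_convls: dict mapping a version number to its number of ConvLs N.
    """
    normalised = normalise(error_rates)
    best_version = min(normalised, key=normalised.get)
    return num_convls[best_version]
```

With the made-up rates `{1: 0.40, 6: 0.30, 11: 0.25, 16: 0.28, 21: 0.35}` and the mapping `{1: 2, 6: 3, 11: 4, 16: 5, 21: 6}`, the function returns 4. Because any normalisation by a positive constant preserves the position of the minimum, the extracted N does not depend on the assumed formula; only comparisons across datasets would.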

An apparent tendency that can be seen in Figs. 4–9 lies in the risk of CNN training failure when we increase the number of ConvLs. Too primitive architectures (consisting of only two ConvLs) do not work either. However, making a distinct conclusion on these polylines is hardly possible. So, further averaging is needed. This will not concern the size W = 108. As the sets Q̂(W) of the three datasets are pairwise different (but, perhaps fortunately, not disjoint), the average performance of the three datasets is to be viewed (Fig. 10) as the arithmetic mean of their three normalised error rates (1), which gives (3), and of their three final-epoch normalised error rates (2), which gives (4). Both means are taken over the CNN architecture versions common to the three sets Q̂(W), for W ∈ {32, 48, 64, 96} (5). For the dataset of the largest image size, formally, the averages (6) and (7) are its own error rates (1) and (2) by Q̂(108). Data (6) and (7) being a segment longer than the rest, they are taken back from Figures 5 and 8, respectively.

Fig. 10. The average performance of the three datasets by (3) and (4), wherein only three common CNN architectures constitute an argument axis for each of the eight polylines. In the vertical direction, there are not more than two points above the same CNN architecture version.

Except for the image sizes of 48 and 108 (only by final-epoch performance), all of these polylines (they are two-segment lines, except for (6) consisting of three segments in Fig. 5) increase. Although Figure 10 only deals with the dimensionality of an image recognition problem, it gives us a straight conclusion that problems of a higher dimensionality require more ConvLs. Nevertheless, the appropriate number of ConvLs for such problems is not much greater than that for lower dimensionalities: with the image size increased three times (from 32 up to 96), the appropriate N does not change more than from 4 to 6 (if all the polylines are considered). Moreover, considering only the eight polylines in Figure 10, the appropriate N is just 5 for any image size, except for […] images, where the appropriate N is 4 (see e.g. [9]).

VII. THE RULE FOR AN APPROPRIATE N

Apparently, as the image size increases, we may need more ConvLs. Then, however, the appropriate N should be increased only slightly, to prevent the risk of CNN training failure. Setting seven ConvLs for the benchmarked datasets has adverse consequences. How does the number of image categories/classes influence the appropriateness of N? Table II, which contains integers N that correspond to the error rate minima (in Figures 4–10), helps us see this. As can be easily seen, the dependence of the appropriate integer N on the number of classes is hardly perceptible. It rather depends on the complexity of the image recognition problem, and the number of classes is only one of the components of that complexity.

TABLE II
THE APPROPRIATE NUMBER OF CONVLS THAT CORRESPONDS TO THE ERROR RATE MINIMA IN FIGURES 4–10
Columns: W; then, for each of the three datasets taken in order of increasing numbers of classes, the appropriate N by error rate (1) and by error rate (2).
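The trial process that this section arrives at, namely starting from an initial number of ConvLs and adding ConvLs until the performance starts deteriorating, can be sketched as a short loop. This is only a sketch under stated assumptions: `evaluate` is a hypothetical callback that builds and trains a CNN with n ConvLs and returns its error rate, and `max_n` is an assumed cap on the search, neither of which is prescribed by the paper.

```python
def find_num_convls(evaluate, initial_n=4, max_n=12):
    """Increase the number of ConvLs until the error rate stops improving.

    evaluate: hypothetical callable, evaluate(n) -> error rate of a CNN
              with n ConvLs (lower is better).
    initial_n: the initial number of ConvLs to try.
    max_n: assumed upper bound on the number of ConvLs to try.
    Returns the best N found and its error rate.
    """
    best_n = initial_n
    best_error = evaluate(initial_n)
    for n in range(initial_n + 1, max_n + 1):
        error = evaluate(n)
        if error >= best_error:
            break  # performance started deteriorating, so stop the search
        best_n, best_error = n, error
    return best_n, best_error
```

As a usage example with made-up error rates `toy = {4: 0.30, 5: 0.25, 6: 0.27, 7: 0.40}`, the call `find_num_convls(toy.__getitem__, initial_n=4, max_n=7)` stops after trying six ConvLs and returns `(5, 0.25)`. Stopping at the first deterioration keeps the number of trained networks small, at the cost of possibly missing a later, deeper minimum.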

Hence, the rule for appropriate N in CNNs is to try fewer ConvLs (an initial number) and then increase the number of ConvLs until the CNN performance starts deteriorating. For small images (like those in CIFAR-10), that initial number is 4. For much more complex image recognition problems (in particular, ones with a few tens of image categories and more), it is recommended to initially set N = 5. Definitely, the initial number of ConvLs for problems with a few thousand image categories is recommended to be set at 6, 7 or 8. Starting with N ≥ 10 is not recommended.

VIII. CONCLUSION

The attempt of finding an appropriate number of ConvLs in CNNs has been based on benchmarks of heterogeneous datasets. The heterogeneousness is principally needed for ensuring applicability of the appropriateness rule. Generally, the rule cannot give an exact number of ConvLs or even a few versions for this number outright. The rule is rather a short process of trying a few versions of N, starting from N = 4 for datasets whose image size is less than 100 and whose number of image categories is a few tens. In other cases, N ∈ {5, 6, 7, 8} at the beginning, where the greater N corresponds to problems with a higher degree of complexity [23]. It seems that such fuzziness in the rule is not removable because of the required diversity and heterogeneousness of the problems.

REFERENCES

[1] H. H. Aghdam and E. J. Heravi, Guide to Convolutional Neural Networks: A Practical Application to Traffic-Sign Detection and Classification. Cham, Switzerland: Springer.
[2] A. Gibson and J. Patterson, Deep Learning: A Practitioner's Approach. O'Reilly Media, 2017.
[3] S. Srinivas, R. K. Sarvadevabhatla, K. R. Mopuri, N. Prabhu, S. S. S. Kruthiventi, and R. V. Babu, "Chapter 2 An Introduction to Deep Convolutional Neural Nets for Computer Vision," in Deep Learning for Medical Image Analysis, S. K. Zhou, H. Greenspan, and D. Shen, Eds. Academic Press, 2017.
[4] V. Andrearczyk and P. F. Whelan, "Using Filter Banks in Convolutional Neural Networks for Texture Classification," Pattern Recognition Letters, vol. 4, Dec.
[5] Z. Liao and G. Carneiro, "A Deep Convolutional Neural Network Module that Promotes Competition of Multiple-Size Filters," Pattern Recognition, vol. 7.
[6] D. Ciresan, U. Meier, J. Masci, L. M. Gambardella, and J. Schmidhuber, "Flexible, High Performance Convolutional Neural Networks for Image Classification," in Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence, vol. 2, 2011.
[7] A. Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet Classification With Deep Convolutional Neural Networks," Communications of the ACM, vol. 60, iss. 6, pp. 4 90.
[8] J. Mutch and D. G. Lowe, "Object Class Recognition and Localization Using Sparse Features With Limited Receptive Fields," International Journal of Computer Vision, vol. 0, iss.
[9] V. V. Romanuke, "Appropriate Number and Allocation of ReLUs in Convolutional Neural Networks," Research Bulletin of the National Technical University of Ukraine "Kyiv Polytechnic Institute", no., pp. 69 7.
[10] P. Date, J. A. Hendler, and C. D. Carothers, "Design Index for Deep Neural Networks," Procedia Computer Science, vol., pp. 3 3.
[11] K. Simonyan and A. Zisserman, "Very Deep Convolutional Networks for Large-Scale Image Recognition," Computer Vision and Pattern Recognition, 2015.
[12] V. V. Romanuke, "Boosting Ensembles of Heavy Two-Layer Perceptrons for Increasing Classification Accuracy in Recognizing Shifted-Turned-Scaled Flat Images With Binary Features," Journal of Information and Organizational Sciences, vol. 39, no., pp. 75 4, 2015.
[13] V. V. Romanuke, "Two-Layer Perceptron for Classifying Flat Scaled-Turned-Shifted Objects by Additional Feature Distortions in Training," Journal of Uncertain Systems, vol. 9, no. 4, 2015.
[14] P. K. Rhee, E. Erdenee, S. D. Kyun, M. U. Ahmed, and S. Jin, "Active and Semi-Supervised Learning for Object Detection With Imperfect Data," Cognitive Systems Research, vol. 45.
[15] P. Tang, H. Wang, and S. Kwong, "G-MS2F: GoogLeNet Based Multi-Stage Feature Fusion of Deep CNN for Scene Recognition," Neurocomputing, vol. 225, pp. 97.
[16] C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich, "Going Deeper With Convolutions," Computer Vision and Pattern Recognition, 2014.
[17] V. V. Romanuke, "Classifying Scaled-Turned-Shifted Objects With Optimal Pixel-to-Scale-Turn-Shift Standard Deviations Ratio in Training 2-Layer Perceptron on Scaled-Turned-Shifted 400-Featured Objects Under Normally Distributed Feature Distortion," Electrical, Control and Communication Engineering, vol. 3, iss.
[18] V. V. Romanuke, "Classification Error Percentage Decrement of Two-Layer Perceptron for Classifying Scaled Objects on the Pattern of Monochrome 60-by-80-Images of 26 Alphabet Letters by Training With Pixel-Distorted Scaled Images," Scientific Bulletin of Chernivtsi National University of Yuriy Fedkovych. Series: Computer Systems and Components, vol. 4, iss. 3, 2013.
[19] M. Sun, Z. Song, X. Jiang, J. Pan, and Y. Pang, "Learning Pooling for Convolutional Neural Network," Neurocomputing, vol. 224.
[20] D. Scherer, A. Müller, and S. Behnke, "Evaluation of Pooling Operations in Convolutional Architectures for Object Recognition," in International Conference on Artificial Neural Networks (ICANN 2010), pp. 92 0.
[21] S. Lai, L. Jin, and W. Yang, "Toward High-Performance Online HCCR: A CNN Approach With DropDistortion, Path Signature and Spatial Stochastic Max-Pooling," Pattern Recognition Letters, vol. 9.
[22] N. Srivastava, G. E. Hinton, A. Krizhevsky, I. Sutskever, and R. R. Salakhutdinov, "Dropout: A Simple Way to Prevent Neural Networks From Overfitting," Journal of Machine Learning Research, vol. 5, 2014.
[23] L. P. F. Garcia, A. C. P. L. F. de Carvalho, and A. C. Lorena, "Effect of Label Noise in the Complexity of Classification Problems," Neurocomputing, vol. 60, pp. 0 9.

Vadim V. Romanuke was born in 1979. He received his higher education in 2001. In 2006, he received the degree of Candidate of Technical Sciences in Mathematical Modelling and Computational Methods. His candidate dissertation suggested a way of increasing the interference noise immunity of data transferred over radio systems. Mr. Romanuke received his degree of Doctor of Technical Sciences in Mathematical Modelling and Computational Methods in 2014. His Doctor-of-Science dissertation solved the problem of increasing the efficiency of the identification of models for multistage technical control and run-in under multivariate uncertainties of their parameters and relationships. In 2016, he received the status of Full Professor. Mr. Romanuke is a Professor at the Faculty of Navigation and Naval Weapons at the Polish Naval Academy. His research interests concern decision-making, game theory, statistical approximation, and control engineering based on statistical correspondence. Vadim Romanuke has good programming skills in MATLAB. For practical implementations, Mr. Romanuke uses Python. Also, he directs a branch of fitting statistical approximators at the Centre of Parallel Computations managed by Khmelnitskiy National University (Ukraine).
Address for correspondence: 69 Śmidowicza Street, Gdynia, Poland
E-mail: romanukevadimv@gmail.com

Speech Recognition Combining MFCCs and Image Features

Speech Recognition Combining MFCCs and Image Features Speech Recognition Combining MFCCs and Image Featres S. Karlos from Department of Mathematics N. Fazakis from Department of Electrical and Compter Engineering K. Karanikola from Department of Mathematics

More information

Novel Blind Recognition Algorithm of Frame Synchronization Words Based on Soft- Decision in Digital Communication Systems

Novel Blind Recognition Algorithm of Frame Synchronization Words Based on Soft- Decision in Digital Communication Systems RESEARCH ARTICLE Novel Blind Recognition Algorithm of Frame Synchronization Words Based on Soft- Decision in Digital Commnication Systems Jiangyi Qin*, Zhiping Hang, Chnw Li, Shaojing S, Jing Zho College

More information

A Parallel Multilevel-Huffman Decompression Scheme for IP Cores with Multiple Scan Chains

A Parallel Multilevel-Huffman Decompression Scheme for IP Cores with Multiple Scan Chains A Parallel Mltilevel-Hffman Decompression Scheme for IP Cores with Mltiple Scan Chains X Kavosianos, E Kalligeros 2 and D Nikolos 2 Compter Science Dept, University of Ioannina, 45 Ioannina, Greece 2 Compter

More information

A Buyers Guide to Laser Projection

A Buyers Guide to Laser Projection The Eropean Digital Cinema Form A Byers Gide to Laser Projection AUTUMN 2018 Table of Contents Slides 2-5 Introdctory notes Slides 6-22 1: Technical Considerations Slides 23-31 2. Financial and lifetime

More information

Review: What is it? What does it do? slti $4, $5, 6

Review: What is it? What does it do? slti $4, $5, 6 Review: What is it? What does it do? Reg Src Instrction Instrction [3-] I [25-2] I [2-6] I [5 - ] 2 Src Op Reslt em em emtoreg I [5 - ] etend slti $, $5, 6 Reg Src Instrction Instrction [3-] I [25-2] I

More information

Analog Signal Input. ! Note: B.1 Analog Connections. Programming for Analog Channels

Analog Signal Input. ! Note: B.1 Analog Connections. Programming for Analog Channels B Analog Signal Inpt B.1 Analog Connections Refer to the diagram (page B-10) showing the VAN analog boards for connection of analog inpts. Be sre yo follow the indicated positive and negative polarity

More information

Chapter 4 (Part I) The Processor. Baback Izadi Division of Engineering Programs

Chapter 4 (Part I) The Processor. Baback Izadi Division of Engineering Programs EGC442 Introdction to Compter Architectre Chapter 4 (Part I) The Processor Baback Izadi Division of Engineering Programs bai@engr.newpaltz.ed Introdction CPU performance factors Instrction cont Determined

More information

A Real-time Framework for Video Time and Pitch Scale Modification

A Real-time Framework for Video Time and Pitch Scale Modification Dblin Institte of Technology ARROW@DIT Conference papers Adio Research Grop 2008-06-01 A Real-time Framework for Video Time and Pitch Scale Modification Ivan Damnjanovic Qeen Mary University London Dan

More information

With Ease. BETTY WAGNER Associate Trinity College London, Associate Music Australia READING LEDGER LINE NOTES

With Ease. BETTY WAGNER Associate Trinity College London, Associate Music Australia READING LEDGER LINE NOTES READING LEDGER LINE NOTES With Ease f G f o o BETTY WAGNER Associate Trinity College London, Associate Msic Astralia READING LEDGER LINE NOTES A Nova WITH EASE Book Company Page Pblication http://www.msic-with-ease.com

More information

MINIMED 640G SYSTEM^ Getting Started. WITH THE MiniMed 640G INSULIN PUMP

MINIMED 640G SYSTEM^ Getting Started. WITH THE MiniMed 640G INSULIN PUMP MINIMED 640G SYSTEM^ Getting Started WITH THE MiniMed 640G INSULIN PUMP let s get started! Table of Contents Section 1: Getting Started... 3 Getting Started with the MiniMed 640G Inslin Pmp...3 1.1 Pmp

More information

Scene Classification with Inception-7. Christian Szegedy with Julian Ibarz and Vincent Vanhoucke

Scene Classification with Inception-7. Christian Szegedy with Julian Ibarz and Vincent Vanhoucke Scene Classification with Inception-7 Christian Szegedy with Julian Ibarz and Vincent Vanhoucke Julian Ibarz Vincent Vanhoucke Task Classification of images into 10 different classes: Bedroom Bridge Church

More information

A Model for Scale-Degree Reinterpretation: Melodic Structure, Modulation, and Cadence Choice in the Chorale Harmonizations of J. S.

A Model for Scale-Degree Reinterpretation: Melodic Structure, Modulation, and Cadence Choice in the Chorale Harmonizations of J. S. Empirical Msicology Review Vol. 10, No. 3, 2015 A Model for Scale-Degree Reinterpretation: Melodic Strctre, Modlation, and Cadence Choice in the Chorale Harmonizations of J. S. Bach TREVOR de CLERCQ[1]

More information

BRAND GUIDELINES 2017

BRAND GUIDELINES 2017 BRAND GUIDELINES 2017 01 CONTENTS Introdction 02 Or Brand 04 Brand Positioning Statement 06 Reasons to Believe 07 Tone of Voice 09 Visal Gidelines 10 Typography: Print & Web 11 Color Palette 13 Using the

More information

770pp. THEORIA 64 (2009)

770pp. THEORIA 64 (2009) DOV M. GABBAY AND JOHN WOODS: The Rise of Modern Logic: From Leibniz to Frege. [Handbook of the History of Logic, vol. 3]. Elsevier North Holland, Amsterdam, 2004, 770pp. This volme contains essays on

More information

HELMUT T. ZWAHLEN AND UMA DEVI VEL

HELMUT T. ZWAHLEN AND UMA DEVI VEL TRANSPORTATION RESEARCH RECORD 1456 125 Conspicity in Terms of Peripheral Visal Detection and Recognition of Florescent Color Targets Verss N onflorescent Color Targets Against Different Backgronds in

More information

Cast Away on the Letter A

Cast Away on the Letter A Cast Away on the Letter A TEACHER S GUIDE ELA COMMON CORE STANDARDS 4TH GRADE: For 4th Grade: Key Ideas and Details CCSS.ELA-LITERACY.RL.4.2 Determine a theme of a story, drama, or poem from details in

More information

Easy Estimation of Spectral Purity of Test Signals for ADC Testing. David Slepička

Easy Estimation of Spectral Purity of Test Signals for ADC Testing. David Slepička Sep. -4, 008, lorence, Italy Easy Estimation of Spectral Prity of Test Signals for ADC Testing David Slepička Czech Technical University in Prage, aclty of Electrical Engineering, Dept. of Measrement Technická,

More information

c:: Frequency response characteristics for sinusoidal movement in the fovea and periphery* ==> 0.' SOO O.S 2.0

c:: Frequency response characteristics for sinusoidal movement in the fovea and periphery* ==> 0.' SOO O.S 2.0 Freqency response characteristics for sinsoidal movement in the fovea and periphery* C. WILLIAM TYLER and JEAN TORRES Northeastern University, Boston, Massachsetts 211 Threshold sensitivity was measred

More information

Brain-actuated Control of Wheelchair Using Fuzzy Neural Networks

Brain-actuated Control of Wheelchair Using Fuzzy Neural Networks Int'l Conf. Artificial Intelligence ICAI'6 67 Brain-actated Control of Wheelchair Using Fzzy Neral Networks Rahib H.Abiyev, Nrllah Akkaya, Ersin Aytac, Irfan Günsel, Ahmet Ça man, Sanan Abizade Near East

More information

SPECTRA RESEARCH Institute

SPECTRA RESEARCH Institute SPECTRA RESEARCH Institte Final Report Neroelectric Activity and Analysis in Spport of Direct Brainwave to Compter Interface Development Richard H. Dickhat prepared for the Office of Naval Research nder

More information

Detecting Musical Key with Supervised Learning

Detecting Musical Key with Supervised Learning Detecting Musical Key with Supervised Learning Robert Mahieu Department of Electrical Engineering Stanford University rmahieu@stanford.edu Abstract This paper proposes and tests performance of two different

More information

Music Composition with RNN

Music Composition with RNN Music Composition with RNN Jason Wang Department of Statistics Stanford University zwang01@stanford.edu Abstract Music composition is an interesting problem that tests the creativity capacities of artificial

DISTRIBUTION STATEMENT A 7001Ö

DISTRIBUTION STATEMENT A 7001Ö Serial Number 09/678.881 Filing Date 4 October 2000 Inventor Robert C. Higgins NOTICE The above identified patent application is available for licensing. Requests for information should be addressed to:

Montgomery Modular Exponentiation on Reconfigurable Hardware

Montgomery Modular Exponentiation on Reconfigurable Hardware Montgomery Modular Exponentiation on Reconfigurable Hardware Thomas Blum Worcester Polytechnic Institute ECE Department Worcester, MA 0609-2280, USA tblum@ece.wpi.edu Christof Paar christof@ece.wpi.edu Abstract

In 2007, Pew Research conducted a survey to assess Americans knowledge of

In 2007, Pew Research conducted a survey to assess Americans knowledge of CHAPTER 12 Sample Surveys In 2007, Pew Research conducted a survey to assess Americans knowledge of current events. They asked a random sample of 1,502 U.S. adults 23 factual questions about topics currently in

A combination of approaches to solve Task How Many Ratings? of the KDD CUP 2007

A combination of approaches to solve Task How Many Ratings? of the KDD CUP 2007 A combination of approaches to solve Task How Many Ratings? of the KDD CUP 2007 Jorge Sueiras C/ Arequipa +34 9 382 45 54 orge.sueiras@neo-metrics.com Daniel Vélez C/ Arequipa +34 9 382 45 54 José Luis

CS 7643: Deep Learning

CS 7643: Deep Learning CS 7643: Deep Learning Topics: Stride, padding Pooling layers Fully-connected layers as convolutions Backprop in conv layers Dhruv Batra Georgia Tech Invited Talks Sumit Chopra on CNNs for Pixel Labeling

Doubletalk Detection

Doubletalk Detection ELEN-E4810 Digital Signal Processing Fall 2004 Doubletalk Detection Adam Dolin David Klaver Abstract: When processing a particular voice signal it is often assumed that the signal contains only one speaker,

Using Device-Specific Data Acquisition for Automated Laboratory Testing

Using Device-Specific Data Acquisition for Automated Laboratory Testing TRANSPORTATION RESEARCH RECORD 1432 9 Using Device-Specific Data Acquisition for Automated Laboratory Testing THOMAS C. SHEAHAN, DON J. DEGROOT, AND JOHN T. GERMAINE Computer-based data acquisition systems

Judging a Book by its Cover

Judging a Book by its Cover Judging a Book by its Cover Brian Kenji Iwana, Syed Tahseen Raza Rizvi, Sheraz Ahmed, Andreas Dengel, Seiichi Uchida Department of Advanced Information Technology, Kyushu University, Fukuoka, Japan Email:

NAA ENHANCING THE QUALITY OF MARKING PROJECT: THE EFFECT OF SAMPLE SIZE ON INCREASED PRECISION IN DETECTING ERRANT MARKING

NAA ENHANCING THE QUALITY OF MARKING PROJECT: THE EFFECT OF SAMPLE SIZE ON INCREASED PRECISION IN DETECTING ERRANT MARKING NAA ENHANCING THE QUALITY OF MARKING PROJECT: THE EFFECT OF SAMPLE SIZE ON INCREASED PRECISION IN DETECTING ERRANT MARKING Mudhaffar Al-Bayatti and Ben Jones February 00 This report was commissioned by

Music Genre Classification and Variance Comparison on Number of Genres

Music Genre Classification and Variance Comparison on Number of Genres Music Genre Classification and Variance Comparison on Number of Genres Miguel Francisco, miguelf@stanford.edu Dong Myung Kim, dmk8265@stanford.edu 1 Abstract In this project we apply machine learning techniques

IMAGE AESTHETIC PREDICTORS BASED ON WEIGHTED CNNS. Oce Print Logic Technologies, Creteil, France

IMAGE AESTHETIC PREDICTORS BASED ON WEIGHTED CNNS. Oce Print Logic Technologies, Creteil, France IMAGE AESTHETIC PREDICTORS BASED ON WEIGHTED CNNS Bin Jin, Maria V. Ortiz Segovia2 and Sabine Süsstrunk EPFL, Lausanne, Switzerland; 2 Oce Print Logic Technologies, Creteil, France ABSTRACT Convolutional

PERCEPTUAL QUALITY COMPARISON BETWEEN SINGLE-LAYER AND SCALABLE VIDEOS AT THE SAME SPATIAL, TEMPORAL AND AMPLITUDE RESOLUTIONS. Yuanyi Xue, Yao Wang

PERCEPTUAL QUALITY COMPARISON BETWEEN SINGLE-LAYER AND SCALABLE VIDEOS AT THE SAME SPATIAL, TEMPORAL AND AMPLITUDE RESOLUTIONS. Yuanyi Xue, Yao Wang PERCEPTUAL QUALITY COMPARISON BETWEEN SINGLE-LAYER AND SCALABLE VIDEOS AT THE SAME SPATIAL, TEMPORAL AND AMPLITUDE RESOLUTIONS Yuanyi Xue, Yao Wang Department of Electrical and Computer Engineering Polytechnic

Distortion Analysis Of Tamil Language Characters Recognition

Distortion Analysis Of Tamil Language Characters Recognition www.ijcsi.org 390 Distortion Analysis Of Tamil Language Characters Recognition Gowri.N 1, R. Bhaskaran 2, 1. T.B.A.K. College for Women, Kilakarai, 2. School Of Mathematics, Madurai Kamaraj University,

Joint Image and Text Representation for Aesthetics Analysis

Joint Image and Text Representation for Aesthetics Analysis Joint Image and Text Representation for Aesthetics Analysis Ye Zhou 1, Xin Lu 2, Junping Zhang 1, James Z. Wang 3 1 Fudan University, China 2 Adobe Systems Inc., USA 3 The Pennsylvania State University,

DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring Week 6 Class Notes

DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring Week 6 Class Notes DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring 2009 Week 6 Class Notes Pitch Perception Introduction Pitch may be described as that attribute of auditory sensation in terms

Lecture 2 Video Formation and Representation

Lecture 2 Video Formation and Representation 2013 Spring Term 1 Lecture 2 Video Formation and Representation Wen-Hsiao Peng ( 彭文孝 ) Multimedia Architecture and Processing Lab (MAPL) Department of Computer Science National Chiao Tung University 1

LTC 8800 Series Allegiant Matrix/Control Systems - Modular

LTC 8800 Series Allegiant Matrix/Control Systems - Modular Video LTC 8800 Series Allegiant Matrix/Control Systems - Modular LTC 8800 Series Allegiant Matrix/Control Systems - Modular www.boschsecurity.com 5 Camera by 4 monitor switching Expandable to larger matrix sizes

Pipelining. Improve performance by increasing instruction throughput Program execution order. Data access. Instruction. fetch. Data access.

Pipelining. Improve performance by increasing instruction throughput Program execution order. Data access. Instruction. fetch. Data access. Chapter 6 Pipelining Improve performance by increasing instruction throughput Program execution order Time (in instructions) lw $, ($) Instruction fetch 2 4 6 8 2 4 6 8 ALU Data access lw $2, 2($) 8 ns Instruction

Precision testing methods of Event Timer A032-ET

Precision testing methods of Event Timer A032-ET Precision testing methods of Event Timer A032-ET Event Timer A032-ET provides extreme precision. Therefore exact determination of its characteristics in commonly accepted way is impossible or, at least,

1. INTRODUCTION. Index Terms Video Transcoding, Video Streaming, Frame skipping, Interpolation frame, Decoder, Encoder.

1. INTRODUCTION. Index Terms Video Transcoding, Video Streaming, Frame skipping, Interpolation frame, Decoder, Encoder. Video Streaming Based on Frame Skipping and Interpolation Techniques Fadlallah Ali Fadlallah Department of Computer Science Sudan University of Science and Technology Khartoum-SUDAN fadali@sustech.edu

AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY

AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY Eugene Mikyung Kim Department of Music Technology, Korea National University of Arts eugene@u.northwestern.edu ABSTRACT

DQMx Series. Digital QAM Multiplexer INSTRUCTION MANUAL. Model Stock No. Description

DQMx Series. Digital QAM Multiplexer INSTRUCTION MANUAL. Model Stock No. Description One Jake Brown Road Old Bridge, NJ 08857-1000 USA (800) 523-6049 (732) 679-4000 FAX: (732) 679-4353 www.blondertongue.com INSTRUCTION MANUAL DQMx Series Digital QAM Multiplexer Model Stock No. Description

A Transfer Learning Based Feature Extractor for Polyphonic Sound Event Detection Using Connectionist Temporal Classification

A Transfer Learning Based Feature Extractor for Polyphonic Sound Event Detection Using Connectionist Temporal Classification INTERSPEECH 17 August, 17, Stockholm, Sweden A Transfer Learning Based Feature Extractor for Polyphonic Sound Event Detection Using Connectionist Temporal Classification Yun Wang and Florian Metze Language

Product Overview 2009

Product Overview 2009 Product Overview 2009 Living high tech 1 Contents Editorial...3 The new ECoS 4 The new ECoS - Just Play...5 Functions detailed...7 Expandibility...9 ECoS 10 ECoS...10 Expandibility...11 Navigator 12 Equipment

E-Vision Laser 4K Series High Brightness Digital Video Projector

E-Vision Laser 4K Series High Brightness Digital Video Projector E-Vision Laser 4K Series High Brightness Digital Video Projector 4INSTALLATION AND QUICK-START GUIDE 4CONNECTION GUIDE 4OPERATING GUIDE 4REFERENCE GUIDE 118-157A About This Document Follow the instructions

CS229 Project Report Polyphonic Piano Transcription

CS229 Project Report Polyphonic Piano Transcription CS229 Project Report Polyphonic Piano Transcription Mohammad Sadegh Ebrahimi Stanford University Jean-Baptiste Boin Stanford University sadegh@stanford.edu jbboin@stanford.edu 1. Introduction In this project

Noise Flooding for Detecting Audio Adversarial Examples Against Automatic Speech Recognition

Noise Flooding for Detecting Audio Adversarial Examples Against Automatic Speech Recognition Noise Flooding for Detecting Audio Adversarial Examples Against Automatic Speech Recognition Krishan Rajaratnam The College University of Chicago Chicago, USA krajaratnam@uchicago.edu Jugal Kalita Department

TRAFFIC SURVEILLANCE VIDEO MANAGEMENT SYSTEM

TRAFFIC SURVEILLANCE VIDEO MANAGEMENT SYSTEM TRAFFIC SURVEILLANCE VIDEO MANAGEMENT SYSTEM K.Ganesan*, Kavitha.C, Kriti Tandon, Lakshmipriya.R TIFAC-Centre of Relevance and Excellence in Automotive Infotronics*, School of Information Technology and

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video Mohamed Hassan, Taha Landolsi, Husameldin Mukhtar, and Tamer Shanableh College of Engineering American

Research on sampling of vibration signals based on compressed sensing

Research on sampling of vibration signals based on compressed sensing Research on sampling of vibration signals based on compressed sensing Hongchun Sun 1, Zhiyuan Wang 2, Yong Xu 3 School of Mechanical Engineering and Automation, Northeastern University, Shenyang, China

2. Problem formulation

2. Problem formulation Artificial Neural Networks in the Automatic License Plate Recognition. Ascencio López José Ignacio, Ramírez Martínez José María Facultad de Ciencias Universidad Autónoma de Baja California Km. 103 Carretera

Deep Neural Networks Scanning for patterns (aka convolutional networks) Bhiksha Raj

Deep Neural Networks Scanning for patterns (aka convolutional networks) Bhiksha Raj Deep Neural Networks Scanning for patterns (aka convolutional networks) Bhiksha Raj 1 Story so far MLPs are universal function approximators Boolean functions, classifiers, and regressions MLPs can be

An Introduction to Deep Image Aesthetics

An Introduction to Deep Image Aesthetics Seminar in Laboratory of Visual Intelligence and Pattern Analysis (VIPA) An Introduction to Deep Image Aesthetics Yongcheng Jing College of Computer Science and Technology Zhejiang University Zhenchuan

Less is More: Picking Informative Frames for Video Captioning

Less is More: Picking Informative Frames for Video Captioning Less is More: Picking Informative Frames for Video Captioning ECCV 2018 Yangyu Chen 1, Shuhui Wang 2, Weigang Zhang 3 and Qingming Huang 1,2 1 University of Chinese Academy of Science, Beijing, 100049,

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur Module 8 VIDEO CODING STANDARDS Lesson 27 H.264 standard Lesson Objectives At the end of this lesson, the students should be able to: 1. State the broad objectives of the H.264 standard. 2. List the improved

MUSI-6201 Computational Music Analysis

MUSI-6201 Computational Music Analysis MUSI-6201 Computational Music Analysis Part 9.1: Genre Classification alexander lerch November 4, 2015 temporal analysis overview text book Chapter 8: Musical Genre, Similarity, and Mood (pp. 151 155)

DeepID: Deep Learning for Face Recognition. Department of Electronic Engineering,

DeepID: Deep Learning for Face Recognition. Department of Electronic Engineering, DeepID: Deep Learning for Face Recognition Xiaogang Wang Department of Electronic Engineering, The Chinese University of Hong Kong Machine Learning with Big Data Machine learning with small data: overfitting,

Module 3: Video Sampling Lecture 16: Sampling of video in two dimensions: Progressive vs Interlaced scans. The Lecture Contains:

Module 3: Video Sampling Lecture 16: Sampling of video in two dimensions: Progressive vs Interlaced scans. The Lecture Contains: Sampling of Video Signals Choice of sampling rates Sampling a Video in Two Dimensions: Progressive vs. Interlaced Scans

Identifying Table Tennis Balls From Real Match Scenes Using Image Processing And Artificial Intelligence Techniques

Identifying Table Tennis Balls From Real Match Scenes Using Image Processing And Artificial Intelligence Techniques Identifying Table Tennis Balls From Real Match Scenes Using Image Processing And Artificial Intelligence Techniques K. C. P. Wong Department of Communication and Systems Open University Milton Keynes,

Understanding Compression Technologies for HD and Megapixel Surveillance

Understanding Compression Technologies for HD and Megapixel Surveillance When the security industry began the transition from using VHS tapes to hard disks for video surveillance storage, the question of how to compress and store video became a top consideration for video surveillance

Audio Cover Song Identification using Convolutional Neural Network

Audio Cover Song Identification using Convolutional Neural Network Audio Cover Song Identification using Convolutional Neural Network Sungkyun Chang 1,4, Juheon Lee 2,4, Sang Keun Choe 3,4 and Kyogu Lee 1,4 Music and Audio Research Group 1, College of Liberal Studies

PERCEPTUAL QUALITY OF H.264/AVC DEBLOCKING FILTER

PERCEPTUAL QUALITY OF H.264/AVC DEBLOCKING FILTER PERCEPTUAL QUALITY OF H.264/AVC DEBLOCKING FILTER Y. Zhong, I. Richardson, A. Miller and Y. Zhao School of Engineering, The Robert Gordon University, Schoolhill, Aberdeen, AB1 1FR, UK Phone: + 1, Fax: + 1,

A. Ideal Ratio Mask If there is no RIR, the IRM for time frame t and frequency f can be expressed as [17]: ( IRM(t, f) =

A. Ideal Ratio Mask If there is no RIR, the IRM for time frame t and frequency f can be expressed as [17]: ( IRM(t, f) = 1 Two-Stage Monaural Source Separation in Reverberant Room Environments using Deep Neural Networks Yang Sun, Student Member, IEEE, Wenwu Wang, Senior Member, IEEE, Jonathon Chambers, Fellow, IEEE, and

LOCOCODE versus PCA and ICA. Jurgen Schmidhuber. IDSIA, Corso Elvezia 36. CH-6900-Lugano, Switzerland. Abstract

LOCOCODE versus PCA and ICA. Jurgen Schmidhuber. IDSIA, Corso Elvezia 36. CH-6900-Lugano, Switzerland. Abstract LOCOCODE versus PCA and ICA Sepp Hochreiter Technische Universitat Munchen 80290 Munchen, Germany Jurgen Schmidhuber IDSIA, Corso Elvezia 36 CH-6900-Lugano, Switzerland Abstract We compare the performance

Feature-Based Analysis of Haydn String Quartets

Feature-Based Analysis of Haydn String Quartets Feature-Based Analysis of Haydn String Quartets Lawson Wong 5/5/2 Introduction When listening to multi-movement works, amateur listeners have almost certainly asked the following situation : Am I still

THE EVENT ARGUMENT and ARGUMENT INTRODUCERS: little v, and the Applicative Head. λe <s,t> v Appl

THE EVENT ARGUMENT and ARGUMENT INTRODUCERS: little v, and the Applicative Head. λe <s,t> v Appl THE EVENT ARGUMENT and ARGUMENT INTRODUCERS: little v, and the Applicative Head λe v Appl OUR ROADMAP Review of morphosyntactic function of v Adding events to our notation How little v came to be The

Part 1: Introduction to Computer Graphics

Part 1: Introduction to Computer Graphics Part 1: Introduction to Computer Graphics 1. Define computer graphics? The branch of science and technology concerned with methods and techniques for converting data to or from visual presentation using

Experimental Study on Two-Phase Flow Instability in System Including Downcomers

Experimental Study on Two-Phase Flow Instability in System Including Downcomers Journal of Nuclear Science and Technology ISSN: 0022-3131 (Print) 1881-1248 (Online) Journal homepage: https://www.tandfonline.com/loi/tnst Experimental Study on Two-Phase Flow Instability in System Including

On-Supporting Energy Balanced K-Barrier Coverage In Wireless Sensor Networks

On-Supporting Energy Balanced K-Barrier Coverage In Wireless Sensor Networks On-Supporting Energy Balanced K-Barrier Coverage In Wireless Sensor Networks Chih-Yung Chang cychang@mail.tku.edu.tw Li-Ling Hung Aletheia University llhung@mail.au.edu.tw Yu-Chieh Chen ycchen@wireless.cs.tk

Research Article. ISSN (Print) *Corresponding author Shireen Fathima

Research Article. ISSN (Print) *Corresponding author Shireen Fathima Scholars Journal of Engineering and Technology (SJET) Sch. J. Eng. Tech., 2014; 2(4C):613-620 Scholars Academic and Scientific Publisher (An International Publisher for Academic and Scientific Resources)

Smart Traffic Control System Using Image Processing

Smart Traffic Control System Using Image Processing Smart Traffic Control System Using Image Processing Prashant Jadhav 1, Pratiksha Kelkar 2, Kunal Patil 3, Snehal Thorat 4 1234Bachelor of IT, Department of IT, Theem College Of Engineering, Maharashtra,

LSTM Neural Style Transfer in Music Using Computational Musicology

LSTM Neural Style Transfer in Music Using Computational Musicology LSTM Neural Style Transfer in Music Using Computational Musicology Jett Oristaglio Dartmouth College, June 4 2017 1. Introduction In the 2016 paper A Neural Algorithm of Artistic Style, Gatys et al. discovered

Image-to-Markup Generation with Coarse-to-Fine Attention

Image-to-Markup Generation with Coarse-to-Fine Attention Image-to-Markup Generation with Coarse-to-Fine Attention Presenter: Ceyer Wakilpoor Yuntian Deng 1 Anssi Kanervisto 2 Alexander M. Rush 1 Harvard University 3 University of Eastern Finland ICML, 2017 Yuntian

HELICAL SCAN TECHNOLOGY: ADVANCEMENT BY DESIGN

HELICAL SCAN TECHNOLOGY: ADVANCEMENT BY DESIGN HELICAL SCAN TECHNOLOGY: ADVANCEMENT BY DESIGN By Curt Mulder And Kelly Scharf Exabyte Corporation THIC Conference Del Mar, CA 1/20/98 1685 38 th Street Boulder, CO 80301 +1-303-442-4333 +1-303-417-7080

100Gb/s Single-lane SERDES Discussion. Phil Sun, Credo Semiconductor IEEE New Ethernet Applications Ad Hoc May 24, 2017

100Gb/s Single-lane SERDES Discussion. Phil Sun, Credo Semiconductor IEEE New Ethernet Applications Ad Hoc May 24, 2017 100Gb/s Single-lane SERDES Discussion Phil Sun, Credo Semiconductor IEEE 802.3 New Ethernet Applications Ad Hoc May 24, 2017 Introduction This contribution tries to share thoughts on 100Gb/s single-lane

OPTIMIZING VIDEO SCALERS USING REAL-TIME VERIFICATION TECHNIQUES

OPTIMIZING VIDEO SCALERS USING REAL-TIME VERIFICATION TECHNIQUES OPTIMIZING VIDEO SCALERS USING REAL-TIME VERIFICATION TECHNIQUES Paritosh Gupta Department of Electrical Engineering and Computer Science, University of Michigan paritosg@umich.edu Valeria Bertacco Department

EDDY CURRENT IMAGE PROCESSING FOR CRACK SIZE CHARACTERIZATION

EDDY CURRENT IMAGE PROCESSING FOR CRACK SIZE CHARACTERIZATION EDDY CURRENT IMAGE PROCESSING FOR CRACK SIZE CHARACTERIZATION R.O. McCary General Electric Co., Corporate Research and Development P.O. Box 8 Schenectady, N. Y. 12309 INTRODUCTION Estimation of crack length

Lecture 9 Source Separation

Lecture 9 Source Separation 10420CS 573100 音樂資訊檢索 Music Information Retrieval Lecture 9 Source Separation Yi-Hsuan Yang Ph.D. http://www.citi.sinica.edu.tw/pages/yang/ yang@citi.sinica.edu.tw Music & Audio Computing Lab, Research

Broken Wires Diagnosis Method Numerical Simulation Based on Smart Cable Structure

Broken Wires Diagnosis Method Numerical Simulation Based on Smart Cable Structure PHOTONIC SENSORS / Vol. 4, No. 4, 2014: 366 372 Broken Wires Diagnosis Method Numerical Simulation Based on Smart Cable Structure Sheng LI 1*, Min ZHOU 2, and Yan YANG 3 1 National Engineering Laboratory

An Improved Fuzzy Controlled Asynchronous Transfer Mode (ATM) Network

An Improved Fuzzy Controlled Asynchronous Transfer Mode (ATM) Network An Improved Fuzzy Controlled Asynchronous Transfer Mode (ATM) Network C. IHEKWEABA and G.N. ONOH Abstract This paper presents basic features of the Asynchronous Transfer Mode (ATM). It further showcases

An Efficient Spurious Power Suppression Technique (SPST) and its Applications on MPEG-4 AVC/H.264 Transform Coding Design

An Efficient Spurious Power Suppression Technique (SPST) and its Applications on MPEG-4 AVC/H.264 Transform Coding Design An Efficient Spurious Power Suppression Technique (SPST) and its Applications on MPEG-4 AVC/H.264 Transform Coding Design Kuan-Hung Chen, Kuo-Chuan Chao, Jinn-Shyan Wang, Yuan-Sun Chu Department of Electrical Engineering, National

How to Obtain a Good Stereo Sound Stage in Cars

How to Obtain a Good Stereo Sound Stage in Cars Page 1 How to Obtain a Good Stereo Sound Stage in Cars Author: Lars-Johan Brännmark, Chief Scientist, Dirac Research First Published: November 2017 Latest Update: November 2017 Designing a sound system

Representations of Sound in Deep Learning of Audio Features from Music

Representations of Sound in Deep Learning of Audio Features from Music Representations of Sound in Deep Learning of Audio Features from Music Sergey Shuvaev, Hamza Giaffar, and Alexei A. Koulakov Cold Spring Harbor Laboratory, Cold Spring Harbor, NY Abstract The work of a

Deep learning for music data processing

Deep learning for music data processing Deep learning for music data processing A personal (re)view of the state-of-the-art Jordi Pons www.jordipons.me Music Technology Group, DTIC, Universitat Pompeu Fabra, Barcelona. 31st January 2017 Jordi

LEARNING AUDIO SHEET MUSIC CORRESPONDENCES. Matthias Dorfer Department of Computational Perception

LEARNING AUDIO SHEET MUSIC CORRESPONDENCES. Matthias Dorfer Department of Computational Perception LEARNING AUDIO SHEET MUSIC CORRESPONDENCES Matthias Dorfer Department of Computational Perception Short Introduction... I am a PhD Candidate in the Department of Computational Perception at Johannes Kepler

Math 81 Graphing. Cartesian Coordinate System Plotting Ordered Pairs (x, y) (x is horizontal, y is vertical) center is (0,0) Quadrants:

Math 81 Graphing. Cartesian Coordinate System Plotting Ordered Pairs (x, y) (x is horizontal, y is vertical) center is (0,0) Quadrants: Math 81 Graphing Cartesian Coordinate System Plotting Ordered Pairs (x, y) (x is horizontal, y is vertical) center is (0,0) Ex 1. Plot and indicate which quadrant they're in. A (0,2) B (3, 5) C (-2, -4)

Improving Performance in Neural Networks Using a Boosting Algorithm

Improving Performance in Neural Networks Using a Boosting Algorithm - Improving Performance in Neural Networks Using a Boosting Algorithm Harris Drucker AT&T Bell Laboratories Holmdel, NJ 07733 Robert Schapire AT&T Bell Laboratories Murray Hill, NJ 07974 Patrice Simard

System Quality Indicators

System Quality Indicators Chapter 2 System Quality Indicators The integration of systems on a chip, has led to a revolution in the electronic industry. Large, complex system functions can be integrated in a single IC, paving the

MPEG has been established as an international standard

MPEG has been established as an international standard 1100 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 9, NO. 7, OCTOBER 1999 Fast Extraction of Spatially Reduced Image Sequences from MPEG-2 Compressed Video Junehwa Song, Member,

Optimization of Multi-Channel BCH Error Decoding for Common Cases. Russell Dill Master's Thesis Defense April 20, 2015

Optimization of Multi-Channel BCH Error Decoding for Common Cases. Russell Dill Master's Thesis Defense April 20, 2015 Optimization of Multi-Channel BCH Error Decoding for Common Cases Russell Dill Master's Thesis Defense April 20, 2015 Bose-Chaudhuri-Hocquenghem (BCH) BCH is an Error Correcting Code (ECC) and is used

What is Statistics? 13.1 What is Statistics? Statistics

What is Statistics? 13.1 What is Statistics? Statistics 13.1 What is Statistics? What is Statistics? The collection of all outcomes, responses, measurements, or counts that are of interest. A portion or subset of the population. Statistics Is the science of

EX65 Explosion Protected Camera

EX65 Explosion Protected Camera Video EX65 Explosion Protected EX65 Explosion Protected www.boschsecurity.com Electropolished 316L stainless steel or aluminum construction High-resolution, high-sensitivity Dinion 2X imager with WDR Integrated

IDENTIFYING TABLE TENNIS BALLS FROM REAL MATCH SCENES USING IMAGE PROCESSING AND ARTIFICIAL INTELLIGENCE TECHNIQUES

IDENTIFYING TABLE TENNIS BALLS FROM REAL MATCH SCENES USING IMAGE PROCESSING AND ARTIFICIAL INTELLIGENCE TECHNIQUES IDENTIFYING TABLE TENNIS BALLS FROM REAL MATCH SCENES USING IMAGE PROCESSING AND ARTIFICIAL INTELLIGENCE TECHNIQUES Dr. K. C. P. WONG Department of Communication and Systems Open University, Walton Hall

SMART VEHICLE SCREENING SYSTEM USING ARTIFICIAL INTELLIGENCE METHODS

SMART VEHICLE SCREENING SYSTEM USING ARTIFICIAL INTELLIGENCE METHODS 1 TERNOPIL ACADEMY OF NATIONAL ECONOMY INSTITUTE OF COMPUTER INFORMATION TECHNOLOGIES SMART VEHICLE SCREENING SYSTEM USING ARTIFICIAL INTELLIGENCE METHODS Presenters: Volodymyr Turchenko Vasyl Koval The

HIGHlite 4K Series High Brightness Digital Video Projector

HIGHlite 4K Series High Brightness Digital Video Projector HIGHlite 4K Series High Brightness Digital Video Projector 4INSTALLATION AND QUICK-START GUIDE 4CONNECTION GUIDE 4OPERATING GUIDE 4REFERENCE GUIDE Rev A February 2018 118-083A About This Document Follow the

Color Image Compression Using Colorization Based On Coding Technique

Color Image Compression Using Colorization Based On Coding Technique Color Image Compression Using Colorization Based On Coding Technique D.P.Kawade 1, Prof. S.N.Rawat 2 1,2 Department of Electronics and Telecommunication, Bhivarabai Sawant Institute of Technology and Research

Enabling editors through machine learning

Enabling editors through machine learning Meta Follow Meta is an AI company that provides academics & innovation-driven companies with powerful views of t Dec 9, 2016 9 min read Enabling editors through machine learning Examining the data science

CS 1674: Intro to Computer Vision. Face Detection. Prof. Adriana Kovashka University of Pittsburgh November 7, 2016

CS 1674: Intro to Computer Vision. Face Detection. Prof. Adriana Kovashka University of Pittsburgh November 7, 2016 CS 1674: Intro to Computer Vision Face Detection Prof. Adriana Kovashka University of Pittsburgh November 7, 2016 Today Window-based generic object detection basic pipeline boosting classifiers face detection
