Combining usage and content in an online recommendation system for music in the Long Tail


Int J Multimed Info Retr (2013) 2:3-13
REGULAR PAPER

Marcos Aurélio Domingues, Fabien Gouyon, Alípio Mário Jorge, José Paulo Leal, João Vinagre, Luís Lemos, Mohamed Sordo

Received: 25 June 2012 / Revised: 21 September 2012 / Accepted: 8 November 2012 / Published online: 27 November 2012
Springer-Verlag London 2012

M. A. Domingues (corresponding author), F. Gouyon: INESC TEC-INESC Technology and Science, Porto, Portugal. maddomingues@gmail.com; marcos.a.domingues@inescporto.pt. F. Gouyon: fgouyon@inescporto.pt
A. M. Jorge, J. Vinagre, L. Lemos: LIAAD/INESC TEC and FCUP, U. Porto, Porto, Portugal. amjorge@fc.up.pt. J. Vinagre: jnsilva@inescporto.pt. L. Lemos: llemos@inescporto.pt
J. P. Leal: CRACS/INESC TEC and FCUP, U. Porto, Porto, Portugal. zp@dcc.fc.up.pt
M. Sordo: Universitat Pompeu Fabra, Barcelona, Spain. mohamed.sordo@upf.edu

Abstract Nowadays, a large number of people consume music from the web. Web sites and online services now typically contain millions of music tracks, which complicates search, retrieval, and discovery of music. Music recommender systems can address these issues by recommending relevant and novel music to a user based on personal musical tastes. In this paper, we propose a hybrid music recommender system, which combines usage and content data. We describe an online evaluation experiment performed in real time on a commercial web site specialized in content from the very Long Tail of music. We compare the hybrid system against two standalone recommender systems, the first based on usage data and the second based on content data (namely, audio and textual tags). The results show that the proposed hybrid recommender has advantages over the usage-based and content-based systems, namely, a higher absolute user acceptance rate, a higher user activity rate and higher user loyalty.
Keywords Music recommendation, Hybrid recommender system, Usage data, Tags, Audio features

1 Introduction

Music discovery and consumption have changed dramatically in recent years. According to recent reports, e.g., from consultancy firms [21], the web has become an increasingly relevant source of music discovery, recently reaching the importance of traditional sources such as AM/FM radio, music TV, or friends. Most people now consume music on their personal computers and mobile devices via the Internet. However, with virtually millions of pieces of music (henceforth, tracks) available from thousands of web sites or online services, avoiding overwhelming choices and finding the right music has become a challenge for users. Music recommender systems have emerged in response to this problem. A music recommender system is an information-filtering technology which can be used to output an ordered list of music tracks that are likely to be of interest to the user [10]. Music recommendation has flourished on the Internet, and web sites such as Last.fm, Amazon, Audiobaba, Mog, Musicovery, Shazam and Pandora are successful examples of

music recommenders that adapt recommendations to a particular user's tastes. Several strategies for content filtering exist: (1) demographic filtering, (2) collaborative filtering (e.g., Amazon), (3) content-based filtering (e.g., Pandora), and (4) hybrid approaches. The first strategy is the simplest but has been proven to have severe limitations [10]; hence, the focus in this paper is on strategies (2)-(4).

Collaborative filtering is based on usage data (typically rating data), i.e., recommendations are made to a user depending on personal past usage and on huge amounts of usage data from other users. This technique has proven to be extremely effective, for instance in Apple's Genius music recommender (part of iTunes), and to produce better recommendations than other techniques, as judged by panels of users [3]. However, this is only true when usage data are available. Indeed, two related problems of collaborative filtering are the early-rater problem (items that are seldom rated, if at all, such as new items or the less-popular items from the end of the Long Tail [2], are never recommended) and popularity bias (items with many ratings are similar to lots of other items and are, hence, very often recommended) [10].

Content-based approaches are not based on usage data, but on the very content of the items themselves. This content can be described automatically [5] (e.g., Muffin), based on experts' annotations (e.g., Pandora), or by mining contextual data of the items (web mining, social tagging, etc.). Anchoring recommendation on the content itself is supposed to solve the early-rater and popularity bias problems. However, these approaches are typically less successful than collaborative filtering [27], because the performance of automatic music content description algorithms is still relatively limited, which makes item similarity prone to mistakes.
Another problem of the content-based paradigm is lack of personalization (similarity does not account for any data about or from the particular user to whom the recommendation is made) [10]. To address the previous problems and achieve better recommendations than stand-alone techniques, usage and content-based approaches have been combined in different ways as hybrid recommenders [7].

In this paper, we propose a hybrid recommender system implemented for Palco Principal, a Portuguese web site of music of diverse genres: typical Portuguese music such as Fado, of course, but also hip-hop, jazz, etc. Most of its music tracks are underground, unknown/unpopular and rarely accessed/rated by the users. In fact, only 19.7 % of its artists also exist on the Last.fm web site. This is a good example of very Long Tail content, for which traditional usage-based recommenders typically do not work so well [10]. The hybrid recommender is evaluated online on Palco Principal with real-time user interaction. It is compared against a usage-based recommender and a content-based recommender. We also propose performance measures to determine the impact of the recommenders on user activity and loyalty.

The paper is organized as follows. In Sect. 2, we present some related work. Section 3 shows how to calculate similarities among items with usage-based data, and with content-based data accounting for two types of content-based descriptions (namely, tags or audio features). In Sect. 4, we describe the usage-based and the content-based recommender approaches which serve as benchmarks. In that section, we also present our hybrid proposal. Section 5 shows the results obtained with a case study set up to evaluate our hybrid proposal against the usage- and content-based approaches. In Sect. 6, we discuss the results and present our next steps.
Finally, a summary is presented in Sect. 7.

2 Related works

Recently, the music information retrieval community has focused its efforts on research and development of music recommendation services. The main music recommenders proposed in the literature are based on collaborative filtering [22,26], content-based [8,9] and hybrid [6,29] approaches.

Although the topic of music recommendation systems is not new, it is now inspired by new capabilities of large online services that provide not only millions of music tracks for listening to, but also radio station hosting. In [1], the authors propose a music recommender system that uses music tracks and meta-data from Internet radio streams as data source, instead of the more commonly used users' playlists and feedback data. By using latent factor models and data from radio streams, a recommender system is built to predict the corresponding probability distribution of items to be played. A hybrid recommender system to recommend Internet radio stations is proposed in [30]. The system combines collaborative and user-based models. First, two ranked lists of recommended stations are generated by using these models. Then, the lists are aggregated by the weighted sum of their ranks, generating a final list of ranked stations that is sorted and recommended to the user.

Another fact that has inspired research on music recommendation systems is that music consumption remains biased toward a few popular tracks. Therefore, recommender systems still suffer from the problem of the less-popular items at the end of the Long Tail. An in-depth study of the Long Tail problem in music recommendation systems is presented in Ref. [11]. The author not only defines and characterizes the Long Tail problem but also proposes some solutions to address it. In Ref. [19], the authors propose a recommendation system for artists in the Long Tail using the conventional item-based collaborative filtering technique.
The system first identifies a suitable candidate pool of Long

Tail artists to build the recommendation model. When a user requests recommendations, the model generates a candidate set with the most similar artists within the pool. Then, each candidate is scored, according to its similarity, and the highest scored candidates are recommended to the user. In a more recent paper [15], the Long Tail problem is addressed using a method for sharing information across different items (music tracks) of the same taxonomy, which mitigates the problem of predicting music with insufficient data. To recommend music tracks, the recommendation system, which is based on matrix factorization, incorporates a rich bias model with terms that capture information from the taxonomy of music tracks and different temporal dynamics of music ratings.

3 Different modalities for item similarities

Item-based recommender systems exploit similarity among items [22]. The system looks into the set of items that users have rated and computes the similarity between pairs of items, generating a matrix representing the similarities between all the pairs of items, according to a similarity measure. In such a similarity matrix, each item i can be, for example, a music track.

The effectiveness of an item-based recommender system depends on the method used to calculate the similarity among the items in the matrix. Thus, in the next sections, we present three different methods to calculate the similarity among music tracks. These methods tap into two different types of data: usage-based data on the one hand, and content-based data (both tags and audio features) on the other hand.

3.1 Usage-based similarity

The simplest form of usage data is a pair <user, item> meaning that user had a positive interaction with item. Examples are "user viewed a document from a collection", "user listened to a track", "user liked a film", "user bought a book", etc. The positive nature of the interaction is often inferred from behavior. In the case of this work, we have access to playlists, which are collections of music tracks created and organized by individual users. The fact that a user adds a track to a playlist is regarded as a preference. Therefore, a <user, item> pair means, in our case, that a particular user added a particular track (item) to his playlist and, ergo, likes this music. Usage data such as this is a particular case of preference data where each user rates some items on a given scale (e.g., 1-5). In this case, we have a binary scale (i.e., likes/does not like).

To compute the similarity between pairs of music tracks from usage data, for example, $m_1$ and $m_2$, we first identify the users who have included the tracks in their playlists. Then, we compute the similarity $\mathrm{sim}(m_1, m_2)$ between $m_1$ and $m_2$. Each track $m_i$ can be seen as a binary vector $\vec{m}_i$ with as many positions as the number of users. In each position, there is a 1 if the track is in the playlist of the corresponding user and a 0 otherwise. In Ref. [22], the authors present three methods to measure similarity between pairs of items: cosine angle, Pearson's correlation and adjusted cosine angle. In this paper, we use the cosine angle, defined as

$$\mathrm{sim}(m_1, m_2) = \cos(\vec{m}_1, \vec{m}_2) = \frac{\vec{m}_1 \cdot \vec{m}_2}{\|\vec{m}_1\| \, \|\vec{m}_2\|}, \quad (1)$$

where the operator $\cdot$ denotes the dot-product of the two vectors. The values of $\mathrm{sim}(m_1, m_2)$ range from -1 to 1. A value closer to 1 or -1 means that the music tracks, $m_1$ and $m_2$, are very similar or dissimilar, respectively. On the other hand, if the value of $\mathrm{sim}(m_1, m_2)$ is close to 0, it means that there is no correlation between the two music tracks.

After calculating the similarity values between any two tracks, we are able to compute the degree (score) to which any given track is recommendable to any given user (represented by the set of tracks in the respective playlist). This process will be detailed in Sect. 4.

3.2 Tag-based similarity

Social tags are free text labels introduced by users (usually non-experts) of any system to describe the content of a web or multimedia item. In music, social tags are assigned to items such as artists, playlists or music tracks [17]. In our particular case, tags describe the content of music tracks, and are typically words or short phrases related to genre, instrument and influence. For example, music tracks in our data are typically tagged with tags like "flute", "guitar", "folk", "feminine voice", "rock" or "Daft Punk". The combination of the annotations provided by hundreds or thousands of music users leads to the emergence of a body of domain-specific knowledge, usually referred to as folksonomy. One way to exploit such knowledge is by looking at the correlations between tags. To capture the tag correlation, an M × N matrix of tracks and tags is built, where M is the number of tracks and N the number of tags, e.g., see Fig. 1. Matrix elements with values different from 0 mean that a given tag $N_j$ has been used to annotate a given music track $M_i$. The rationale is that

music tracks with similar tag annotations are more prone to be similar. This technique, however, has limitations. First, the dimensions M and N can be extremely large, thus making the problem computationally expensive. Second, the matrix is usually very sparse: it is very unlikely that users will tag a music track with more than 100 tags. Moreover, many tags introduced by users are rarely used, whilst a few others are very common. This phenomenon, usually referred to as Long Tail distribution, is very common in social networks [2,10].

Fig. 1 Illustration of the singular value decomposition + dimension reduction used in LSA

To overcome this problem, an information-retrieval technique called latent semantic analysis (LSA) [12] is used to analyze the inherent structure of the matrix. LSA assumes a latent semantic structure that lies underneath the randomness of word choice and spelling in noisy datasets [4]. Basically, LSA consists of two steps. In the first step, a projection of the original M × N space to a continuous space of concepts is performed, using statistical or algebraic techniques, such as singular value decomposition (SVD). Given the original sparse matrix, $\mathbf{M}$, the SVD of $\mathbf{M}$ is computed as follows:

$$\mathbf{M} = U \Sigma V^{*}, \quad (2)$$

where $U$ is an M × M unitary matrix, $\Sigma$ is an M × N diagonal matrix whose diagonal entries are the singular values of $\mathbf{M}$, and $V^{*}$ represents the conjugate transpose of $V$, an N × N unitary matrix [14]. Given that the first singular values of a matrix tend to encompass most of the information from this matrix, this technique additionally allows us to reduce the dimensionality of the original matrix, by choosing a relatively small number of singular values (L), while still preserving the similarity structure among rows or columns. Finding the right number of dimensions, L, is not a trivial task. It depends on the applicability of the resulting vectors of concepts.
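The first LSA step can be illustrated with a small sketch, assuming NumPy is available. It is not the implementation used on the site: the function names `lsa_track_vectors` and `cosine` and the toy track × tag matrix are hypothetical, and tracks are represented by rows of the truncated matrix U, compared with the cosine measure.

```python
import numpy as np

def lsa_track_vectors(track_tag, n_concepts):
    """Return one row per track in the space of the leading latent concepts."""
    # Thin SVD: track_tag = U @ diag(s) @ Vh, singular values in descending order.
    U, s, Vh = np.linalg.svd(np.asarray(track_tag, dtype=float), full_matrices=False)
    L = min(n_concepts, s.size)
    # Keep the first L columns of U, i.e. the L strongest concepts.
    return U[:, :L]

def cosine(a, b):
    """Cosine of the angle between two vectors (0.0 if either vector is all zeros)."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(np.dot(a, b) / denom) if denom > 0 else 0.0

# Toy track x tag matrix (hypothetical data): rows are tracks, columns are tags.
X = np.array([
    [1, 1, 0, 0, 0],  # track 0
    [1, 1, 0, 0, 0],  # track 1, same tags as track 0
    [0, 0, 1, 1, 0],  # track 2, no tag in common with tracks 0 and 1
    [0, 0, 1, 1, 1],  # track 3
])
vectors = lsa_track_vectors(X, n_concepts=2)
same = cosine(vectors[0], vectors[1])       # identical tag profiles
different = cosine(vectors[0], vectors[2])  # disjoint tag profiles
```

In this toy case, tracks with identical tags end up with the same concept vector, while tracks with disjoint tags are orthogonal in the reduced space. On a real, large and sparse matrix, a sparse truncated SVD routine would likely be preferable to the dense call used here.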

In this paper, we empirically chose a value of 50. Figure 1 depicts this process. Moreover, information-retrieval literature [4,20] states that, after raw data have been mapped into this latent semantic space, topic (in our case, music tracks) separability is improved.

The second step of LSA refers to the distance measure used for calculating the similarity between pairs of music tracks. The most prominent similarity distance in the literature is the cosine distance, defined as:

$$\mathrm{sim}(t_1, t_2) = \cos(\vec{t}_1, \vec{t}_2) = \frac{\vec{t}_1 \cdot \vec{t}_2}{\|\vec{t}_1\| \, \|\vec{t}_2\|}, \quad (3)$$

where $\vec{t}_1$ and $\vec{t}_2$ are binary vectors with all the tag concepts (i.e., they correspond to a row in matrix U in Fig. 1). A value of 1 or 0 represents the presence or absence, respectively, of the tag concept for the given music track.

3.3 Audio-based similarity

For this approach, we have used the free MARSYAS framework to extract 16 audio features from 46-ms frames of the audio signals with no overlap. The features are the spectral centroid, rolloff frequency, spectral flux, and 13 MFCCs, including MFCC0 [28]. Features are aggregated in 1-s texture windows, and then averaged over the whole file. Final features are the average and standard deviation (SD). Although better audio features exist [6,24,25], we chose these features because of their relatively low dimensionality, low computational time, availability of code and the fact that they are widespread in the literature.

After extracting the audio features for each track, we calculate the similarity among the tracks. The similarity is calculated by the Euclidean distance over the 16 audio features. Here, we define the Euclidean distance between two tracks, $a_1$ and $a_2$, as follows:

$$\mathrm{sim}(a_1, a_2) = \mathrm{Euclidean}(\vec{a}_1, \vec{a}_2) = \sqrt{\sum_{f=1}^{16} (a_{1,f} - a_{2,f})^2}, \quad (4)$$

where $\vec{a}_1$ and $\vec{a}_2$ are vectors with the 16 audio features. Note that, contrary to the cosine, where the similarity is directly proportional to the measure, with the Euclidean distance the similarity is inversely proportional to the measure, i.e., the lower the measure, the higher the similarity.

4 Music recommendation based on diverse modalities

In this section, we show how the similarity methods presented in Sect. 3 can be used to recommend music tracks. We start by describing a usage-based and a content-based recommender system, which are used as benchmark systems in this paper. Then, we propose a hybrid recommender system that combines both usage and content. Notice that we are only dealing with recommendation of individual music items, which a given user can then add to a playlist he/she edits manually. We do not address the problem of recommending playlists of music items [13].

4.1 Usage-based recommendation

Usage-based recommendation is made on the basis of the similarity matrix between tracks described in Sect. 3.1. Given a user, his playlists are merged and the music tracks in them are used as seeds (S) for the recommendations. The general procedure follows the Item-based Collaborative Filtering algorithm [22]. For each recommendable music track m (typically any track that is not included in the user's playlists), we fetch its k closest neighbors N(m). These are the k tracks with maximum similarity to m. We then calculate the activation weight ActWeight of each track m which is not already in the playlists of the user [18]. For that, we first identify the intersection $N(m) \cap S$ between the neighbors of the track m and the seeds that characterize the user. Then, we sum the similarity values between each track s in this intersection and m. To normalize the activation weight, this sum is divided by the sum of similarities of m with each of its neighbors. Note that tracks that are already in the playlist are excluded from recommendation.

$$\mathrm{ActWeight}(m) = \frac{\sum_{s \in N(m) \cap S} \mathrm{sim}(m, s)}{\sum_{n \in N(m)} \mathrm{sim}(m, n)}. \quad (5)$$

Finally, we can recommend to a user the tracks with the highest activation weight.

4.2 Content-based recommendation

The content-based recommender system that we describe in this section combines tags and audio features to recommend music tracks. Arguably, there is no clear consensus in the literature about the definition of content versus context of music items. In this paper, tags are considered descriptors of the content of musical items rather than of their context, hence the combination of tags and audio features in the same recommendation engine. As for the usage data at hand, we choose to consider it as contextual data, as is usually done in the literature [23]. As proposed in Ref. [10], audio features should be good for low-level similarities (e.g., the main timbre of music tracks),

while tags should be good supplements as they account for higher-level information that could not be reliably computed from audio (e.g., feminine voice).

The system starts by computing two item-item similarity matrices (Sect. 3). One matrix is computed using tags (Sect. 3.2) and the other one using audio features (Sect. 3.3). Once we have the two matrices, we can generate the recommendations. Given a seed music track, $s \in S$, the system first fetches its k closest neighbors on each matrix, generating two lists of recommendable music tracks, i.e., one based on tags and the other based on audio features. Then, the system ranks each list separately, taking into account the similarities, and computes a final rank where the position is the sum of the two scores in every independent ranking. Finally, the k best-ranked music tracks, according to the final ranking, are recommended.

4.3 Recommendation combining usage and content

The recommendation strategy that combines usage and content data, referred to as Mix, is described in this section. Given a user playlist, we produce three lists of k recommendations: one obtained from usage data ($R_u$), one from tags ($R_t$) and the third from audio data ($R_a$). These three lists are sorted by inverse order of relevance of the recommendations. For each list, the recommended tracks are assigned ranks from k (top recommendation) to 1. The combined rank for each track is the average of the three ranks. For example, if a track m is the first recommendation in $R_u$, second in $R_t$ and does not occur in $R_a$, and assuming k = 100, the combined rank is (100 + 99 + 0)/3 ≈ 66.3.

4.4 Blacklisting the recommendations

In our music recommendation application, we also have a source of negative information, called blacklist (B). When recommendations are shown to the user, he has the option of blacklisting a particular recommendation. This way, the blacklisted track is not shown again.
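The scoring steps of Sects. 4.1, 4.3 and 4.4 can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the deployed code: the nested-dictionary layout `sim[a][b]`, the function names, and the toy data are hypothetical; a track missing from a list is assumed to contribute a rank of 0; and `acceptance_index` follows the acceptance index AccI of Eq. (6) in this section.

```python
from typing import Dict, List, Sequence, Set

# sim[a][b] holds the similarity between tracks a and b (hypothetical layout).
Sim = Dict[str, Dict[str, float]]

def act_weight(m: str, seeds: Set[str], sim: Sim, k: int) -> float:
    """Eq. (5): share of m's neighbor similarity that comes from seed tracks."""
    # N(m): the k tracks most similar to m.
    ranked = sorted((t for t in sim[m] if t != m), key=lambda t: sim[m][t], reverse=True)
    neighbors = ranked[:k]
    total = sum(sim[m][n] for n in neighbors)
    if total == 0:
        return 0.0
    return sum(sim[m][n] for n in neighbors if n in seeds) / total

def combined_rank(lists: Sequence[List[str]], k: int) -> Dict[str, float]:
    """Sect. 4.3: average rank over the lists; the top item gets k, the k-th gets 1."""
    totals: Dict[str, float] = {}
    for ranked in lists:
        for position, track in enumerate(ranked[:k]):
            totals[track] = totals.get(track, 0.0) + (k - position)
    # A track absent from a list contributes 0 to the sum (assumption).
    return {track: total / len(lists) for track, total in totals.items()}

def acceptance_index(times_blacklisted: int, times_in_playlists: int) -> float:
    """Eq. (6): 1 when a track was never blacklisted, lower the more often it was."""
    return 1.0 - times_blacklisted / (times_blacklisted + times_in_playlists + 1)

# Toy data (hypothetical): neighbors of "m" with k = 2 are "a" and "b"; only "a" is a seed.
sim = {"m": {"a": 0.5, "b": 0.3, "c": 0.2}}
weight = act_weight("m", seeds={"a", "c"}, sim=sim, k=2)

# "m" is first in the usage list, second in the tag list, absent from the audio list.
r_u, r_t, r_a = ["m", "x"], ["y", "m"], ["z"]
ranks = combined_rank([r_u, r_t, r_a], k=100)

# Final score: combined rank penalized by the acceptance index.
score = acceptance_index(times_blacklisted=1, times_in_playlists=2) * ranks["m"]
```

With k = 100, the toy track `m` gets the combined rank (100 + 99 + 0)/3, matching the worked example of Sect. 4.3.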
Here, we exclude from each similarity matrix the tracks in the blacklist B of the seed user. Moreover, the blacklist information is used to calculate a global acceptance index AccI of each track. This index captures the tendency of a track to be blacklisted and is calculated from the number of times a track is blacklisted, B(m), and the number of times it is included in a playlist, P(m):

$$\mathrm{AccI}(m) = 1 - \frac{B(m)}{B(m) + P(m) + 1}. \quad (6)$$

The value 1 means that the track is not included in any blacklist. After calculating AccI(m), it is multiplied by the final rank to obtain the score of the track. This penalizes tracks that are blacklisted by a large number of users.

5 Case study

The recommendation strategies described in the previous section have been deployed on Palco Principal, a start-up company that runs a web site of Portuguese music. Besides music recommendations, the site also provides services like news, advertisements, social networking and an application for users to access the services of the site through their mobile phone. During the period of our study, the site had about 76,000 registered users (61,223 listeners and 14,777 artists/bands who uploaded music) and 61,000 music tracks. From the tags available on the site, we used 373 tags, which can be categorized into three classes: genre (e.g., hip hop), instrument (e.g., clarinet) and influence (e.g., Daft Punk). There is a minimum of 1, a mean of 3.52 and a maximum of 36 tags per track. The minimum of 1 is due to the fact that the web site requires users to provide at least one tag for each track uploaded.

As already stated, most of the music tracks on Palco Principal are underground; in other words, they are unknown/unpopular and rarely accessed/rated by the users. In fact, 79 % of the music tracks have from 1 to 10 accesses, 19 % have from 11 to 100 accesses, and only 2 % of the music tracks have more than 100 accesses.
Furthermore, only 19.7 % of the artists on the Palco Principal web site also exist on the Last.fm web site. This means that we are facing a very Long Tail problem [2].

On the site, each of the recommenders is used separately. When a user opens the page for managing playlists, the recommender is invoked in real time and the results are shown to the user (Fig. 2). The user can then listen to recommended tracks and select tracks to add to his playlist (by clicking on the heart) or to his blacklist (by clicking on the cross). Notice that the recommender systems assume that users already have their own playlist. For a newly signed-up user, who does not have a playlist, the system recommends the most listened tracks on the web site.

5.1 Evaluation methodology

To compare the merits of the three recommenders (Usage, Content and Mix), we have performed an online evaluation [16] and followed the reactions of users during 22 weeks, between October 20, 2010 and March 22, 2011. These were real users with no knowledge of the ongoing evaluation. Each new user was assigned one of the three recommenders during this period. The assignment was decided by the remainder of the division of the user ID by 3. This way, we had a random assignment of users to each of the recommenders, and the same user would always get recommendations from the same source.

User activity has been recorded in two different ways. One was Google Analytics (GA) and the other was the site's

internal database (DB). In the case of GA, we have associated events to the user actions of adding to playlist and adding to blacklist. In the case of DB, we have the playlist and blacklist tables in the database. To be able to identify whether each track added to the playlists had been automatically recommended, we added a source field indicating which recommender had produced the recommendation. In the end, we observed only non-significant differences between the values obtained from GA and DB, which gave us confidence in the quality of the data to be analyzed.

Fig. 2 Recommendations as shown to the user

To measure the variation of the recommenders' effects, we have divided the 22 weeks into 11 periods of 2 weeks. For each period, we have measured the number of sessions (S), the number of additions to playlists (P) and the number of additions to blacklists (B) for each recommender. From these three basic measures we have defined the following derived measures:

$$\text{Activity rate} = (P + B)/S, \quad (7)$$

$$\text{Absolute acceptance rate} = P/S, \quad (8)$$

$$\text{Relative acceptance rate} = P/(P + B). \quad (9)$$

Google Analytics also provides information about the number and frequency of users who return to the site. For a given period, L(x) is the number of users who return x times to the site. Loyalty can then be measured in many different ways. We have tried to capture loyalty by counting users returning three times or more and using as reference the number of users who return fewer than three times. We call this measure loyalty3 rate.

$$\text{Loyalty3 rate} = \frac{\sum_{x \ge 3} L(x)}{L(1) + L(2)}. \quad (10)$$

For each measure, and each recommender, we have collected samples with values from the 11 periods. We then compare averages and standard deviations (SDs) of the measures and perform two-tailed t tests (α = 0.05) to determine the significance of the differences. We also show graphically the evolution of the measures during the evaluation period.
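The assignment rule and the derived measures of Eqs. (7)-(10) are simple enough to sketch directly. The following is an illustrative sketch only: the function names, the order of the recommenders in `RECOMMENDERS` (i.e., which remainder maps to which system) and the example counts are assumptions, not values from the study.

```python
from typing import Dict

# Which remainder maps to which recommender is an assumption made for illustration.
RECOMMENDERS = ("Usage", "Content", "Mix")

def assign_recommender(user_id: int) -> str:
    """Assign a user to a recommender by the remainder of user_id divided by 3."""
    return RECOMMENDERS[user_id % 3]

def activity_rate(p: int, b: int, s: int) -> float:
    """Eq. (7): playlist plus blacklist additions per session."""
    return (p + b) / s

def absolute_acceptance_rate(p: int, s: int) -> float:
    """Eq. (8): playlist additions per session."""
    return p / s

def relative_acceptance_rate(p: int, b: int) -> float:
    """Eq. (9): share of reactions that are playlist additions."""
    return p / (p + b)

def loyalty3_rate(returns: Dict[int, int]) -> float:
    """Eq. (10): users returning 3+ times over users returning once or twice.

    returns[x] is L(x), the number of users who returned x times in the period.
    """
    loyal = sum(count for x, count in returns.items() if x >= 3)
    return loyal / (returns.get(1, 0) + returns.get(2, 0))

# Hypothetical counts for one two-week period of one recommender.
P, B, S = 30, 20, 500
period_activity = activity_rate(P, B, S)
period_absolute = absolute_acceptance_rate(P, S)
period_relative = relative_acceptance_rate(P, B)
period_loyalty = loyalty3_rate({1: 40, 2: 10, 3: 15, 4: 5, 7: 5})
```

Because the assignment depends only on the user ID, the same user is always served by the same recommender, which is the property the methodology relies on.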
5.2 Results

In this section, we discuss the results obtained with our case study. During the evaluation period there were about 57,000 sessions involving recommendations, where 1,327 users made 3,267 additions to playlists and 3, additions to blacklists.

We start by analyzing the relative acceptance rate. In Table 1, Mix shows a slightly lower relative acceptance rate than Content and Usage. However, the differences are not significant (this is due to the high variability of all three recommenders with respect to the 11 periods of 2 weeks, Fig. 3, as shown in the relatively high SDs), and all three recommenders have an average relative acceptance around 0.5. This can be understood as follows: in response to a given recommendation, the user is as likely to react with an addition to playlist (i.e., a positive reaction) as with an addition to blacklist (i.e., a negative reaction). This appears to be true for all three recommenders and does not, however, mean that the three recommenders have a similar performance. Indeed, given a recommendation, a user can either react by an addition to playlist or to blacklist, or not react at all, which, in our opinion, is another negative reaction.

Table 1 Relative acceptance rate. Columns: Systems, Mean, SD, p value. Rows: Mix, Content, Usage. Differences between methods are not statistically significant (p value > 0.05)

Fig. 3 Relative acceptance rate per period

As can be seen in Table 2 (activity rate), our data show that, for the same number of recommendations, the Mix recommender results in more user activity than the other two. Mix shows gains over both Content and Usage (87 % in the case of Usage). In other words, it appears that users are more likely to react to recommendations when confronted with recommendations of Mix than with those of the other two. This means that users will generate more additions to playlist, and more additions to blacklist, with Mix than with Content and Usage. This increased activity is very visible in Fig. 4. For period 6 (from December 29, 2010 to January 11, 2011), Mix is worse than Content and Usage. In all remaining periods, Mix outperforms the other two systems.

Table 2 Activity rate. Columns: Systems, Mean, SD, p value. Rows: Mix, Content (0.074*), Usage (0.088*). * Recommendation methods whose differences with Mix are statistically significant (p value < 0.05)

Fig. 4 Activity rate per period

Table 3 Absolute acceptance rate. Columns: Systems, Mean, SD, p value. Rows: Mix, Content (0.037*), Usage (0.054*). * Recommendation methods whose differences with Mix are statistically significant (p value < 0.05)
In Table 3 (absolute acceptance rate), we can see that Mix is significantly better than the Content and Usage systems. When compared to Content, Mix presents a gain of 119 %. With respect to Usage, it shows a gain of 50 %. This means that users getting the Mix suggestions had a significant tendency to react more positively to recommendations. In Fig. 5, we can see that the behavior of Mix tends to be much better than that of its competitors over time. This may be due to a higher variety in recommendations motivating users to listen to more tracks and interact more.

Fig. 5 Absolute acceptance rate per period

We also computed the loyalty3 rate. This indicator shows the proportion of the number of users visiting the site three or more times with respect to the ones who return at most twice. In Table 4, we see that the Mix recommender is similar to Content but significantly better than Usage. There, Mix presents a gain of 16 % when compared to Usage. In Fig. 6, we can see that there is a larger difference in the beginning but, afterwards, the three solutions tend to have similar results.

Table 4 Loyalty3 rate. Columns: Systems, Mean, SD, p value. Rows: Mix, Content, Usage (1.620*). * Recommendation methods whose differences with Mix are statistically significant (p value < 0.05)

Fig. 6 Loyalty3 rate per period

5.3 Relating activity and loyalty

One interesting question is how the level of response to recommendations affects loyalty. To try to answer that question, we have looked at the relation between each of the activity/acceptance measures and the loyalty3 measure. In Fig. 7, we can see that the rise in activity tends to increase the loyalty of the users in the case of the Mix recommender (with a Pearson correlation of 0.56) and, to a lesser extent, in the case of the Content recommender (0.12 correlation). The Usage recommender shows a practically zero correlation between activity rate and loyalty. We can see that Mix shows a wider dispersion of values. One tentative explanation for these observations is that recommenders bring more activity and involvement and generate more loyalty. However, the relation between activity rate and loyalty is not directly observable here, since we are not considering the activity of loyal users only, but comparing activity and loyalty for all users and for each two-week period. The wider spread of loyalty3 for the Mix recommender suggests that its recommendations may be more controversial.

Whereas activity seems to be positively related with loyalty, acceptance (absolute and relative) is not, in the case of the Mix recommender (Figs. 8, 9). This may suggest that returning users, despite being more active, tend to reject more Mix recommendations. The Content recommender shows a positive correlation between acceptance and loyalty (0.36 and 0.30 for absolute acceptance rate and relative acceptance rate, respectively).

Fig. 8 Absolute acceptance rate versus loyalty3 rate

Fig. 9 Relative acceptance rate versus loyalty3 rate
The Content recommender has the lowest average activity and acceptance rates, but it is the one that shows the strongest relation with the loyalty measure. This may indicate that it is able to generate catchy recommendations for returning users.

Fig. 7 Activity rate versus loyalty3 rate

6 Discussion

In general, our case study shows that Mix generates more activity and at least the same amount of positive responses as Content and Usage (or more, depending on the evaluation measure). This may be due to a higher variety in recommendations generated by Mix, motivating users to listen to more tracks and interact more with them. We see in Figs. 6 and 7 that Mix has good results in terms of promoting user loyalty. All in all, this leads us to argue that Mix is a better option for music recommendation than the other two recommenders.

We should note, however, that Mix performs poorly in the mid periods. This is very visible in Figs. 3, 4 and 5. It may be due to the conditions under which this study was conducted. During the evaluation, for operational reasons, there were no updates to the recommender models. This may have caused some saturation in the recommendations shown to users, which might have led to a generally lower response rate for the Mix recommender during the mid periods. Consequently, we believe that the most important point of future work is the adaptation of the recommender models over time.

We are currently developing a monitoring tool for continuously collecting and analyzing the activity of the recommenders of the site. This will allow the owners of the site to keep an eye on the impact of the recommenders. It will also give us more reliable data and enable us to look into other facets of the recommendations, such as variety and sensitivity to the order. With that information we will be able to better understand what makes users more active, as well as to design recommenders that may have different mixes, depending on the profile of the user.

7 Summary

In this paper, we have proposed and evaluated a music recommender system that combines usage and content data. Evaluation was conducted online, with real users, on a commercial music web site, over 22 weeks. Our work is end-to-end and included the development of the recommenders, their deployment and maintenance, and the whole evaluation setup.
The users were dynamically divided into three groups, and we collected data, using Google Analytics and the site's internal database, on how users responded to the recommendations shown. We proposed some measures for comparing the performance of the solutions. We concluded that Mix is, overall, a better option for providing music recommendations than the other two systems. Mix is currently the core recommendation engine on the site. Future work relates to adaptation over time of recommendation models and more detailed monitoring of user data.

Acknowledgments This work is supported by the QREN-AdI Palco3.0/3121 PONORTE; the European Commission, FP7 (Seventh Framework Programme), ICT Networked Media and Search Systems, grant agreement No ; and the ERDF through the Programme COMPETE and by the Portuguese Government through FCT-Foundation for Science and Technology, project ref. PTDC/EAT-MMU/ /2009-(FCOM FEDER ).


More information

in the Howard County Public School System and Rocketship Education

in the Howard County Public School System and Rocketship Education Technical Appendix May 2016 DREAMBOX LEARNING ACHIEVEMENT GROWTH in the Howard County Public School System and Rocketship Education Abstract In this technical appendix, we present analyses of the relationship

More information

Audio Structure Analysis

Audio Structure Analysis Lecture Music Processing Audio Structure Analysis Meinard Müller International Audio Laboratories Erlangen meinard.mueller@audiolabs-erlangen.de Music Structure Analysis Music segmentation pitch content

More information

A repetition-based framework for lyric alignment in popular songs

A repetition-based framework for lyric alignment in popular songs A repetition-based framework for lyric alignment in popular songs ABSTRACT LUONG Minh Thang and KAN Min Yen Department of Computer Science, School of Computing, National University of Singapore We examine

More information

Life Soundtrack Recovery for Alzheimer s disease patients

Life Soundtrack Recovery for Alzheimer s disease patients Life Soundtrack Recovery for Alzheimer s disease patients Felipe Luis Navarro Valero Master Thesis UPF / 2013 Master in Sound and Music Computing Master thesis supervisors: Emilia Gómez Perfecto Herrera

More information

Effects of acoustic degradations on cover song recognition

Effects of acoustic degradations on cover song recognition Signal Processing in Acoustics: Paper 68 Effects of acoustic degradations on cover song recognition Julien Osmalskyj (a), Jean-Jacques Embrechts (b) (a) University of Liège, Belgium, josmalsky@ulg.ac.be

More information

GOOD-SOUNDS.ORG: A FRAMEWORK TO EXPLORE GOODNESS IN INSTRUMENTAL SOUNDS

GOOD-SOUNDS.ORG: A FRAMEWORK TO EXPLORE GOODNESS IN INSTRUMENTAL SOUNDS GOOD-SOUNDS.ORG: A FRAMEWORK TO EXPLORE GOODNESS IN INSTRUMENTAL SOUNDS Giuseppe Bandiera 1 Oriol Romani Picas 1 Hiroshi Tokuda 2 Wataru Hariya 2 Koji Oishi 2 Xavier Serra 1 1 Music Technology Group, Universitat

More information

Release Year Prediction for Songs

Release Year Prediction for Songs Release Year Prediction for Songs [CSE 258 Assignment 2] Ruyu Tan University of California San Diego PID: A53099216 rut003@ucsd.edu Jiaying Liu University of California San Diego PID: A53107720 jil672@ucsd.edu

More information

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video Mohamed Hassan, Taha Landolsi, Husameldin Mukhtar, and Tamer Shanableh College of Engineering American

More information

MUSICAL INSTRUMENT RECOGNITION WITH WAVELET ENVELOPES

MUSICAL INSTRUMENT RECOGNITION WITH WAVELET ENVELOPES MUSICAL INSTRUMENT RECOGNITION WITH WAVELET ENVELOPES PACS: 43.60.Lq Hacihabiboglu, Huseyin 1,2 ; Canagarajah C. Nishan 2 1 Sonic Arts Research Centre (SARC) School of Computer Science Queen s University

More information

Temporal Dynamics in Music Listening Behavior: A Case Study of Online Music Service

Temporal Dynamics in Music Listening Behavior: A Case Study of Online Music Service 9th IEEE/ACIS International Conference on Computer and Information Science Temporal Dynamics in Music Listening Behavior: A Case Study of Online Music Service Chan Ho Park Division of Technology and Development

More information

Interactive Classification of Sound Objects for Polyphonic Electro-Acoustic Music Annotation

Interactive Classification of Sound Objects for Polyphonic Electro-Acoustic Music Annotation for Polyphonic Electro-Acoustic Music Annotation Sebastien Gulluni 2, Slim Essid 2, Olivier Buisson, and Gaël Richard 2 Institut National de l Audiovisuel, 4 avenue de l Europe 94366 Bry-sur-marne Cedex,

More information

Identifying Related Documents For Research Paper Recommender By CPA and COA

Identifying Related Documents For Research Paper Recommender By CPA and COA Preprint of: Bela Gipp and Jöran Beel. Identifying Related uments For Research Paper Recommender By CPA And COA. In S. I. Ao, C. Douglas, W. S. Grundfest, and J. Burgstone, editors, International Conference

More information

Perfecto Herrera Boyer

Perfecto Herrera Boyer MIRages: an account of music audio extractors, semantic description and context-awareness, in the three ages of MIR Perfecto Herrera Boyer Music, DTIC, UPF PhD Thesis defence Directors: Xavier Serra &

More information

Lyrics Classification using Naive Bayes

Lyrics Classification using Naive Bayes Lyrics Classification using Naive Bayes Dalibor Bužić *, Jasminka Dobša ** * College for Information Technologies, Klaićeva 7, Zagreb, Croatia ** Faculty of Organization and Informatics, Pavlinska 2, Varaždin,

More information

STRUCTURAL ANALYSIS AND SEGMENTATION OF MUSIC SIGNALS

STRUCTURAL ANALYSIS AND SEGMENTATION OF MUSIC SIGNALS STRUCTURAL ANALYSIS AND SEGMENTATION OF MUSIC SIGNALS A DISSERTATION SUBMITTED TO THE DEPARTMENT OF TECHNOLOGY OF THE UNIVERSITAT POMPEU FABRA FOR THE PROGRAM IN COMPUTER SCIENCE AND DIGITAL COMMUNICATION

More information

EE373B Project Report Can we predict general public s response by studying published sales data? A Statistical and adaptive approach

EE373B Project Report Can we predict general public s response by studying published sales data? A Statistical and adaptive approach EE373B Project Report Can we predict general public s response by studying published sales data? A Statistical and adaptive approach Song Hui Chon Stanford University Everyone has different musical taste,

More information

AUTOREGRESSIVE MFCC MODELS FOR GENRE CLASSIFICATION IMPROVED BY HARMONIC-PERCUSSION SEPARATION

AUTOREGRESSIVE MFCC MODELS FOR GENRE CLASSIFICATION IMPROVED BY HARMONIC-PERCUSSION SEPARATION AUTOREGRESSIVE MFCC MODELS FOR GENRE CLASSIFICATION IMPROVED BY HARMONIC-PERCUSSION SEPARATION Halfdan Rump, Shigeki Miyabe, Emiru Tsunoo, Nobukata Ono, Shigeki Sagama The University of Tokyo, Graduate

More information

An Efficient Low Bit-Rate Video-Coding Algorithm Focusing on Moving Regions

An Efficient Low Bit-Rate Video-Coding Algorithm Focusing on Moving Regions 1128 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 11, NO. 10, OCTOBER 2001 An Efficient Low Bit-Rate Video-Coding Algorithm Focusing on Moving Regions Kwok-Wai Wong, Kin-Man Lam,

More information

arxiv: v1 [cs.ir] 16 Jan 2019

arxiv: v1 [cs.ir] 16 Jan 2019 It s Only Words And Words Are All I Have Manash Pratim Barman 1, Kavish Dahekar 2, Abhinav Anshuman 3, and Amit Awekar 4 1 Indian Institute of Information Technology, Guwahati 2 SAP Labs, Bengaluru 3 Dell

More information

Creating a Feature Vector to Identify Similarity between MIDI Files

Creating a Feature Vector to Identify Similarity between MIDI Files Creating a Feature Vector to Identify Similarity between MIDI Files Joseph Stroud 2017 Honors Thesis Advised by Sergio Alvarez Computer Science Department, Boston College 1 Abstract Today there are many

More information

HUMAN PERCEPTION AND COMPUTER EXTRACTION OF MUSICAL BEAT STRENGTH

HUMAN PERCEPTION AND COMPUTER EXTRACTION OF MUSICAL BEAT STRENGTH Proc. of the th Int. Conference on Digital Audio Effects (DAFx-), Hamburg, Germany, September -8, HUMAN PERCEPTION AND COMPUTER EXTRACTION OF MUSICAL BEAT STRENGTH George Tzanetakis, Georg Essl Computer

More information

EVALUATING THE EVALUATION MEASURES FOR BEAT TRACKING

EVALUATING THE EVALUATION MEASURES FOR BEAT TRACKING EVALUATING THE EVALUATION MEASURES FOR BEAT TRACKING Mathew E. P. Davies Sound and Music Computing Group INESC TEC, Porto, Portugal mdavies@inesctec.pt Sebastian Böck Department of Computational Perception

More information

Mood Tracking of Radio Station Broadcasts

Mood Tracking of Radio Station Broadcasts Mood Tracking of Radio Station Broadcasts Jacek Grekow Faculty of Computer Science, Bialystok University of Technology, Wiejska 45A, Bialystok 15-351, Poland j.grekow@pb.edu.pl Abstract. This paper presents

More information

Region Adaptive Unsharp Masking based DCT Interpolation for Efficient Video Intra Frame Up-sampling

Region Adaptive Unsharp Masking based DCT Interpolation for Efficient Video Intra Frame Up-sampling International Conference on Electronic Design and Signal Processing (ICEDSP) 0 Region Adaptive Unsharp Masking based DCT Interpolation for Efficient Video Intra Frame Up-sampling Aditya Acharya Dept. of

More information

Music Genre Classification

Music Genre Classification Music Genre Classification chunya25 Fall 2017 1 Introduction A genre is defined as a category of artistic composition, characterized by similarities in form, style, or subject matter. [1] Some researchers

More information

A Discriminative Approach to Topic-based Citation Recommendation

A Discriminative Approach to Topic-based Citation Recommendation A Discriminative Approach to Topic-based Citation Recommendation Jie Tang and Jing Zhang Department of Computer Science and Technology, Tsinghua University, Beijing, 100084. China jietang@tsinghua.edu.cn,zhangjing@keg.cs.tsinghua.edu.cn

More information

SWITCHED INFINITY: SUPPORTING AN INFINITE HD LINEUP WITH SDV

SWITCHED INFINITY: SUPPORTING AN INFINITE HD LINEUP WITH SDV SWITCHED INFINITY: SUPPORTING AN INFINITE HD LINEUP WITH SDV First Presented at the SCTE Cable-Tec Expo 2010 John Civiletto, Executive Director of Platform Architecture. Cox Communications Ludovic Milin,

More information

A TEXT RETRIEVAL APPROACH TO CONTENT-BASED AUDIO RETRIEVAL

A TEXT RETRIEVAL APPROACH TO CONTENT-BASED AUDIO RETRIEVAL A TEXT RETRIEVAL APPROACH TO CONTENT-BASED AUDIO RETRIEVAL Matthew Riley University of Texas at Austin mriley@gmail.com Eric Heinen University of Texas at Austin eheinen@mail.utexas.edu Joydeep Ghosh University

More information

Influence of Discovery Search Tools on Science and Engineering e-books Usage

Influence of Discovery Search Tools on Science and Engineering e-books Usage Paper ID #5841 Influence of Discovery Search Tools on Science and Engineering e-books Usage Mr. Eugene Barsky, University of British Columbia Eugene Barsky is a Science and Engineering Librarian at the

More information

EVALUATING THE GENRE CLASSIFICATION PERFORMANCE OF LYRICAL FEATURES RELATIVE TO AUDIO, SYMBOLIC AND CULTURAL FEATURES

EVALUATING THE GENRE CLASSIFICATION PERFORMANCE OF LYRICAL FEATURES RELATIVE TO AUDIO, SYMBOLIC AND CULTURAL FEATURES EVALUATING THE GENRE CLASSIFICATION PERFORMANCE OF LYRICAL FEATURES RELATIVE TO AUDIO, SYMBOLIC AND CULTURAL FEATURES Cory McKay, John Ashley Burgoyne, Jason Hockman, Jordan B. L. Smith, Gabriel Vigliensoni

More information