IIIT Hyderabad Publications |
|||||||||
|
Computational Study of Ornaments in Hindustani MusicAuthor: Achyuth Narayan Samudrala Date: 2018-04-13 Report no: IIIT/TH/2018/7 Advisor:Navjyoti Singh AbstractIndian classical music is monophonic in nature. Melody is key to any performance. The entire performance is based on a single raga (melodic mode). Some performances are based on a coalescence of two or more ragas. Only a small duration in any musical performance in Indian art music is the precomposed portion. The rest involves a gradual systematic exploration of the raga using various forms ¯ of improvisation. The performer has great freedom in exploring his own improvisational patterns in order to delve into the intricacies of the given raga. However the performer is bound by the frame- ¯ work of the raga, t ¯ ala and other factors such as ghar ¯ an¯ a (the musical school which he belongs to), laya ¯ (tempo). There is a great variation in quality and nature of improvisation among different practitioners based on their cognitive and imaginative abilities. There are two important tools used in improvisation: alankar/alank ¯ aran (musical ornamentation) and t ¯ anas. In this thesis we restrict our study to the present ¯ day usage of alankaran ¯ 1 . Ornamentation is an indispensable part of improvisation. The performer tries to evoke the mood of the raga (melodic-mode) with the delicate exploration of flourishes and ornamentation. Indian classical ¯ music is cognitively associated with these decorations instead of the actual underlying notes. We are interested in discovering the hidden musical patterns and structures in Indian music which are fundamental in unfolding of a raga. Recognizing these patterns can further help in music search, retrieval, recommendation and pedagogy. This thesis makes an attempt to study these ornaments computationally. We begin the study with the objective to computationally detect these ornaments in Dhrupad performances on rudra veen ¯ . a. Dhrupad is the oldest surviving form of Hindustani classical music, is monophonic ¯ in nature and has a wealth of ornamentation. We have crowd-sourced multimedia annotations of primary alankara-s, namely, meen ¯ . da (glides between notes), andolana (vacillations in the note region) and ¯ gamaka (two notes alternation). These were used as the ground truth. Two signals in the time domain, a query signal, which consists of fundamental ornament templates, and a reference signal were compared. Using a variant of dynamic time warping, we then detect the matching subsequences in the reference signal with high accuracy. This method of iterative template matching is naive and has drawbacks, some of which are: 1.) Restrictiveness of the experiment to a set of non-exhaustive list of ornament templates, 2.) Computational overheads of comparing huge pitch vectors using dynamic time warping. While DTW matching was an effective method of detecting these ornaments, we didnt use any domain specific knowledge to detect/identify these ornaments computationally. Cognitively ragas are ¯ 1Most of these definitions are either explained in the introduction section or in the glossary at the end vi vii associated with ornamentation.These along with the ways in which notes are approached,sustained and quitted constitute flows. We call them flows due to the constant change in the frequency and amplitude domains. The rate of change in frequency and amplitude domains together constitute the fundamental nature of ornamentation. We thus model ornamentation as first and second derivative of frequency and amplitude information to see whether the property of rate of change is fundamental to the idea of flows. We were able to achieve very high clustering accuracy by using this representation and our custom defined distance measure. We achieved an accuracy of 81% in distinguishing between Murki vs Kan and 84.5% in Andolana vs Ghaseet. This again shows that transition is at the heart of the definition ¯ of flows and ornaments. However this doesnt account for hidden musical patterns which might not be captured by the manually engineered representation and custom distance. It is important to note that ornaments and flows are intricate Indian music decorations which only experienced musicians can recognize and comprehend. In all of our previous experiments we have used small annotated datasets which do not exhaustively capture patterns underlying ornaments. The biggest challenge in learning and making sense of flows or ornaments is the unavailability of annotated music data. Thus, to have a big data driven exhaustive exploration of these melodic structures implies having either 1.) a multi performer crowdsourced effort to create a large exhaustive dataset, or 2.) an unsupervised/semi-supervised method of learning. To avoid the shortcomings of non-exhaustive small datasets and hand engineered representations, we also develop a robust unsupervised method to learn fixed length representations of ornaments. In the domain of speech processing, neural networks have recently successfully been used for encoding acoustic information into fixed length vectors. Existing research has shown that it is possible to transform audio segments into fixed dimensional vectors. In this thesis we extend the same model for learning fixed length features using sequence to sequence autoencoder to represent audio segments of ornaments and flows in musical data without human annotation (Curated dataset of 50 hours). We show that these learned audio representations capture essential sequential structural information and are useful in distinguishing between few different types of ornaments (recorded in isolation) in the vector space. We started this study to have a big data exploration of ornaments in Indian music but we discovered that ornaments are not present in an isolated form in most Indian music performances. Thus we conclude that its important to study ornaments as a combination of multiple melodic structures. We have also discovered that some ornaments have similar musical properties as some others which is why their learned representations are closer in the vector space. Further, we have also noticed that alankar based ¯ on tala and rhythm can very well be distinguished from the rest of the ornaments in the vector space. We ¯ finally discuss about how this model can be used for real time ornament transcription systems, ornament retrieval and tagging. This further has great applications in Indian music pedagogy, search, retrieval and recommendation Full thesis: pdf Centre for Exact Humanities |
||||||||
Copyright © 2009 - IIIT Hyderabad. All Rights Reserved. |