Multimodal Sentiment Analysis of Telugu Songs

Authors: Harika Abburi,Eashwar Sai Akhil,Suryakanth V Gangashetty,Radhika Mamidi
Location Hilton, New York City, USA.
Date: 2016-07-10
Report no: IIIT/TR/2016/11

Abstract

In this paper, an approach to detect the sentiment of a song based on its multi-modality natures (text and audio) is presented. The textual lyric features are extracted from the bag of words. By using these features, Doc2Vec will generate a single vector for each song. Support Vector Machine (SVM), Naive Bayes (NB) and a combination of both these clas- sifiers are developed to classify the sentiment us- ing the textual lyric features. Audio features are used as an add-on to the lyrical ones which include prosody features, temporal features, spectral fea- tures, tempo and chroma features. Gaussian Mix- ture Models (GMM), SVM and a combination of both these classifiers are developed to classify the sentiment using audio features. GMM are known for capturing the distribution in the features and SVM are known for discriminating the features. Hence these models are combined to improve the performance of sentiment analysis. Performance is further improved by combining the text and audio feature domains. These text and audio features are extracted at the beginning, ending and for the whole song. From our experimental results, it is observed that the first 30 seconds(s) of a song gives better performance for detecting the sentiment of the song rather than the last 30s or from the whole song.

Full paper: pdf

Centre for Language Technologies Research Centre

IIIT Hyderabad Publications

Multimodal Sentiment Analysis of Telugu Songs

Abstract