IIIT Hyderabad Publications
Speech enhancement for multi microphone using kepstrum approach

Author: Puram Bala Manikya Prasad
Date: 2020-06-24
Report no: IIIT/TH/2020/59
Advisor: Suryakanth V Gangashetty

Abstract

Keywords: Adaptive noise cancellation, Kepstrum approach, Beamforming, Speech enhancement.

Mobile telephony is one of the major means of communication around the world today. Conversation and communication are possible from any place in the world at any time. Even though permanent reachability and connectivity have been achieved, there is still scope for improvement in communication, especially in noisy environments. Communication performance on mobile phones is significantly affected when the surrounding environment contains noise interference such as traffic, office, or kitchen noise, which can lead to poor speech quality by both subjective and objective measures. In this thesis, a novel method for enhancing speech quality in noisy conditions is proposed. In contrast to classical noise cancellation and suppression methods, the proposed method uses the temporal and spectral dependencies of noise and speech signals. Accordingly, the thesis, titled "Speech Enhancement for Multi Microphone using Kepstrum Approach", presents a Kepstrum method for speech enhancement. The proposed method is based on Kepstrum analysis, which provides a mathematical representation for speech enhancement applications. It can be applied to system identification applications where the acoustic transfer function between two microphones is unknown. It is independent of the acoustic path model order and provides a mathematical representation with FFT-based processing. Applied as a front end for speech enhancement, the method provides improved performance and noise cancellation with many favorable effects. The enhancement techniques developed and proposed in this thesis are evaluated both subjectively and objectively by means of speech intelligibility and auditory metrics.
It is shown that the proposed method achieves better results than classical state-of-the-art approaches with respect to both noise attenuation and speech distortion. Besides mobile phones, the proposed method can also be used in Internet of Things (IoT) devices, far-field communication, conference calls, hearing aids, etc.

Full thesis: pdf

Centre for Language Technologies Research Centre