Shafik, R. A. and Yousaf-Zai, F. Q. (2004) A Novel Approach to Noisy Speech recognition using DTW algorithm with Mel-Frequency Cepstral Coefficients. Journal of Engineering and Technology (JET-IUT), 5 (2). pp. 21-29. Full text not available from this repository.
Abstract
A new and effective approach to recognition of noisy speech is introduced. End-Point-Detection algorithm is used to measure the noise power and to automatically initiate recording of a spoken word. Unvoiced components of the recorded speech, buried under noise, viz. ambient noise or hiss noise or telephone noise, were then optimally minimized by Finite Impulse Response (FIR) band pass Filter. The speech signal was then sampled and speech features were extracted using low-level and customized Mel-Frequency Cepstral Coefficients (MFCC), which were later dynamically time-warped to find the average minimal distance from Euclidean distance matrices to help facilitate the recognition of speech. For generalization, speech data from three speakers, of three different level of pitch, were collected and were compared to a mid-pitch speaker to establish both speaker independent and speaker dependent efficacy and accuracy. Such a speech recognition system can be both fast and effective even in quite noisy environments.
| Item Type: | Article | ||||
|---|---|---|---|---|---|
| Creator/Authors: |
| ||||
| Research Group: | Old ECS Groups > Information - Signals, Images, Systems Current ECS Groups > Electronic and Software Systems | ||||
| Date: | December 2004 | ||||
| Information about this record: | |||||
| Performance Indicator: | EZ~02~01~11 | ||||
| Citations: | |||||
| ID Code: | 13218 | ||||
| Last Modified: | 23 Sep 2011 10:34 | ||||
| Deposited On: | 30 Nov 2006 by Shafik, Rishad | ||||
Tools & Metadata
Corrections
ECS staff and postgraduates may modify this record






