Sunday, January 16, 2011

EC2024 SPEECH PROCESSING

EC2024 SPEECH PROCESSING L T P C
3 0 0 3
UNIT I MECHANICS OF SPEECH 9
Speech production: Mechanism of speech production, Acoustic phonetics - Digital
models for speech signals - Representations of speech waveform: Sampling speech
signals, basics of quantization, delta modulation, and Differential PCM - Auditory
perception: psycho acoustics.
UNIT II TIME DOMAIN METHODS FOR SPEECH PROCESSING 9
Time domain parameters of Speech signal – Methods for extracting the parameters
Energy, Average Magnitude, Zero crossing Rate – Silence Discrimination using ZCR
and energy – Short Time Auto Correlation Function – Pitch period estimation using Auto
Correlation Function.
UNIT III FREQUENCY DOMAIN METHOD FOR SPEECH PROCESSING 9
Short Time Fourier analysis: Fourier transform and linear filtering interpretations,
Sampling rates - Spectrographic displays - Pitch and formant extraction - Analysis by
Synthesis - Analysis synthesis systems: Phase vocoder, Channel Vocoder -
Homomorphic speech analysis: Cepstral analysis of Speech, Formant and Pitch
Estimation, Homomorphic Vocoders.
UNIT IV LINEAR PREDICTIVE ANALYSIS OF SPEECH 9
Basic Principles of linear predictive analysis – Auto correlation method – Covariance
method – Solution of LPC equations – Cholesky method – Durbin’s Recursive algorithm,
– Application of LPC parameters – Pitch detection using LPC parameters – Formant
analysis – VELP – CELP.
UNIT V APPLICATION OF SPEECH & AUDIO SIGNAL PROCESSING 9
Algorithms: Dynamic time warping, K-means clusering and Vector quantization,
Gaussian mixture modeling, hidden Markov modeling - Automatic Speech Recognition:
Feature Extraction for ASR, Deterministic sequence recognition, Statistical Sequence
recognition, Language models - Speaker identification and verification – Voice response
system – Speech synthesis: basics of articulatory, source-filter, and concatenative
synthesis – VOIP
TOTAL= 45 PERIODS
TEXT BOOK:
1. Thomas F, Quatieri, Discrete-Time Speech Signal Processing, Prentice Hall /
Pearson Education, 2004.
REFERENCES:
1. Ben Gold and Nelson Morgan, Speech and Audio Signal Processing, John Wiley and
Sons Inc., Singapore, 2004
2. L.R.Rabiner and R.W.Schaffer – Digital Processing of Speech signals – Prentice Hall
-1979
3. L.R. Rabiner and B. H. Juang, Fundamentals of Speech Recognition, Prentice Hall,
1993.
4. J.R. Deller, J.H.L. Hansen and J.G. Proakis, Discrete Time Processing of Speech
Signals, John Wiley, IEEE Press, 1999.

0 comments until now.

Post a Comment