WELCOME TO BSDU - KNOWLEDGE RESOURCE CENTER


BHARTIYA SKILL DEVELOPMENT UNIVERSITY, JAIPUR
KNOWLEDGE RESOURCE CENTER (LIBRARY)
Online Public Access catalogue(OPAC)

“Library is a heart of an institution" ― Dr S. Radhakrishnan

“Never Stop Reading"

Speech and Audio Processing (Record no. 533)

000 -LEADER
fixed length control field 04866nam a2200229Ia 4500
001 - CONTROL NUMBER
control field 0001998
003 - CONTROL NUMBER IDENTIFIER
control field OSt
005 - DATE AND TIME OF LATEST TRANSACTION
control field 20190315163719.0
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field 170602s9999 xx 000 0 und d
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 9788126534081
028 ## - PUBLISHER NUMBER
Qualifying information 2016
Source Allied Informatics, Jaipur
040 ## - CATALOGING SOURCE
Language of cataloging English
Original cataloging agency BSDU
Transcribing agency BSDU
082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number 621.382
Item number APT
100 ## - MAIN ENTRY--PERSONAL NAME
Personal name Apte, Shaila D
245 #0 - TITLE STATEMENT
Title Speech and Audio Processing
260 ## - PUBLICATION, DISTRIBUTION, ETC.
Name of publisher, distributor, etc. Wiley India Pvt. Ltd. India
Place of publication, distribution, etc. New Delhi
Date of publication, distribution, etc. 2012,c2012
300 ## - PHYSICAL DESCRIPTION
Extent 438
500 ## - GENERAL NOTE
General note Speech and Audio Processing is a text targeted towards the final year undergraduate Speech Processing course and PG students in ECE, CS, and IT streams. This book aims at explaining the basic concepts in a clear-cut and simplified manner. It begins with the human speech production mechanism and then goes on to the fundamental parameters of speech such as pitch frequency, formants, spectral features like log spectrum, 3-D spectrogram, cepstral features, MFCC, linear prediction coefficients, transform-domain parameters, template matching techniques, etc. It deals with applications like speech coding, speech recognition, speaker recognition, and speech synthesis.
504 ## - BIBLIOGRAPHY, ETC. NOTE
Bibliography, etc. note Contents
Fundamentals of Speech

· The Human Speech Production Mechanism

· LTI Model for Speech Production

· Nature of the Speech Signal

· Linear Time-Varying Model

· Phonetics

· Types of Speech

· Voiced and Unvoiced Decision Making

· Audio File Formats: Nature of the WAV File



Parameters of Speech: Pitch and Formants

· Fundamental Frequency or Pitch Frequency

· Parallel Processing Approach for Calculation of Pitch Frequency

· Pitch Period Measurement Using Spectral Domain

· Cepstral Domain

· Formants and Their Relation With LPC

· Evaluation of Formants Using Cepstrum

· Evaluation of Formants Using Log Spectrum

· Evaluation of Formants Using Power Spectral Density Estimate

· Estimation of Formants: Other Methods



Spectral Parameters of Speech

· Homomorphic Processing

· Cepstral Analysis of Speech: Cepstral Coefficients

· The Auditory System as a Filter Bank

· Mel Frequency Cepstral Coefficients (MFCCs)

· Perceptual Linear Prediction (PLP)

· Log Frequency Power Coefficients (LFPCs)

· Relative Spectral Perceptual Linear Prediction (Rasta-PLP): Strategies for Robustness

· Short-Time Spectral Analysis of Speech: Short-Time Fourier Transform (STFT)

· Wavelet Transform Analysis of Speech



Linear Prediction of Speech

· Lattice Structure Realization

· Forward Linear Prediction

· Autocorrelation Method

· Covariance Method

· Lattice Methods

· Selection of Order of the Predictor

· Line Spectral Frequencies/Line Spectral Pair Frequencies



Speech Quantization and Coding

· Uniform and Non-Uniform Quantizers and Coder

· Companded Quantizers

· Uniform Quantization of Non-uniform Sources: Adaptive Quantizers

· Waveform Coding of Speech

· Comparison of Different Waveform Coding Techniques

· Parametric Speech Coding Techniques

· Sinusoidal Speech Coding Techniques

· Mixed Excitation Linear Prediction Coder

· Multi-Mode Speech Coding (Hybrid Coder)

· Transform Domain Coding of Speech



Speech Processing Applications

· Speech Recognition Systems

· Architecture of a Large Vocabulary Continuous Speech Recognition System

· Deterministic Sequence Recognition for ASR

· Statistical Sequence Recognition for ASR

· Statistical Pattern Recognition and Parameter Estimation

· VQ-HMM-Based Speech Recognition

· Discriminant Acoustic Probability Estimation

· Word Spotting/Keyword Spotting

· Speech Recognition and Understanding

· Speaker Recognition

· Distortion Measures: Mathematical and Perceptual

· Speech Enhancement

· Adaptive Echo Cancellation



Speech Synthesis

· A Text-to-Speech System

· Synthesizer Technologies

· Speech Synthesis Using Other Methods

· Speech Transformations

· Emotion Recognition from Speech

· Watermarking for Authentication of a Speech/Music Signal



Basics of Musical Instruments and Music Synthesis

· Indian Musical Instruments

· Features Used for Classification

· Music Synthesis

· Musical Instrument Digital Interface (MIDI)

· Streaming Audio

· Piano Note Synthesis Using LPC and WT

· Audio Standards



Summary

Key Terms

Multiple Choice Questions

Review Questions

Problems (Write MATLAB Programs)

Suggested Projects (Write MATLAB Programs)

Answers



Frequently Asked Short Questions with Answers

Frequently Asked Long Questions with Pointers

Bibliography

Index
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element Electronics
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Source of classification or shelving scheme
Koha item type Books
Holdings
Withdrawn status Lost status Source of classification or shelving scheme Damaged status Not for loan Permanent Location Current Location Date acquired Cost, normal purchase price Full call number Barcode Date last seen Uniform Resource Identifier Cost, replacement price Price effective from Koha item type Collection code Shelving location
          BSDU Knowledge Resource Center, Jaipur BSDU Knowledge Resource Center, Jaipur 2016-12-09 699.00 621.382 APT 001998 2020-02-12 With CD 699.00 2017-06-02 Books    
          BSDU Knowledge Resource Center, Jaipur BSDU Knowledge Resource Center, Jaipur 2016-12-09 699.00 621.382 APT 001999 2020-02-12 With CD 699.00 2017-06-02 Books    
        Not For Loan BSDU Knowledge Resource Center, Jaipur BSDU Knowledge Resource Center, Jaipur 2016-12-09 699.00 621.382 APT 002000 2020-02-12 With CD 699.00 2017-06-02 Books Not for Loan  
        Not For Loan BSDU Knowledge Resource Center, Jaipur BSDU Knowledge Resource Center, Jaipur 2018-02-28   621.382 APT CD89 2018-02-28     2018-02-28 CDs & DVDs   Audio Visual
        Not For Loan BSDU Knowledge Resource Center, Jaipur BSDU Knowledge Resource Center, Jaipur 2018-02-28   621.382 APT CD90 2018-02-28     2018-02-28 CDs & DVDs   Audio Visual
        Not For Loan BSDU Knowledge Resource Center, Jaipur BSDU Knowledge Resource Center, Jaipur 2018-02-28   621.382 APT CD91 2018-02-28     2018-02-28 CDs & DVDs   Audio Visual

2019. All rights reserved.
Implemented & Maintained by Total IT Software Solutions Pvt. Ltd.