|
|
|
|
3 | (18) |
|
|
3 | (3) |
|
1.2 Phoneme Boundary Segmentation: Present Technique |
|
|
6 | (5) |
|
1.3 ANN as a Speech Processing and Recognition Tool |
|
|
11 | (4) |
|
1.3.1 Speech Recognition Using RNN |
|
|
12 | (2) |
|
1.3.2 Speech Recognition Using SOM |
|
|
14 | (1) |
|
|
15 | (1) |
|
|
16 | (1) |
|
|
16 | (5) |
|
|
17 | (4) |
|
2 Speech Processing Technology: Basic Consideration |
|
|
21 | (26) |
|
2.1 Fundamentals of Speech Recognition |
|
|
21 | (3) |
|
2.2 Speech Communication Chain |
|
|
24 | (4) |
|
2.3 Mechanism of Speech Perception |
|
|
28 | (5) |
|
2.3.1 Physical Mechanism of Perception |
|
|
28 | (2) |
|
2.3.2 Perception of Sound |
|
|
30 | (2) |
|
2.3.3 Basic Unit of Speech Perception |
|
|
32 | (1) |
|
2.4 Theories and Models of Spoken Word Recognition |
|
|
33 | (11) |
|
2.4.1 Motor Theory of Speech Perception |
|
|
35 | (2) |
|
2.4.2 Analysis-by-Synthesis Model |
|
|
37 | (1) |
|
2.4.3 Direct Realist Theory of Speech Perception |
|
|
37 | (1) |
|
|
38 | (2) |
|
|
40 | (2) |
|
|
42 | (1) |
|
2.4.7 Neighborhood Activation Model |
|
|
43 | (1) |
|
|
44 | (3) |
|
|
44 | (3) |
|
3 Fundamental Considerations of ANN |
|
|
47 | (30) |
|
|
47 | (3) |
|
|
50 | (2) |
|
3.3 Prediction and Classification Using ANN |
|
|
52 | (1) |
|
3.4 Multi Layer Perceptron |
|
|
53 | (2) |
|
3.5 Recurrent Neural Network |
|
|
55 | (8) |
|
3.6 Probabilistic Neural Network |
|
|
63 | (3) |
|
3.6.1 Architecture of a PNN Network |
|
|
64 | (2) |
|
|
66 | (5) |
|
3.7.1 Competitive Learning and Self-Organizing Map (SOM) |
|
|
69 | (2) |
|
3.8 Learning Vector Quantization |
|
|
71 | (3) |
|
|
72 | (1) |
|
|
72 | (2) |
|
|
74 | (3) |
|
|
74 | (3) |
|
4 Sounds of Assamese Language |
|
|
77 | (18) |
|
|
77 | (1) |
|
4.2 Formation of Assamese Language |
|
|
78 | (1) |
|
4.3 Phonemes of Assamese Language |
|
|
79 | (12) |
|
|
80 | (4) |
|
|
84 | (1) |
|
|
84 | (2) |
|
|
86 | (2) |
|
|
88 | (1) |
|
|
89 | (1) |
|
|
90 | (1) |
|
4.4 Some Specific Phonemical Features of Assamese Language |
|
|
91 | (1) |
|
4.5 Dialects of Assamese Language |
|
|
92 | (1) |
|
|
92 | (3) |
|
|
92 | (3) |
|
5 State of Research of Speech Recognition |
|
|
95 | (22) |
|
|
95 | (2) |
|
5.2 A Brief Overview of Speech Recognition Technology |
|
|
97 | (1) |
|
5.3 Review of Speech Recognition During the Last Two Decades |
|
|
98 | (3) |
|
5.4 Research of Speech Recognition in Indian Languages |
|
|
101 | (8) |
|
5.4.1 Statistical Approach |
|
|
102 | (4) |
|
|
106 | (3) |
|
|
109 | (8) |
|
|
109 | (8) |
|
|
|
6 Phoneme Segmentation Technique Using Self-Organizing Map (SOM) |
|
|
117 | (20) |
|
|
117 | (1) |
|
6.2 Linear Prediction Coefficient (LPC) as Speech Feature |
|
|
118 | (1) |
|
6.3 Application of SOM and PNN for Phoneme Boundary Segmentation |
|
|
119 | (2) |
|
6.4 DWT-Based Speech Segmentation |
|
|
121 | (2) |
|
6.5 Proposed SOM- and PNN-Based Segmentation Algorithm |
|
|
123 | (3) |
|
6.5.1 PNN Learning Algorithm |
|
|
124 | (1) |
|
6.5.2 SOM Weight Vector Extraction Algorithm |
|
|
125 | (1) |
|
6.5.3 PNN-Based Decision Algorithm |
|
|
125 | (1) |
|
6.6 Experimental Details and Result |
|
|
126 | (8) |
|
6.6.1 Experimental Speech Signals |
|
|
126 | (1) |
|
|
127 | (2) |
|
6.6.3 Role of PNN Smoothing Parameter |
|
|
129 | (3) |
|
6.6.4 Comparison of SOM- and DWT-Based Segmentation |
|
|
132 | (2) |
|
|
134 | (3) |
|
|
135 | (2) |
|
7 Application of Phoneme Segmentation Technique in Spoken Word Recognition |
|
|
137 | (16) |
|
|
137 | (2) |
|
7.2 Linear Prediction Model for Estimation of Formant Frequency |
|
|
139 | (3) |
|
7.2.1 Human Vocal Tract and Linear Prediction Model of Speech |
|
|
140 | (2) |
|
7.2.2 Pole or Formant Location Determination |
|
|
142 | (1) |
|
7.3 LVQ and Its Application to Codebook Design |
|
|
142 | (1) |
|
7.4 Phoneme Segmentation for Spoken Word Recognition |
|
|
143 | (3) |
|
7.4.1 RNN-Based Local Classification |
|
|
144 | (1) |
|
7.4.2 SOM-Based Segmentation Algorithm |
|
|
145 | (1) |
|
7.4.3 PNN- and Fl-Based Vowel Phoneme and Initial Phoneme Recognition |
|
|
145 | (1) |
|
7.4.4 LVQ Codebook Assisted Last Phoneme Recognition |
|
|
146 | (1) |
|
7.5 Experimental Details and Results |
|
|
146 | (5) |
|
7.5.1 Experimental Speech Signals |
|
|
148 | (1) |
|
7.5.2 RNN Training Consideration |
|
|
148 | (2) |
|
7.5.3 Phoneme Segmentation and Classification Results |
|
|
150 | (1) |
|
|
151 | (2) |
|
|
152 | (1) |
|
8 Application of Clustering Techniques to Generate a Priori Knowledge for Spoken Word Recognition |
|
|
153 | (10) |
|
|
153 | (1) |
|
8.2 K-Means Clustering (KMC) |
|
|
154 | (1) |
|
8.3 KMC Applied to Speech Data |
|
|
155 | (2) |
|
|
157 | (4) |
|
8.4.1 Experimental Speech Samples |
|
|
157 | (1) |
|
8.4.2 Role of RNN in Decision Making of the Proposed Technique |
|
|
158 | (1) |
|
8.4.3 Result and Limitation |
|
|
159 | (2) |
|
|
161 | (2) |
|
|
162 | (1) |
|
9 Application of Proposed Phoneme Segmentation Technique for Speaker Identification |
|
|
163 | (20) |
|
|
163 | (2) |
|
9.2 Certain Previous Work Done |
|
|
165 | (4) |
|
9.3 Linear Prediction Residual Feature |
|
|
169 | (1) |
|
9.4 EMD Residual-Based Source Extraction |
|
|
169 | (1) |
|
9.5 LVQ Codebook and Speaker Identification |
|
|
170 | (1) |
|
|
170 | (1) |
|
9.7 Experimental Details and Results |
|
|
171 | (1) |
|
|
171 | (9) |
|
9.8.1 Vowel Segmentation Results |
|
|
172 | (2) |
|
9.8.2 Speaker Identification Results Using LP Residual |
|
|
174 | (2) |
|
9.8.3 Speaker Identification Results Using EMD Residual |
|
|
176 | (4) |
|
|
180 | (3) |
|
|
180 | (3) |
|
|
183 | (2) |
|
|
183 | (1) |
|
|
184 | (1) |
|
|
184 | (1) |
Index |
|
185 | |