|
|
xi | |
|
|
xv | |
|
|
1 | (12) |
|
Audio Content Description |
|
|
2 | (1) |
|
MPEG-7 Audio Content Description -- An Overview |
|
|
3 | (7) |
|
MPEG-7 Low-Level Descriptors |
|
|
5 | (1) |
|
MPEG-7 Description Schemes |
|
|
6 | (3) |
|
MPEG-7 Description Definition Language (DDL) |
|
|
9 | (1) |
|
BiM (Binary Format for MPEG-7) |
|
|
9 | (1) |
|
|
10 | (3) |
|
|
13 | (46) |
|
|
13 | (1) |
|
Basic Parameters and Notations |
|
|
14 | (3) |
|
|
14 | (1) |
|
|
15 | (2) |
|
|
17 | (5) |
|
|
18 | (2) |
|
|
20 | (2) |
|
|
22 | (1) |
|
|
22 | (2) |
|
|
23 | (1) |
|
|
24 | (1) |
|
Basic Spectral Descriptors |
|
|
24 | (8) |
|
|
24 | (3) |
|
|
27 | (2) |
|
|
29 | (1) |
|
|
29 | (3) |
|
|
32 | (6) |
|
|
33 | (3) |
|
Audio Fundamental Frequency |
|
|
36 | (2) |
|
|
38 | (11) |
|
Temporal Timbral: Requirements |
|
|
39 | (1) |
|
|
40 | (1) |
|
|
41 | (1) |
|
Spectral Timbral: Requirements |
|
|
42 | (3) |
|
Harmonic Spectral Centroid |
|
|
45 | (2) |
|
Harmonic Spectral Deviation |
|
|
47 | (1) |
|
|
47 | (1) |
|
Harmonic Spectral Variation |
|
|
48 | (1) |
|
|
48 | (1) |
|
Spectral Basis Representations |
|
|
49 | (1) |
|
|
50 | (1) |
|
Beyond the Scope of MPEG-7 |
|
|
50 | (9) |
|
Other Low-Level Descriptors |
|
|
50 | (2) |
|
Mel-Frequency Cepstrum Coefficients |
|
|
52 | (3) |
|
|
55 | (4) |
|
Sound Classification and Similarity |
|
|
59 | (44) |
|
|
59 | (2) |
|
|
61 | (5) |
|
Singular Value Decomposition (SVD) |
|
|
61 | (1) |
|
Principal Component Analysis (PCA) |
|
|
62 | (1) |
|
Independent Component Analysis (ICA) |
|
|
63 | (2) |
|
Non-Negative Factorization (NMF) |
|
|
65 | (1) |
|
|
66 | (7) |
|
Gaussian Mixture Model (GMM) |
|
|
66 | (2) |
|
Hidden Markov Model (HMM) |
|
|
68 | (2) |
|
|
70 | (1) |
|
Support Vector Machine (SVM) |
|
|
71 | (2) |
|
MPEG-7 Sound Classification |
|
|
73 | (6) |
|
MPEG-7 Audio Spectrum Projection (ASP) Feature Extraction |
|
|
74 | (3) |
|
Training Hidden Markov Models (HMMs) |
|
|
77 | (2) |
|
|
79 | (1) |
|
Comparison of MPEG-7 Audio Spectrum Projection vs. MFCC Features |
|
|
79 | (5) |
|
|
84 | (1) |
|
Audio Retrieval Using Histogram Sum of Squared Differences |
|
|
85 | (1) |
|
Simulation Results and Discussion |
|
|
85 | (15) |
|
Plots of MPEG-7 Audio Descriptors |
|
|
86 | (2) |
|
|
88 | (3) |
|
Results for Distinguishing Between Speech, Music and Environmental Sound |
|
|
91 | (1) |
|
Results of Sound Classification Using Three Audio Taxonomy Methods |
|
|
92 | (4) |
|
Results for Speaker Recognition |
|
|
96 | (2) |
|
Results of Musical Instrument Classification |
|
|
98 | (1) |
|
|
99 | (1) |
|
|
100 | (3) |
|
|
101 | (2) |
|
|
103 | (68) |
|
|
103 | (1) |
|
Automatic Speech Recognition |
|
|
104 | (9) |
|
|
104 | (4) |
|
Types of Speech Recognition Systems |
|
|
108 | (3) |
|
|
111 | (2) |
|
MPEG-7 Spoken Content Description |
|
|
113 | (10) |
|
|
114 | (1) |
|
|
114 | (7) |
|
|
121 | (2) |
|
Application: Spoken Document Retrieval |
|
|
123 | (40) |
|
Basic Principles of IR and SDR |
|
|
124 | (6) |
|
|
130 | (5) |
|
|
135 | (5) |
|
Sub-Word-Based Vector Space Models |
|
|
140 | (14) |
|
|
154 | (7) |
|
Combining Word and Sub-Word Indexing |
|
|
161 | (2) |
|
|
163 | (8) |
|
|
163 | (1) |
|
|
164 | (2) |
|
|
166 | (1) |
|
|
167 | (4) |
|
|
171 | (36) |
|
|
171 | (6) |
|
|
171 | (2) |
|
|
173 | (1) |
|
Harmonic InstrumentTimbre |
|
|
174 | (2) |
|
PercussiveInstrumentTimbre |
|
|
176 | (1) |
|
|
176 | (1) |
|
|
177 | (13) |
|
|
177 | (1) |
|
|
178 | (1) |
|
|
179 | (2) |
|
|
181 | (1) |
|
|
182 | (3) |
|
|
185 | (5) |
|
|
190 | (3) |
|
|
192 | (1) |
|
|
192 | (1) |
|
Application Example: Query-by-Humming |
|
|
193 | (14) |
|
Monophonic Melody Transcription |
|
|
194 | (2) |
|
Polyphonic Melody Transcription |
|
|
196 | (4) |
|
Comparison of Melody Contours |
|
|
200 | (3) |
|
|
203 | (4) |
|
Fingerprinting and Audio Signal Quality |
|
|
207 | (24) |
|
|
207 | (1) |
|
|
207 | (13) |
|
Generalities on Audio Fingerprinting |
|
|
207 | (4) |
|
|
211 | (5) |
|
Distance and Searching Methods |
|
|
216 | (1) |
|
MPEG-7-Standardized AudioSignature |
|
|
217 | (3) |
|
|
220 | (11) |
|
AudioSignalQuality Description Scheme |
|
|
221 | (1) |
|
|
222 | (1) |
|
|
222 | (1) |
|
|
222 | (1) |
|
|
223 | (1) |
|
|
224 | (1) |
|
|
224 | (1) |
|
|
225 | (1) |
|
|
226 | (1) |
|
|
226 | (1) |
|
ErrorEvent and ErrorEventList |
|
|
226 | (1) |
|
|
227 | (4) |
|
|
231 | (40) |
|
|
231 | (3) |
|
Automatic Audio Segmentation |
|
|
234 | (20) |
|
|
235 | (1) |
|
|
236 | (1) |
|
Metric-Based Segmentation |
|
|
237 | (5) |
|
Model-Selection-Based Segmentation |
|
|
242 | (1) |
|
|
243 | (3) |
|
Hybrid Segmentation Using MPEG-7 ASP |
|
|
246 | (4) |
|
|
250 | (4) |
|
Sound Indexing and Browsing of Home Video Using Spoken Annotations |
|
|
254 | (5) |
|
A Simple Experimental System |
|
|
254 | (4) |
|
|
258 | (1) |
|
Highlights Extraction for Sport Programmes Using Audio Event Detection |
|
|
259 | (6) |
|
Goal Event Segment Selection |
|
|
261 | (1) |
|
|
262 | (3) |
|
A Spoken Document Retrieval System for Digital Photo Albums |
|
|
265 | (6) |
|
|
266 | (5) |
Index |
|
271 | |