Introduction |
|
xi | |
|
PART 1 TOOLS FOR SIGNAL COMPRESSION |
|
|
1 | (88) |
|
Chapter 1 Scalar Quantization |
|
|
3 | (20) |
|
|
3 | (1) |
|
1.2 Optimum scalar quantization |
|
|
4 | (6) |
|
1.2.1 Necessary conditions for optimization |
|
|
5 | (2) |
|
1.2.2 Quantization error power |
|
|
7 | (3) |
|
1.2.3 Further information |
|
|
10 | (1) |
|
1.2.3.1 Lloyd-Max algorithm |
|
|
10 | (1) |
|
1.2.3.2 Non-linear transformation |
|
|
10 | (1) |
|
|
10 | (1) |
|
1.3 Predictive scalar quantization |
|
|
10 | (13) |
|
|
10 | (2) |
|
1.3.2 Reminders on the theory of linear prediction |
|
|
12 | (1) |
|
1.3.2.1 Introduction: least squares minimization |
|
|
12 | (1) |
|
1.3.2.2 Theoretical approach |
|
|
13 | (1) |
|
1.3.2.3 Comparing the two approaches |
|
|
14 | (1) |
|
|
15 | (1) |
|
1.3.2.5 Levinson algorithm |
|
|
16 | (1) |
|
|
17 | (1) |
|
|
17 | (1) |
|
1.3.4 Asymptotic value of the prediction gain |
|
|
17 | (3) |
|
1.3.5 Closed-loop predictive scalar quantization |
|
|
20 | (3) |
|
Chapter 2 Vector Quantization |
|
|
23 | (14) |
|
|
23 | (1) |
|
|
23 | (3) |
|
2.3 Optimum codebook generation |
|
|
26 | (2) |
|
2.4 Optimum quantizer performance |
|
|
28 | (2) |
|
|
30 | (2) |
|
2.5.1 Tree-structured vector quantization |
|
|
31 | (1) |
|
2.5.2 Cartesian product vector quantization |
|
|
31 | (1) |
|
2.5.3 Gain-shape vector quantization |
|
|
31 | (1) |
|
2.5.4 Multistage vector quantization |
|
|
31 | (1) |
|
2.5.5 Vector quantization by transform |
|
|
31 | (1) |
|
2.5.6 Algebraic vector quantization |
|
|
32 | (1) |
|
2.6 Gain-shape vector quantization |
|
|
32 | (5) |
|
2.6.1 Nearest neighbor rule |
|
|
33 | (1) |
|
2.6.2 Lloyd-Max algorithm |
|
|
34 | (3) |
|
Chapter 3 Sub-band Transform Coding |
|
|
37 | (16) |
|
|
37 | (1) |
|
3.2 Equivalence of filter banks and transforms |
|
|
38 | (2) |
|
|
40 | (6) |
|
3.3.1 Defining the problem |
|
|
40 | (1) |
|
3.3.2 Optimum bit allocation |
|
|
41 | (2) |
|
3.3.3 Practical algorithm |
|
|
43 | (1) |
|
3.3.4 Further information |
|
|
43 | (3) |
|
|
46 | (2) |
|
|
48 | (5) |
|
|
48 | (3) |
|
|
51 | (2) |
|
|
53 | (36) |
|
|
53 | (1) |
|
4.2 Noiseless coding of discrete, memoryless sources |
|
|
54 | (12) |
|
4.2.1 Entropy of a source |
|
|
54 | (2) |
|
|
56 | (1) |
|
|
56 | (1) |
|
4.2.2.2 Uniquely decodable instantaneous code |
|
|
57 | (1) |
|
|
58 | (1) |
|
|
58 | (2) |
|
4.2.3 Theorem of noiseless coding of a memoryless discrete source |
|
|
60 | (1) |
|
|
60 | (1) |
|
|
61 | (1) |
|
|
61 | (1) |
|
|
62 | (1) |
|
4.2.4 Constructing a code |
|
|
62 | (1) |
|
|
62 | (1) |
|
4.2.4.2 Huffman algorithm |
|
|
63 | (1) |
|
|
63 | (1) |
|
|
64 | (1) |
|
|
64 | (1) |
|
|
65 | (1) |
|
|
65 | (1) |
|
4.3 Noiseless coding of a discrete source with memory |
|
|
66 | (7) |
|
|
67 | (1) |
|
4.3.2 Theorem of noiseless coding of a discrete source with memory |
|
|
68 | (1) |
|
4.3.3 Example of a Markov source |
|
|
69 | (1) |
|
|
69 | (1) |
|
4.3.3.2 Example of transmitting documents by fax |
|
|
70 | (3) |
|
4.4 Scalar quantizer with entropy constraint |
|
|
73 | (6) |
|
|
73 | (1) |
|
4.4.2 Lloyd-Max quantizer |
|
|
74 | (1) |
|
4.4.3 Quantizer with entropy constraint |
|
|
75 | (1) |
|
4.4.3.1 Expression for the entropy |
|
|
76 | (1) |
|
4.4.3.2 Jensen inequality |
|
|
77 | (1) |
|
4.4.3.3 Optimum quantizer |
|
|
78 | (1) |
|
|
78 | (1) |
|
4.5 Capacity of a discrete memoryless channel |
|
|
79 | (4) |
|
|
79 | (1) |
|
|
80 | (2) |
|
4.5.3 Noisy-channel coding theorem |
|
|
82 | (1) |
|
4.5.4 Example: symmetrical binary channel |
|
|
82 | (1) |
|
4.6 Coding a discrete source with a fidelity criterion |
|
|
83 | (6) |
|
|
83 | (1) |
|
4.6.2 Rate-distortion function |
|
|
84 | (1) |
|
|
85 | (1) |
|
4.6.3.1 Source coding theorem |
|
|
85 | (1) |
|
4.6.3.2 Combined source-channel coding |
|
|
85 | (1) |
|
4.6.4 Special case: quadratic distortion measure |
|
|
85 | (1) |
|
4.6.4.1 Shannon's lower bound for a memoryless source |
|
|
85 | (1) |
|
4.6.4.2 Source with memory |
|
|
86 | (1) |
|
|
87 | (2) |
|
PART 2 AUDIO SIGNAL APPLICATIONS |
|
|
89 | (74) |
|
Chapter 5 Introduction to Audio Signals |
|
|
91 | (10) |
|
5.1 Speech signal characteristics |
|
|
91 | (1) |
|
5.2 Characteristics of music signals |
|
|
92 | (1) |
|
5.3 Standards and recommendations |
|
|
93 | (8) |
|
5.3.1 Telephone-band speech signals |
|
|
93 | (1) |
|
5.3.1.1 Public telephone network |
|
|
93 | (1) |
|
5.3.1.2 Mobile communication |
|
|
94 | (1) |
|
5.3.1.3 Other applications |
|
|
95 | (1) |
|
5.3.2 Wideband speech signals |
|
|
95 | (1) |
|
5.3.3 High-fidelity audio signals |
|
|
95 | (1) |
|
|
96 | (1) |
|
|
96 | (1) |
|
|
96 | (3) |
|
5.3.3.4 MPEG-7 and MPEG-21 |
|
|
99 | (1) |
|
5.3.4 Evaluating the quality |
|
|
99 | (2) |
|
|
101 | (22) |
|
|
101 | (1) |
|
6.2 The 2.4 bit/s LPC-10 coder |
|
|
102 | (5) |
|
6.2.1 Determining the filter coefficients |
|
|
102 | (1) |
|
|
103 | (1) |
|
|
104 | (2) |
|
6.2.4 Determining voiced and unvoiced sounds |
|
|
106 | (1) |
|
6.2.5 Bit rate constraint |
|
|
107 | (1) |
|
|
107 | (16) |
|
|
107 | (2) |
|
6.3.2 Determining the synthesis filter coefficients |
|
|
109 | (2) |
|
6.3.3 Modeling the excitation |
|
|
111 | (1) |
|
6.3.3.1 Introducing a perceptual factor |
|
|
111 | (2) |
|
6.3.3.2 Selecting the excitation model |
|
|
113 | (1) |
|
6.3.3.3 Filtered codebook |
|
|
113 | (2) |
|
6.3.3.4 Least squares minimization |
|
|
115 | (1) |
|
6.3.3.5 Standard iterative algorithm |
|
|
116 | (1) |
|
6.3.3.6 Choosing the excitation codebook |
|
|
117 | (1) |
|
6.3.3.7 Introducing an adaptive codebook |
|
|
118 | (3) |
|
|
121 | (2) |
|
|
123 | (18) |
|
7.1 Principles of "perceptual coders" |
|
|
123 | (3) |
|
|
126 | (4) |
|
7.2.1 Time/frequency transform |
|
|
127 | (1) |
|
7.2.2 Psychoacoustic modeling and bit allocation |
|
|
128 | (1) |
|
|
128 | (2) |
|
|
130 | (4) |
|
|
134 | (1) |
|
7.5 Psychoacoustic model: calculating a masking threshold |
|
|
135 | (6) |
|
|
135 | (1) |
|
|
135 | (1) |
|
|
136 | (1) |
|
|
137 | (2) |
|
|
139 | (2) |
|
Chapter 8 Audio Coding: Additional Information |
|
|
141 | (8) |
|
8.1 Low bit rate/acceptable quality coders |
|
|
141 | (5) |
|
|
142 | (1) |
|
|
143 | (1) |
|
8.1.2.1 Historical overview |
|
|
143 | (1) |
|
8.1.2.2 Principle of PS audio coding |
|
|
143 | (1) |
|
|
144 | (1) |
|
8.1.3 Sound space perception |
|
|
145 | (1) |
|
8.2 High bit rate lossless or almost lossless coders |
|
|
146 | (3) |
|
|
146 | (1) |
|
8.2.2 ISO/IEC MPEG-4 standardization |
|
|
147 | (1) |
|
|
147 | (1) |
|
|
147 | (2) |
|
Chapter 9 Stereo Coding: A Synthetic Presentation |
|
|
149 | (14) |
|
9.1 Basic hypothesis and notation |
|
|
149 | (2) |
|
9.2 Determining the inter-channel indices |
|
|
151 | (3) |
|
9.2.1 Estimating the power and the intercovariance |
|
|
151 | (1) |
|
9.2.2 Calculating the inter-channel indices |
|
|
152 | (2) |
|
|
154 | (1) |
|
|
154 | (4) |
|
9.3.1 Development in the time domain |
|
|
155 | (2) |
|
9.3.2 In the frequency domain |
|
|
157 | (1) |
|
|
158 | (3) |
|
9.4.1 Stereo signal reconstruction |
|
|
158 | (1) |
|
|
159 | (1) |
|
|
160 | (1) |
|
9.4.4 Information transmitted via the channel |
|
|
161 | (1) |
|
9.5 Draft International Standard |
|
|
161 | (2) |
|
|
163 | (32) |
|
Chapter 10 A Speech Coder |
|
|
165 | (8) |
|
|
165 | (1) |
|
10.2 Script for the calling function |
|
|
165 | (5) |
|
10.3 Script for called functions |
|
|
170 | (3) |
|
|
173 | (22) |
|
|
173 | (1) |
|
11.2 Script for the calling function |
|
|
173 | (3) |
|
11.3 Script for called functions |
|
|
176 | (19) |
Bibliography |
|
195 | (4) |
Index |
|
199 | |