|
Part I The Basic Concepts |
|
|
|
|
3 | (24) |
|
1.1 Basic Concepts and Terminology |
|
|
3 | (6) |
|
1.1.1 Digital Image and Digital Video |
|
|
3 | (3) |
|
|
6 | (1) |
|
|
6 | (2) |
|
|
8 | (1) |
|
1.2 Image and Video Analysis |
|
|
9 | (5) |
|
1.2.1 Image and Video Scene Segmentation |
|
|
9 | (1) |
|
1.2.2 Image and Video Feature Description |
|
|
10 | (2) |
|
1.2.3 Object Recognition in Images/Videos |
|
|
12 | (1) |
|
1.2.4 Scene Description and Understanding |
|
|
13 | (1) |
|
1.3 Examples of Advanced Applications |
|
|
14 | (9) |
|
|
14 | (1) |
|
|
15 | (1) |
|
1.3.3 Digital Image Inpainting |
|
|
15 | (1) |
|
|
16 | (1) |
|
1.3.5 Digital Watermarking |
|
|
17 | (1) |
|
1.3.6 Visual Object Recognition |
|
|
18 | (2) |
|
|
20 | (1) |
|
1.3.8 Dynamic Scene Classification |
|
|
21 | (1) |
|
1.3.9 Pedestrian Re-identification |
|
|
22 | (1) |
|
1.3.10 Lip Recognition in Video |
|
|
22 | (1) |
|
|
23 | (4) |
|
2 Matlab Functions of Image and Video |
|
|
27 | (38) |
|
2.1 Introduction to MATLAB for Image and Video |
|
|
27 | (1) |
|
2.2 Basic Elements of MATLAB |
|
|
28 | (7) |
|
2.2.1 Working Environment |
|
|
28 | (1) |
|
|
29 | (3) |
|
2.2.3 Array and Matrix Indexing in MATLAB |
|
|
32 | (2) |
|
|
34 | (1) |
|
2.2.5 Command-Line Operations |
|
|
34 | (1) |
|
2.3 Programming Tools: Scripts and Functions |
|
|
35 | (6) |
|
|
35 | (1) |
|
|
36 | (2) |
|
2.3.3 Important Variables and Constants |
|
|
38 | (1) |
|
2.3.4 Number Representation |
|
|
38 | (1) |
|
|
39 | (2) |
|
|
41 | (1) |
|
2.4 Graphics and Visualization |
|
|
41 | (5) |
|
2.5 The Image Processing Toolbox |
|
|
46 | (12) |
|
2.5.1 The Image Processing Toolbox: An Overview |
|
|
46 | (1) |
|
2.5.2 Essential Functions and Features |
|
|
47 | (5) |
|
2.5.3 Displaying Information About an Image File |
|
|
52 | (1) |
|
2.5.4 Reading an Image File |
|
|
52 | (1) |
|
2.5.5 Data Classes and Data Conversions |
|
|
53 | (2) |
|
2.5.6 Displaying the Contents of an Image |
|
|
55 | (2) |
|
2.5.7 Exploring the Contents of an Image |
|
|
57 | (1) |
|
2.5.8 Writing the Resulting Image onto a File |
|
|
58 | (1) |
|
2.6 Video Processing in MATLAB |
|
|
58 | (5) |
|
2.6.1 Reading Video Files |
|
|
59 | (1) |
|
2.6.2 Processing Video Files |
|
|
59 | (1) |
|
2.6.3 Playing Video Files |
|
|
60 | (1) |
|
2.6.4 Writing Video Files |
|
|
61 | (1) |
|
2.6.5 Basic Digital Video Manipulation in MATLAB |
|
|
62 | (1) |
|
|
63 | (2) |
|
3 Image and Video Segmentation |
|
|
65 | (48) |
|
|
65 | (1) |
|
3.2 Threshold Segmentation |
|
|
66 | (8) |
|
3.2.1 Global Threshold Image Segmentation |
|
|
68 | (1) |
|
3.2.2 Local Dynamic Threshold Segmentation |
|
|
69 | (5) |
|
3.3 Region-Based Segmentation |
|
|
74 | (14) |
|
|
74 | (4) |
|
3.3.2 Region Splitting and Merging |
|
|
78 | (10) |
|
3.4 Segmentation Based on Partial Differential Equation |
|
|
88 | (6) |
|
3.5 Image Segmentation Based on Clustering |
|
|
94 | (3) |
|
3.6 Image Segmentation Method Based on Graph Theory |
|
|
97 | (10) |
|
|
97 | (2) |
|
3.6.2 GraphCut and Improved Image Segmentation Method |
|
|
99 | (8) |
|
3.7 Video Motion Region Extraction Method Based on Cumulative Difference |
|
|
107 | (4) |
|
|
111 | (2) |
|
4 Feature Extraction and Representation |
|
|
113 | (48) |
|
|
113 | (2) |
|
4.2 Histogram-Based Features |
|
|
115 | (6) |
|
4.2.1 Grayscale Histogram |
|
|
115 | (2) |
|
4.2.2 Histograms of Oriented Gradients |
|
|
117 | (4) |
|
|
121 | (14) |
|
4.3.1 Haralick Texture Descriptors |
|
|
122 | (4) |
|
4.3.2 Wavelet Texture Descriptors |
|
|
126 | (5) |
|
4.3.3 LBP Texture Descriptors |
|
|
131 | (4) |
|
4.4 Corner Feature Extraction |
|
|
135 | (9) |
|
|
135 | (2) |
|
4.4.2 Harris Corner Detection Operator |
|
|
137 | (4) |
|
4.4.3 SUSAN Corner Detection Algorithm |
|
|
141 | (3) |
|
4.5 Local Invariant Feature Point Extraction |
|
|
144 | (14) |
|
4.5.1 Local Invariant Point Feature of SURF |
|
|
145 | (4) |
|
4.5.2 SIFT Scale-Invariant Feature Algorithm |
|
|
149 | (9) |
|
|
158 | (3) |
|
Part II Advances in Image Processing |
|
|
|
|
161 | (48) |
|
|
161 | (1) |
|
5.2 Noise Reduction Using Spatial-Domain Techniques |
|
|
161 | (12) |
|
5.2.1 Selected Noise Probability Density Functions |
|
|
162 | (6) |
|
|
168 | (5) |
|
|
173 | (7) |
|
5.3.1 The Restoration of Defocus Blurred Image |
|
|
174 | (2) |
|
5.3.2 Restoration of Motion Blurred Image |
|
|
176 | (4) |
|
5.4 Fisheye Distortion Correction Using Spherical Coordinates Model |
|
|
180 | (6) |
|
5.5 Skew Correction of Text Images |
|
|
186 | (5) |
|
5.5.1 Feature Analysis of Text Images |
|
|
187 | (1) |
|
5.5.2 The Basic Idea of Hough Transform |
|
|
187 | (1) |
|
5.5.3 The Implementation Steps of Text Images Skew Correction |
|
|
188 | (3) |
|
5.6 Image Dehazing Correction |
|
|
191 | (9) |
|
5.6.1 Single Image Dehazing |
|
|
191 | (1) |
|
|
192 | (2) |
|
5.6.3 Implementation Steps of DCP |
|
|
194 | (1) |
|
5.6.4 Refine Transmission Map Using Soft Matting |
|
|
195 | (5) |
|
5.7 Image Deraining Correction |
|
|
200 | (6) |
|
|
200 | (1) |
|
5.7.2 Single Image De-rain with Deep Detail Network |
|
|
200 | (3) |
|
5.7.3 Implementation of Image Deraining with Deep Network |
|
|
203 | (3) |
|
|
206 | (3) |
|
|
209 | (24) |
|
|
209 | (2) |
|
6.1.1 Structure Oriented Image Inpainting Technology |
|
|
210 | (1) |
|
6.1.2 Texture-Based Image Inpainting Technology |
|
|
211 | (1) |
|
6.2 The Principle of Image Inpainting |
|
|
211 | (2) |
|
6.3 Variational PDE-Based Image Inpainting |
|
|
213 | (9) |
|
6.3.1 Image Inpainting Algorithm Based on Total Variational Model |
|
|
214 | (5) |
|
6.3.2 Image Inpainting Based on CDD Model |
|
|
219 | (3) |
|
6.4 Exemplar-Based Image Inpainting Algorithm |
|
|
222 | (8) |
|
|
230 | (3) |
|
|
233 | (38) |
|
|
233 | (1) |
|
|
234 | (9) |
|
|
234 | (2) |
|
|
236 | (4) |
|
7.2.3 Multi-temporal Fusion |
|
|
240 | (2) |
|
|
242 | (1) |
|
|
243 | (5) |
|
7.4 Image Fusion Using Wavelet Transform |
|
|
248 | (5) |
|
7.4.1 Basis of Wavelet Transform |
|
|
248 | (1) |
|
7.4.2 Discrete Dyadic Wavelet Transform of Image and Its Mallat Algorithm |
|
|
249 | (1) |
|
7.4.3 Steps of Implementation |
|
|
250 | (3) |
|
7.5 Region-Based Image Fusion |
|
|
253 | (7) |
|
7.5.1 Basic Framework of Regional Integration |
|
|
254 | (1) |
|
7.5.2 The Strategy of Regional Joint Representation |
|
|
255 | (1) |
|
7.5.3 The Rules of Fusion |
|
|
256 | (1) |
|
7.5.4 Wavelet Fusion of Regional Variance |
|
|
256 | (4) |
|
7.6 Image Fusion Using Fuzzy Dempster-Shafer Evidence Theory |
|
|
260 | (3) |
|
7.7 Image Quality and Fusion Evaluations |
|
|
263 | (5) |
|
7.7.1 Subjective Evaluation of Image Fusion |
|
|
264 | (1) |
|
7.7.2 Objective Evaluation of Image Fusion |
|
|
264 | (4) |
|
|
268 | (3) |
|
|
271 | (58) |
|
|
271 | (1) |
|
8.2 Image Stitching Based on Region |
|
|
272 | (18) |
|
8.2.1 Image Stitching Based on Ratio Matching |
|
|
273 | (3) |
|
8.2.2 Image Stitching Based on Line and Plane Feature |
|
|
276 | (7) |
|
8.2.3 Image Stitching Based on FFT |
|
|
283 | (7) |
|
8.3 Images Stitching Based on Feature Points |
|
|
290 | (30) |
|
8.3.1 SIFT Feature Points Detection |
|
|
290 | (7) |
|
8.3.2 Image Stitching Based on Harris Feature Points |
|
|
297 | (7) |
|
8.3.3 Auto-Sorting for Image Sequence |
|
|
304 | (3) |
|
8.3.4 Harris Point Registration Based on RANSAC Algorithm |
|
|
307 | (13) |
|
8.4 Panoramic Image Stitching |
|
|
320 | (7) |
|
|
327 | (2) |
|
|
329 | (22) |
|
|
329 | (5) |
|
9.2 Fragile Watermarking Based on Spatial Domain |
|
|
334 | (2) |
|
9.3 Robust Watermarking Based on DCT |
|
|
336 | (8) |
|
9.4 Semi-fragile Watermarking Based on DWT |
|
|
344 | (5) |
|
|
349 | (2) |
|
10 Visual Object Recognition |
|
|
351 | (40) |
|
10.1 Face Recognition Based on Locality Preserving Projections |
|
|
351 | (24) |
|
10.2 Facial Expression Recognition Using PCA |
|
|
375 | (5) |
|
10.3 Extraction and Recognition of Characters in Pictures |
|
|
380 | (7) |
|
|
387 | (4) |
|
Part III Advances in Video Processing and then Associated Chapters |
|
|
|
11 Visual Object Tracking |
|
|
391 | (38) |
|
11.1 Adaptive Background Modeling by Using a Mixture of Gaussians |
|
|
391 | (5) |
|
11.2 Object Tracking Based on Ransac |
|
|
396 | (5) |
|
11.3 Object Tracking Based on MeanShift |
|
|
401 | (8) |
|
11.3.1 Description of the Object Model |
|
|
402 | (1) |
|
11.3.2 A Description of the Candidate Model |
|
|
402 | (1) |
|
11.3.3 Similarity Function |
|
|
403 | (1) |
|
|
403 | (6) |
|
11.4 Object Tracking Based on Particle Filter |
|
|
409 | (9) |
|
11.4.1 Prior Knowledge of the Goal |
|
|
410 | (1) |
|
11.4.2 System State Transition |
|
|
410 | (1) |
|
11.4.3 System Observation |
|
|
411 | (1) |
|
11.4.4 Posterior Probability Calculation |
|
|
412 | (1) |
|
11.4.5 Particle Resampling |
|
|
412 | (1) |
|
11.4.6 Implementation Steps |
|
|
413 | (5) |
|
11.5 Multiple Object Tracking |
|
|
418 | (9) |
|
|
427 | (2) |
|
12 Dynamic Scene Classification Based on Topic Models |
|
|
429 | (46) |
|
|
429 | (1) |
|
12.2 Introduction to the Topic Models |
|
|
430 | (9) |
|
|
430 | (3) |
|
12.2.2 TMBP Model Based on Factor Graph |
|
|
433 | (3) |
|
12.2.3 TMBP Model Fusing Prior Knowledge |
|
|
436 | (3) |
|
12.3 Dynamic Scene Classification Based on TMBP |
|
|
439 | (12) |
|
12.4 Behavior Recognition Based on LDA Topic Model |
|
|
451 | (24) |
|
13 Image Understanding-Person Re-identification |
|
|
475 | (38) |
|
|
475 | (2) |
|
13.2 Person Re-ID Scenarios |
|
|
477 | (1) |
|
|
478 | (2) |
|
13.4 Public Datasets and Evaluation Metrics in Person Re-identification |
|
|
480 | (4) |
|
|
480 | (3) |
|
13.4.2 Evaluation Metrics |
|
|
483 | (1) |
|
13.5 Classic Feature Representations for Person Re-identification |
|
|
484 | (17) |
|
13.5.1 Salient Color Names |
|
|
484 | (3) |
|
13.5.2 Local Maximal Occurrence Representation |
|
|
487 | (14) |
|
13.6 An Example of Metric Learning Based Person Re-identification Method-XQDA |
|
|
501 | (10) |
|
|
511 | (2) |
|
14 Image and Video Understanding Based on Deep Learning |
|
|
513 | (42) |
|
|
513 | (2) |
|
14.2 Model Analysis of CNN |
|
|
515 | (7) |
|
14.2.1 Basic Modules of CNN |
|
|
515 | (1) |
|
14.2.2 Convolution and Pooling |
|
|
515 | (1) |
|
14.2.3 Activation Function |
|
|
516 | (1) |
|
14.2.4 Softmax Classifier and Cost Function |
|
|
517 | (2) |
|
14.2.5 Learning Algorithm |
|
|
519 | (2) |
|
|
521 | (1) |
|
14.2.7 Batch Normalization |
|
|
522 | (1) |
|
|
522 | (9) |
|
|
522 | (1) |
|
|
523 | (1) |
|
|
524 | (4) |
|
|
528 | (2) |
|
|
530 | (1) |
|
14.4 Deep Learning Model for Lip Recognition Instance |
|
|
531 | (8) |
|
|
531 | (1) |
|
14.4.2 Deep Network Training |
|
|
532 | (4) |
|
|
536 | (3) |
|
14.5 Deep CNN Architecture for Event Recognition Instance |
|
|
539 | (14) |
|
|
539 | (1) |
|
14.5.2 Deep Feature Extraction |
|
|
540 | (1) |
|
14.5.3 Spatial-Temporal Feature Fusion |
|
|
540 | (1) |
|
14.5.4 Fisher Vector Encoding |
|
|
541 | (1) |
|
|
542 | (11) |
|
|
553 | (2) |
Appendix: Common Evaluation Criterion |
|
555 | |