Preface |
|
xi | |
1 Preliminaries |
|
1 | (112) |
|
1.1 Using the R Computing Environment |
|
|
1 | (2) |
|
|
2 | (1) |
|
|
3 | (1) |
|
1.2 Data Sets from Biological Experiments |
|
|
3 | (132) |
|
1.2.1 Arabidopsis experiment: Anna Amtmann |
|
|
4 | (2) |
|
1.2.2 Skin cancer experiment: Nighean Barr |
|
|
6 | (1) |
|
1.2.3 Breast cancer experiment: John Bartlett |
|
|
7 | (2) |
|
1.2.4 Mammary gland experiment: Gusterson group |
|
|
9 | (1) |
|
1.2.5 Tuberculosis experiment: ΒμG@S group |
|
|
10 | (103) |
I Getting Good Data |
|
113 | (22) |
|
2 Set-up of a Microarray Experiment |
|
|
15 | (8) |
|
2.1 Nucleic Acids: DNA and RNA |
|
|
15 | (1) |
|
2.2 Simple cDNA Spotted Microarray Experiment |
|
|
16 | (7) |
|
2.2.1 Growing experimental material |
|
|
17 | (1) |
|
|
17 | (1) |
|
2.2.3 Adding spiking RNA and poly-T primer |
|
|
18 | (1) |
|
2.2.4 Preparing the enzyme environment |
|
|
19 | (1) |
|
2.2.5 Obtaining labelled cDNA |
|
|
19 | (1) |
|
2.2.6 Preparing cDNA mixture for hybridization |
|
|
19 | (1) |
|
2.2.7 Slide hybridization |
|
|
20 | (3) |
|
3 Statistical Design of Microarrays |
|
|
23 | (34) |
|
|
24 | (2) |
|
|
26 | (10) |
|
3.2.1 Biological and technical replication |
|
|
27 | (2) |
|
3.2.2 How many replicates? |
|
|
29 | (1) |
|
|
30 | (6) |
|
|
36 | (4) |
|
3.3.1 Blocking, crossing and randomization |
|
|
37 | (2) |
|
3.3.2 Design and normalization |
|
|
39 | (1) |
|
3.4 Single-channel Microarray Design |
|
|
40 | (4) |
|
|
41 | (1) |
|
|
42 | (1) |
|
3.4.3 Dealing with technical replicates |
|
|
42 | (2) |
|
3.5 Two-channel Microarray Designs |
|
|
44 | (13) |
|
3.5.1 Optimal design of dual-channel arrays |
|
|
44 | (6) |
|
3.5.2 Several practical two-channel designs |
|
|
50 | (7) |
|
|
57 | (46) |
|
|
57 | (5) |
|
|
58 | (2) |
|
|
60 | (1) |
|
|
61 | (1) |
|
|
62 | (1) |
|
4.2 Introduction to Normalization |
|
|
62 | (7) |
|
4.2.1 Scale of gene expression data |
|
|
63 | (2) |
|
4.2.2 Using control spots for normalization |
|
|
65 | (1) |
|
|
65 | (4) |
|
4.3 Normalization for Dual-channel Arrays |
|
|
69 | (24) |
|
4.3.1 Order for the normalizations |
|
|
70 | (1) |
|
|
71 | (5) |
|
4.3.3 Background correction |
|
|
76 | (4) |
|
4.3.4 Dye effect normalization |
|
|
80 | (4) |
|
4.3.5 Normalization within and across conditions |
|
|
84 | (9) |
|
4.4 Normalization of Single-channel Arrays |
|
|
93 | (10) |
|
4.4.1 Affymetrix data structure |
|
|
93 | (1) |
|
4.4.2 Normalization of Affymetrix data |
|
|
94 | (9) |
|
|
103 | (22) |
|
5.1 Using MIAME in Quality Assessment |
|
|
104 | (1) |
|
5.1.1 Components of MIAME |
|
|
104 | (1) |
|
5.2 Comparing Multivariate Data |
|
|
105 | (8) |
|
|
105 | (1) |
|
5.2.2 Dissimilarity and distance measures |
|
|
106 | (5) |
|
5.2.3 Representing multivariate data |
|
|
111 | (2) |
|
5.3 Detecting Data Problems |
|
|
113 | (10) |
|
|
114 | (3) |
|
5.3.2 Normalization problems |
|
|
117 | (2) |
|
5.3.3 Hybridization problems |
|
|
119 | (2) |
|
|
121 | (2) |
|
5.4 Consequences of Quality Assessment Checks |
|
|
123 | (2) |
|
|
125 | (12) |
|
|
125 | (4) |
|
6.1.1 Single-versus dual-channel designs? |
|
|
125 | (4) |
|
6.1.2 Dye-swap experiments |
|
|
129 | (1) |
|
|
129 | (8) |
|
6.2.1 Myth: 'microarray data is Gaussian' |
|
|
129 | (2) |
|
6.2.2 Myth: 'microarray data is not Gaussian' |
|
|
131 | (1) |
|
6.2.3 Confounding spatial and dye effect |
|
|
132 | (1) |
|
6.2.4 Myth: 'non-negative background subtraction' |
|
|
133 | (2) |
II Getting Good Answers |
|
135 | (116) |
|
|
137 | (40) |
|
7.1 Discovering Sample Classes |
|
|
137 | (18) |
|
7.1.1 Why cluster samples? |
|
|
138 | (1) |
|
7.1.2 Sample dissimilarity measures |
|
|
139 | (5) |
|
7.1.3 Clustering methods for samples |
|
|
144 | (11) |
|
7.2 Exploratory Supervised Learning |
|
|
155 | (5) |
|
7.2.1 Labelled dendrograms |
|
|
156 | (1) |
|
7.2.2 Labelled PAM-type clusterings |
|
|
157 | (3) |
|
7.3 Discovering Gene Clusters |
|
|
160 | (17) |
|
7.3.1 Similarity measures for expression profiles |
|
|
160 | (3) |
|
7.3.2 Gene clustering methods |
|
|
163 | (14) |
|
8 Differential Expression |
|
|
177 | (34) |
|
|
177 | (2) |
|
8.1.1 Classical versus Bayesian hypothesis testing |
|
|
177 | (2) |
|
8.1.2 Multiple testing 'problem' |
|
|
179 | (1) |
|
8.2 Classical Hypothesis Testing |
|
|
179 | (17) |
|
8.2.1 What is a hypothesis test? |
|
|
180 | (3) |
|
8.2.2 Hypothesis tests for two conditions |
|
|
183 | (9) |
|
|
192 | (3) |
|
8.2.4 Results from skin cancer experiment |
|
|
195 | (1) |
|
8.3 Bayesian Hypothesis Testing |
|
|
196 | (15) |
|
8.3.1 A general testing procedure |
|
|
200 | (3) |
|
|
203 | (8) |
|
9 Predicting Outcomes with Gene Expression Profiles |
|
|
211 | (36) |
|
|
211 | (7) |
|
9.1.1 Probabilistic classification theory |
|
|
212 | (5) |
|
9.1.2 Modelling and predicting continuous variables |
|
|
217 | (1) |
|
9.2 Curse of Dimensionality: Gene Filtering |
|
|
218 | (5) |
|
9.2.1 Use only significantly expressed genes |
|
|
218 | (2) |
|
9.2.2 PCA and gene clustering |
|
|
220 | (2) |
|
|
222 | (1) |
|
9.2.4 Biological selection |
|
|
222 | (1) |
|
9.3 Predicting Class Memberships |
|
|
223 | (12) |
|
9.3.1 Variance-bias trade-off in prediction |
|
|
223 | (4) |
|
9.3.2 Linear discriminant analysis |
|
|
227 | (4) |
|
9.3.3 kappa-nearest neighbour classification |
|
|
231 | (4) |
|
9.4 Predicting Continuous Responses |
|
|
235 | (12) |
|
9.4.1 Penalized regression: LASSO |
|
|
235 | (8) |
|
9.4.2 kappa-nearest neighbour regression |
|
|
243 | (4) |
|
10 Microarray Myths: Inference |
|
|
247 | (4) |
|
10.1 Differential Expression |
|
|
247 | (2) |
|
10.1.1 Myth: 'Bonferroni is too conservative' |
|
|
247 | (1) |
|
10.1.2 FPR and collective multiple testing |
|
|
248 | (1) |
|
10.1.3 Misinterpreting FDR |
|
|
248 | (1) |
|
10.2 Prediction and Learning |
|
|
249 | (2) |
|
|
249 | (2) |
Bibliography |
|
251 | (8) |
Index |
|
259 | |