Abbreviations |
|
xix | |
|
|
xxiii | |
|
|
|
1 Probabilistic Graphical Models for Next-generation Genomics and Genetics |
|
|
3 | (27) |
|
|
1.1 Fine-grained Description of Living Systems |
|
|
4 | (2) |
|
|
4 | (1) |
|
|
5 | (1) |
|
1.1.3 Phenotype and Genotype |
|
|
5 | (1) |
|
1.1.4 Molecular Biology, Genetics, Genomics, and Postgenomics |
|
|
6 | (1) |
|
1.2 Higher Description Levels of Living Systems |
|
|
6 | (10) |
|
1.2.1 Complexity in Cells |
|
|
7 | (2) |
|
1.2.2 Genetics, Epigenetics, and Copy Number Polymorphism |
|
|
9 | (2) |
|
1.2.3 Epigenetics with Additional Prior Knowledge on the Genome |
|
|
11 | (1) |
|
|
11 | (2) |
|
1.2.5 Transcriptomics with Prior Biological Knowledge |
|
|
13 | (1) |
|
1.2.6 Integrating Data from Several Levels |
|
|
13 | (3) |
|
|
16 | (1) |
|
1.3 An Era of High-throughput Genomic Technologies |
|
|
16 | (7) |
|
|
16 | (3) |
|
1.3.2 Copy Number Polymorphism |
|
|
19 | (1) |
|
1.3.3 DNA Methylation Measurements |
|
|
19 | (1) |
|
1.3.4 Gene Expression Data |
|
|
20 | (1) |
|
1.3.5 Quantitative Trait Loci |
|
|
21 | (2) |
|
1.3.6 The Challenge of Handling Omics Data |
|
|
23 | (1) |
|
1.4 Probabilistic Graphical Models to Infer Novel Knowledge from Omics Data |
|
|
23 | (7) |
|
1.4.1 Gene Network Inference |
|
|
24 | (1) |
|
1.4.2 Causality Discovery |
|
|
24 | (2) |
|
1.4.3 Association Genetics |
|
|
26 | (1) |
|
|
26 | (1) |
|
1.4.5 Detection of Copy Number Variations |
|
|
26 | (1) |
|
1.4.6 Prediction of Outcomes from High-dimensional Genomic Data |
|
|
26 | (4) |
|
2 Essentials to Understand Probabilistic Graphical Models: A Tutorial about Inference and Learning |
|
|
30 | (55) |
|
|
|
32 | (1) |
|
|
32 | (6) |
|
2.3 Various Classes of Probabilistic Graphical Models |
|
|
38 | (8) |
|
2.3.1 Markov Chains and Hidden Markov Models |
|
|
38 | (1) |
|
2.3.2 Markov Random Fields |
|
|
39 | (2) |
|
2.3.3 Variants around the Concept of Markov random field |
|
|
41 | (1) |
|
|
41 | (4) |
|
2.3.5 Unifying Model and Model Extension |
|
|
45 | (1) |
|
2.4 Probabilistic Inference |
|
|
46 | (11) |
|
|
46 | (5) |
|
2.4.2 Approximate Inference |
|
|
51 | (6) |
|
2.5 Learning Bayesian networks |
|
|
57 | (12) |
|
|
58 | (3) |
|
|
61 | (8) |
|
2.6 Learning Markov random fields |
|
|
69 | (6) |
|
|
69 | (3) |
|
|
72 | (3) |
|
|
75 | (2) |
|
2.8 List of General Monographs and Focused Chapter Books |
|
|
77 | (8) |
|
|
|
3 Graphical Models and Multivariate Analysis of Microarray Data |
|
|
85 | (20) |
|
|
|
85 | (2) |
|
|
87 | (1) |
|
|
88 | (4) |
|
3.3.1 Maximum Likelihood Estimation when the Zero Pattern is Known |
|
|
89 | (1) |
|
3.3.2 Determining the Pattern of Zeroes in the Inverse Covariance Matrix |
|
|
90 | (2) |
|
|
92 | (4) |
|
3.4.1 Null Distributions by Permutation |
|
|
92 | (1) |
|
3.4.2 A Multivariate Test Statistic |
|
|
93 | (1) |
|
3.4.3 Partitioning of the Test Statistic |
|
|
94 | (1) |
|
|
95 | (1) |
|
|
96 | (3) |
|
3.6 Discussion and Conclusions |
|
|
99 | (6) |
|
4 Comparison of Mixture Bayesian and Mixture Regression Approaches to Infer Gene Networks |
|
|
105 | (16) |
|
|
|
|
106 | (1) |
|
|
107 | (5) |
|
4.2.1 Mixture Bayesian Network |
|
|
107 | (1) |
|
4.2.2 Mixture Regression Approach |
|
|
108 | (2) |
|
|
110 | (2) |
|
|
112 | (4) |
|
4.3.1 Comparison of Mixtures |
|
|
112 | (1) |
|
4.3.2 Mixture Modeling of Changes in Gene Relationships |
|
|
112 | (2) |
|
4.3.3 Interpretation of Mixtures |
|
|
114 | (2) |
|
4.3.4 Inference of Large Networks |
|
|
116 | (1) |
|
|
116 | (5) |
|
5 Network Inference in Breast Cancer with Gaussian Graphical Models and Extensions |
|
|
121 | (28) |
|
|
|
|
|
|
122 | (1) |
|
5.2 Modeling of Gene Networks by Gaussian Graphical Networks |
|
|
123 | (11) |
|
5.2.1 Simple Gaussian graphical network |
|
|
123 | (4) |
|
5.2.2 Extensions Motivated by Regulatory Network Modeling |
|
|
127 | (7) |
|
5.3 Application to Estrogen Receptor Status in Breast Cancer |
|
|
134 | (7) |
|
|
134 | (1) |
|
5.3.2 Biological Prior Definition |
|
|
135 | (4) |
|
5.3.3 Network Inference from Biological Prior: Application and Interpretation |
|
|
139 | (2) |
|
5.4 Conclusions and Discussion |
|
|
141 | (8) |
|
Part III CAUSALITY DISCOVERY |
|
|
|
6 Utilizing Genotypic Information as a Prior for Learning Gene Networks |
|
|
149 | (16) |
|
|
|
|
149 | (2) |
|
|
151 | (10) |
|
|
151 | (1) |
|
6.2.2 LCMS Method for Learning a Prior Matrix of Causal Relationships |
|
|
151 | (3) |
|
6.2.3 Bayesian Network Structure Learning |
|
|
154 | (1) |
|
6.2.4 Integrating the Prior Matrix |
|
|
155 | (1) |
|
6.2.5 Stochastic Causal Tree Method |
|
|
156 | (5) |
|
|
161 | (4) |
|
7 Bayesian Causal Phenotype Network Incorporating Genetic Variation and Biological Knowledge |
|
|
165 | (31) |
|
|
|
|
|
|
166 | (1) |
|
7.2 Joint Inference of Causal Phenotype Network and Causal QTLs |
|
|
167 | (7) |
|
7.2.1 Standard Bayesian Network Model |
|
|
168 | (1) |
|
|
169 | (1) |
|
7.2.3 Systems Genetics and Causal Inference |
|
|
170 | (2) |
|
7.2.4 QTL Mapping Conditional on Phenotype Network Structure |
|
|
172 | (1) |
|
7.2.5 Joint Inference of Phenotype Network and Causal QTLs |
|
|
173 | (1) |
|
7.3 Causal Phenotype Network Incorporating Biological Knowledge |
|
|
174 | (9) |
|
|
175 | (3) |
|
|
178 | (2) |
|
7.3.3 Summary of Encoding of Biological Knowledge |
|
|
180 | (3) |
|
|
183 | (2) |
|
7.5 Analysis of Yeast Cell-Cycle Genes |
|
|
185 | (3) |
|
|
188 | (8) |
|
8 Structural Equation Models for Studying Causal Phenotype Networks in Quantitative Genetics |
|
|
196 | (21) |
|
|
|
|
196 | (1) |
|
8.2 Classical Linear Mixed-effects Models in Quantitative Genetics |
|
|
197 | (5) |
|
8.3 Mixed-effects Structural Equation Models |
|
|
202 | (2) |
|
8.4 Data-driven Search for Phenotypic Causal Relationships |
|
|
204 | (3) |
|
|
204 | (2) |
|
|
206 | (1) |
|
8.5 Inferring Causal Structures in Genetics Applications |
|
|
207 | (3) |
|
8.5.1 Genotypic information as Instrumental Variable |
|
|
207 | (1) |
|
8.5.2 Accounting for Polygenic Confounding Effects |
|
|
208 | (2) |
|
|
210 | (7) |
|
Part IV GENETIC ASSOCIATION STUDIES |
|
|
|
9 Modeling Linkage Disequilibrium and Performing Association Studies through Probabilistic Graphical Models: a Visiting Tour of Recent Advances |
|
|
217 | (30) |
|
|
|
|
218 | (1) |
|
9.2 Modeling Linkage Disequilibrium |
|
|
219 | (9) |
|
|
221 | (1) |
|
9.2.2 Decomposable Markov Random Fields |
|
|
221 | (2) |
|
9.2.3 Bayesian Network-based Approaches without Latent Variables |
|
|
223 | (1) |
|
9.2.4 Bayesian Network-based Approaches with Latent Variables |
|
|
224 | (2) |
|
|
226 | (2) |
|
9.3 Single-SNP Approaches for Genome-wide Association Studies |
|
|
228 | (9) |
|
9.3.1 Integration of Confounding Factors |
|
|
228 | (2) |
|
9.3.2 GWAS Multilocus Approach |
|
|
230 | (5) |
|
9.3.3 Strengths and Limitations |
|
|
235 | (2) |
|
9.4 Identifying Epistasis at the Genome Scale |
|
|
237 | (4) |
|
9.4.1 Bayesian Network-based Approaches |
|
|
237 | (2) |
|
9.4.2 Markov Blanket-based Method |
|
|
239 | (1) |
|
|
240 | (1) |
|
|
241 | (1) |
|
|
242 | (5) |
|
10 Modeling Linkage Disequilibrium with Decomposable Graphical Models |
|
|
247 | (22) |
|
|
|
|
248 | (1) |
|
|
249 | (9) |
|
10.2.1 Decomposable Graphical Models |
|
|
249 | (2) |
|
10.2.2 Estimating Decomposable Graphical Models |
|
|
251 | (3) |
|
10.2.3 Application to Diploid Data by Phase Imputation |
|
|
254 | (2) |
|
10.2.4 Estimation on the Genome-Wide Scale |
|
|
256 | (2) |
|
|
258 | (7) |
|
|
258 | (2) |
|
10.3.2 Unconditional Simulation |
|
|
260 | (1) |
|
10.3.3 Phenotypes and Covariates |
|
|
261 | (2) |
|
|
263 | (2) |
|
10.4 Application to Sequence Data |
|
|
265 | (4) |
|
11 Scoring, Searching and Evaluating Bayesian Network Models of Gene-phenotype Association |
|
|
269 | (25) |
|
|
|
|
|
270 | (1) |
|
|
270 | (2) |
|
|
270 | (1) |
|
11.2.2 Genome-wide association studies |
|
|
271 | (1) |
|
11.3 A Bayesian Network Model |
|
|
272 | (1) |
|
11.4 Scoring Candidate Models |
|
|
273 | (5) |
|
11.4.1 Bayesian Network Scoring Criteria |
|
|
273 | (2) |
|
|
275 | (3) |
|
11.5 Searching over the Space of Models |
|
|
278 | (2) |
|
|
280 | (1) |
|
11.6 Determining Whether a Model is Sufficiently Noteworthy |
|
|
280 | (10) |
|
11.6.1 The Bayesian Network Posterior Probability (BNPP) |
|
|
282 | (3) |
|
11.6.2 Prior Probabilities |
|
|
285 | (2) |
|
|
287 | (3) |
|
11.7 Discussion and Further Research |
|
|
290 | (4) |
|
12 Graphical Modeling of Biological Pathways in Genome-wide Association Studies |
|
|
294 | (24) |
|
|
|
|
|
295 | (1) |
|
12.2 MRF Modeling of Gene Pathways |
|
|
296 | (4) |
|
12.3 A Bayesian Framework |
|
|
300 | (12) |
|
12.3.1 Prior Specification and Likelihood Function |
|
|
300 | (2) |
|
12.3.2 Posterior Distribution |
|
|
302 | (2) |
|
12.3.3 Making Inference Based on the Posterior Distribution |
|
|
304 | (1) |
|
|
305 | (4) |
|
12.3.5 Real Data Example---Crohn's Disease Data |
|
|
309 | (3) |
|
|
312 | (6) |
|
13 Bayesian, Systems-based, Multilevel Analysis of Associations for Complex Phenotypes: from Interpretation to Decision |
|
|
318 | (45) |
|
|
|
|
|
|
|
|
|
|
319 | (1) |
|
13.2 Bayesian network-based Concepts of Association and Relevance |
|
|
320 | (8) |
|
13.2.1 Association and Strong Relevance |
|
|
320 | (2) |
|
13.2.2 Stable Distributions, Markov Blankets and Markov Boundaries |
|
|
322 | (1) |
|
13.2.3 Further relevance types |
|
|
323 | (3) |
|
13.2.4 Necessary Subsets and Sufficient Supersets in Strong Relevance |
|
|
326 | (1) |
|
13.2.5 Relevance for Multiple Targets |
|
|
327 | (1) |
|
13.3 A Bayesian View of Relevance for Complex Phenotypes |
|
|
328 | (16) |
|
13.3.1 Estimating the Posteriors of Complex Features |
|
|
330 | (2) |
|
13.3.2 Sufficiency of the Data for Full Multivariate Analysis |
|
|
332 | (1) |
|
13.3.3 Rate of Learning: Effect of Feature and Model Complexity |
|
|
333 | (3) |
|
13.3.4 Bayesian network-based Bayesian Multilevel Analysis of Relevance |
|
|
336 | (3) |
|
13.3.5 Posteriors for Multiple Target Variables |
|
|
339 | (1) |
|
13.3.6 Subtypes of Strong and Weak Relevance |
|
|
340 | (2) |
|
13.3.7 Interaction-redundancy Scores Based on Posteriors of Strong Relevance |
|
|
342 | (2) |
|
13.4 Bayes Optimal Decisions about Multivariate Relevance |
|
|
344 | (6) |
|
13.4.1 Optimal Decision about Univariate Relevance |
|
|
344 | (1) |
|
13.4.2 Optimal Bayesian Decision to Control FDR |
|
|
345 | (3) |
|
13.4.3 General Bayes Optimal Decision about Multivariate Relevance |
|
|
348 | (2) |
|
13.5 Knowledge Fusion: Relevance of Genes and Annotations |
|
|
350 | (2) |
|
|
352 | (11) |
|
|
|
14 Bayesian Networks in the Study of Genome-wide DNA Methylation |
|
|
363 | (24) |
|
|
|
14.1 Introduction to Epigenetics |
|
|
364 | (1) |
|
14.2 Next-generation Sequencing and DNA Methylation |
|
|
365 | (5) |
|
14.2.1 Assaying Genome-wide DNA Methylation |
|
|
366 | (2) |
|
14.2.2 The methyl-Seq Method |
|
|
368 | (2) |
|
14.3 A Bayesian network for methyl-Seq Analysis |
|
|
370 | (5) |
|
|
371 | (1) |
|
14.3.2 A Generative Model |
|
|
371 | (1) |
|
14.3.3 Parameter Learning and Inference of Posterior Probabilities |
|
|
372 | (3) |
|
14.4 Genomic Structure as a Prior on Methylation Status |
|
|
375 | (4) |
|
14.5 Application: Methyltyping the Human Neutrophil |
|
|
379 | (2) |
|
14.5.1 Unmethylated Clusters |
|
|
379 | (2) |
|
|
381 | (6) |
|
15 Latent Variable Models for Analyzing DNA Methylation |
|
|
387 | (22) |
|
|
|
388 | (2) |
|
15.2 Latent Variable Methods for DNA Methylation in Low-dimensional Settings |
|
|
390 | (6) |
|
15.2.1 Discrete Latent Variables |
|
|
391 | (1) |
|
15.2.2 Continuous Latent Variables |
|
|
392 | (4) |
|
15.3 Latent Variable Methods for DNA Methylation in High-dimensional Settings |
|
|
396 | (5) |
|
15.3.1 Model-based Clustering: Recursively Partitioned Mixture Models |
|
|
396 | (3) |
|
15.3.2 Semi-Supervised Recursively Partitioned Mixture Models |
|
|
399 | (2) |
|
|
401 | (8) |
|
Part VI DETECTION OF COPY NUMBER VARIATIONS |
|
|
|
16 Detection of Copy Number Variations from Array Comparative Genomic Hybridization Data Using Linear-chain Conditional Random Field Models |
|
|
409 | (22) |
|
|
|
|
410 | (1) |
|
16.2 aCGH Data and Analysis |
|
|
411 | (2) |
|
|
411 | (1) |
|
16.2.2 Existing Algorithms |
|
|
412 | (1) |
|
16.3 Linear-chain CRF Model for aCGH Data |
|
|
413 | (8) |
|
|
415 | (2) |
|
16.3.2 Parameter Estimation |
|
|
417 | (4) |
|
16.3.3 Evaluation Methods |
|
|
421 | (1) |
|
16.4 Experimental Results |
|
|
421 | (4) |
|
|
421 | (3) |
|
|
424 | (1) |
|
|
425 | (6) |
|
Part VII PREDICTION OF OUTCOMES FROM HIGH-DIMENSIONAL GENOMIC DATA |
|
|
|
17 Prediction of Clinical Outcomes from Genome-wide Data |
|
|
431 | (16) |
|
|
|
431 | (1) |
|
17.2 Challenges with Genome-wide Data |
|
|
432 | (1) |
|
|
433 | (2) |
|
17.3.1 The Naive Bayes Model |
|
|
433 | (1) |
|
17.3.2 Bayesian Model Averaging |
|
|
434 | (1) |
|
17.3.3 Alzheimer's Disease |
|
|
434 | (1) |
|
17.4 The Model-Averaged Naive Bayes (MANB) Algorithm |
|
|
435 | (3) |
|
17.4.1 Overview of the MANB Algorithm |
|
|
435 | (1) |
|
17.4.2 Details of the MANB Algorithm |
|
|
436 | (2) |
|
|
438 | (1) |
|
|
438 | (1) |
|
|
438 | (1) |
|
|
439 | (1) |
|
|
440 | (7) |
Index |
|
447 | |