Muutke küpsiste eelistusi

Bayesian Inference for Gene Expression and Proteomics [Kõva köide]

Edited by (University of Texas, MD Anderson Cancer Center), Edited by (Swiss Federal Institute of Technology, Zürich), Edited by (Rice University, Houston)
  • Formaat: Hardback, 456 pages, kõrgus x laius x paksus: 236x158x28 mm, kaal: 738 g, 22 Tables, unspecified
  • Ilmumisaeg: 24-Jul-2006
  • Kirjastus: Cambridge University Press
  • ISBN-10: 052186092X
  • ISBN-13: 9780521860925
Teised raamatud teemal:
  • Formaat: Hardback, 456 pages, kõrgus x laius x paksus: 236x158x28 mm, kaal: 738 g, 22 Tables, unspecified
  • Ilmumisaeg: 24-Jul-2006
  • Kirjastus: Cambridge University Press
  • ISBN-10: 052186092X
  • ISBN-13: 9780521860925
Teised raamatud teemal:
Expert overviews of Bayesian methodology, tools, and software for multi-platform high-throughput experimentation.

The interdisciplinary nature of bioinformatics presents a challenge in integrating concepts, methods, software, and multi-platform data. Although there have been rapid developments in new technology and an inundation of statistical methodology and software for the analysis of microarray gene expression arrays, there exist few rigorous statistical methods for addressing other types of high-throughput data, such as proteomic profiles that arise from mass spectrometry experiments. This book discusses the development and application of Bayesian methods in the analysis of high-throughput bioinformatics data, from medical research and molecular and structural biology. The Bayesian approach has the advantage that evidence can be easily and flexibly incorporated into statistical models. A basic overview of the biological and technical principles behind multi-platform high-throughput experimentation is followed by expert reviews of Bayesian methodology, tools, and software for single group inference, group comparisons, classification and clustering, motif discovery and regulatory networks, and Bayesian networks and gene interactions.

Arvustused

'A text that has a systematic account of Bayesian analysis in computational biology has been needed for a long time. This book is a timely publication entirely devoted to cutting-edge Bayesian methods in genomics and proteomics research and many of its contributors are leading authorities in the field. It is thus an indispensable reference for researchers who are interested in applying Bayesian techniques in their own biological research.' Ping Ma, University of Illinois, Urbana-Champaign ' an authoritative volume presents the state of the art statistical techniques that are starting to make an impact at the forefronts of modern scientific discovery.' Journal of the RSS 'A collection of 22 high quality chapters, authored by several distinguished groups of academic researchers researchers and students will appreciate an authoritative volume like the present.' Z. Q. John Lu, National Institute of Standards and Technology, Gaithersburg 'The editors have done a great job keeping the writing of diverse authors readable without great redundancy This book should be required reading for all graduate students of statistics, statistical researchers in this field, and students and researchers in other fields that use these technologies.' Tapabrata Maita, Journal of the American Statistician 'Overall, I find this text an excellent contribution to the literature on statistical methods for high throughput genomic and proteomic data analysis. The chapters are well written, the case studies are informative, and the range of topics covered is quite broad and generally logically grouped. I would highly recommend this text to both those people already working in the area and those wanting to break in. It is not only suitable for researchers developing their own methodologies but also for applied quantitative scientists looking for the most cutting-edge tools to analyze their high throughput datasets.' J. Sunil Rao, Biometrics

Muu info

Expert overviews of Bayesian methodology, tools and software for multi-platform high-throughput experimentation.
List of Contributors
xi
Preface xv
An Introduction to High-Throughput Bioinformatics Data
1(39)
Keith A. Baggerly
Kevin R. Coombes
Jeffrey S. Morris
Introduction
1(1)
Microarrays
2(17)
Sage
19(5)
Mass Spectrometry
24(10)
Finding Data
34(6)
Hierarchical Mixture Models for Expression Profiles
40(13)
Michael A. Newton
Ping Wang
Christina Kendziorski
Introduction
40(3)
Dual Character of Posterior Probabilities
43(2)
Differential Expression as Independence
45(2)
The Multigroup Mixture Model
47(2)
Improving Flexibility
49(4)
Bayesian Hierarchical Models for Inference in Microarray Data
53(22)
Anne-Mette K. Hein
Alex Lewin
Sylvia Richardson
Introduction
53(3)
Bayesian Hierarchical Modeling of Probe Level GeneChip Data
56(11)
Bayesian Hierarchical Model for Normalization and Differential Expression
67(3)
Predictive Model Checking
70(5)
Bayesian Process-Based Modeling of Two-Channel Microarray Experiments: Estimating Absolute mRNA Concentrations
75(22)
Mark A. van de Wiel
Marit Holden
Ingrid K. Glad
Heidi Lyng
Arnoldo Frigessi
Introduction
75(3)
The Hierarchical Model
78(4)
Reparameterization and Identifiability
82(2)
MCMC-Based Inference
84(1)
Validation
85(1)
Illustration
85(6)
TransCount Web Site and Computing Times
91(1)
A Statistical Discussion of the Model
91(2)
Discussion
93(4)
Identification of Biomarkers in Classification and Clustering of High-Throughput Data
97(19)
Mahlet G. Tadesse
Naijun Sha
Sinae Kim
Marina Vannucci
Introduction
97(3)
Bayesian Variable Selection in Linear Models
100(1)
Bayesian Variable Selection in Classification
101(2)
Bayesian Variable Selection in Clustering via Finite Mixture Models
103(3)
Bayesian Variable Selection in Clustering via Dirichlet Process Mixture Models
106(2)
Example: Leukemia Gene Expression Data
108(5)
Conclusion
113(3)
Modeling Nonlinear Gene Interactions Using Bayesian MARS
116(21)
Veerabhadran Baladandayuthapani
Chris C. Holmes
Bani K. Mallick
Raymond J. Carroll
Introduction
116(2)
Bayesian MARS Model for Gene Interaction
118(3)
Computation
121(1)
Prediction and Model Choice
122(1)
Examples
123(8)
Discussion and Summary
131(6)
Models for Probability of Under- and Overexpression: The POE Scale
137(18)
Elizabeth Garrett-Mayer
Robert Scharpf
POE: A Latent Variable Mixture Model
137(1)
The POE Model
138(6)
Unsupervised versus Semisupervised POE
144(1)
Using POE Scale
145(3)
Example: POE as Applied to Lung Cancer Microarray Data
148(4)
Discussion
152(3)
Sparse Statistical Modelling in Gene Expression Genomics
155(22)
Joseph Lucas
Carlos Carvalho
Quanli Wang
Andrea Bild
Joseph R. Nevins
Mike West
Perspective
156(1)
Sparse Regression Modelling
157(5)
Sparse Regression for Artifact Correction with Affymetrix Expression Arrays
162(5)
Sparse Latent Factor Models and Latent Factor Regressions
167(6)
Concluding Comments
173(4)
Bayesian Analysis of Cell Cycle Gene Expression Data
177(24)
Chuan Zhou
Jon C. Wakefield
Linda L. Breeden
Introduction
177(1)
Previous Studies
178(2)
Data
180(2)
Bayesian Analysis of Cell Cycle Data
182(15)
Discussion
197(4)
Model-Based Clustering for Expression Data via a Dirichlet Process Mixture Model
201(18)
David B. Dahl
Introduction
201(2)
Model
203(5)
Inference
208(1)
Simulation Study
209(3)
Example
212(4)
Conclusion
216(3)
Interval Mapping for Expression Quantitative Trait Loci
219(19)
Meng Chen
Christina Kendziorski
Introduction
219(2)
eQTL Mapping Experiments
221(1)
QTL Mapping Methods
222(1)
Currently Available eQTL Mapping Methods
223(2)
MOM Interval Mapping
225(6)
Discussion
231(7)
Bayesian Mixture Models for Gene Expression and Protein Profiles
238(16)
Michele Guindani
Kim-Anh Do
Peter Muller
Jeffrey S. Morris
Introduction
238(2)
A Nonparametric Bayesian Model for Differential Gene Expression
240(3)
A Mixture of Beta Model for MALDI-TOF Data
243(4)
A Semiparametric Mixture Model for SAGE Data
247(3)
Summary
250(4)
Shrinkage Estimation for SAGE Data Using a Mixture Dirichlet Prior
254(15)
Jeffrey S. Morris
Keith A. Baggerly
Kevin R. Coombes
Introduction
254(1)
Overview of SAGE
255(2)
Methods for Estimating Relative Abundances
257(3)
Mixture Dirichlet Distribution
260(3)
Implementation Details
263(1)
Simulation Study
264(3)
Conclusion
267(2)
Analysis of Mass Spectrometry Data Using Bayesian Wavelet-Based Functional Mixed Models
269(24)
Jeffrey S. Morris
Philip J. Brown
Keith A. Baggerly
Kevin R. Coombes
Introduction
270(1)
Overview of MALDI-TOF
270(4)
Functional Mixed Models
274(2)
Wavelet-Based Functional Mixed Models
276(4)
Analyzing Mass Spectrometry Data Using Wavelet-Based Functional Mixed Models
280(8)
Conclusion
288(5)
Nonparametric Models for Proteomic Peak Identification and Quantification
293(16)
Merlise A. Clyde
Leanna L. House
Robert L. Wolpert
Introduction
293(1)
Kernel Models for Spectra
294(2)
Prior Distributions
296(5)
Likelihood
301(1)
Posterior Inference
302(1)
Illustration
303(2)
Summary
305(4)
Bayesian Modeling and Inference for Sequence Motif Discovery
309(24)
Mayetri Gupta
Jun S. Liu
Introduction
309(2)
Biology of Transcription Regulation
311(1)
Problem Formulation, Background, and General Strategies
312(4)
A Bayesian Approach to Motif Discovery
316(4)
Extensions of the Product-Multinomial Motif Model
320(1)
HMM-Type Models for Regulatory Modules
321(6)
Model Selection through a Bayesian Approach
327(2)
Discussion: Motif Discovery Beyond Sequence Analysis
329(4)
Identification of DNA Regulatory Motifs and Regulators by Integrating Gene Expression and Sequence Data
333(14)
Deukwoo Kwon
Sinae Kim
David B. Dahl
Michael Swartz
Mahlet G. Tadesse
Marina Vannucci
Introduction
333(2)
Integrating Gene Expression and Sequence Data
335(2)
A Model for the Identification of Regulatory Motifs
337(3)
Identification of Regulatory Motifs and Regulators
340(4)
Conclusion
344(3)
A Misclassification Model for Inferring Transcriptional Regulatory Networks
347(19)
Ning Sun
Hongyu Zhao
Introduction
347(1)
Methods
348(7)
Simulation Results
355(5)
Application to Yeast Cell Cycle Data
360(1)
Discussion
361(5)
Estimating Cellular Signaling from Transcription Data
366(19)
Andrew V. Kossenkov
Ghislain Bidaut
Michael F. Ochs
Introduction
366(4)
Bayesian Decomposition
370(3)
Key Biological Databases
373(3)
Example: Signaling Activity in Saccharomyces cerevisiae
376(4)
Conclusion
380(5)
Computational Methods for Learning Bayesian Networks from High-Throughput Biological Data
385(16)
Bradley M. Broom
Devika Subramanian
Introduction
385(2)
Bayesian Networks
387(2)
Learning Bayesian Networks
389(2)
Algorithms for Learning Bayesian Networks
391(4)
Example: Learning Robust Features from Data
395(3)
Conclusion
398(3)
Bayesian Networks and Informative Priors: Transcriptional Regulatory Network Models
401(24)
Alexander J. Hartemink
Introduction
401(2)
Bayesian Networks and Bayesian Network Inference
403(4)
Adding Informative Structure Priors
407(2)
Applications of Informative Structure Priors
409(9)
Adding Informative Parameter Priors
418(1)
Discussion
419(2)
Availability of Papers and Banjo Software
421(1)
Acknowledgments
421(4)
Sample Size Choice for Microarray Experiments
425
Peter Muller
Christian Robert
Judith Rousseau
Introduction
425(3)
Optimal Sample Size as a Decision Problem
428(3)
Monte Carlo Evaluation of Predictive Power
431(1)
The Probability Model
432(3)
Pilot Data
435(1)
Example
435(1)
Conclusion
436


Kim-Anh Do is a Professor in the Department of Biostatistics and Applied Mathematics at the University of Texas M. D. Anderson Cancer Center. Her research interests are in computer-intensive statistical methods with recent focus in the development of methodology and software to analyze data produced from high-throughput optimization. Peter Müller is a Professor in the Department of Biostatistics and Applied Mathematics at the University of Texas M. D. Anderson Cancer Center. His research interests and contributions are in the areas of Markov chain Monte Carlo posterior simulation, nonparametric Bayesian inference, hierarchical models, mixture models and Bayesian decisions problems. Marina Vannucci is a Professor of Statistics at Rice University. Her research focuses on the theory and practice of Bayesian variable selection techniques and on the development of wavelet-based statistical models and their applications. Her work is often motivated by real problems that need to be addressed with suitable statistical methods.