Muutke küpsiste eelistusi

Analyzing High-Dimensional Gene Expression and DNA Methylation Data with R [Kõva köide]

(University of Memphis, Tennessee, USA)
  • Formaat: Hardback, 202 pages, kõrgus x laius: 234x156 mm, kaal: 530 g, 1 Tables, black and white; 40 Illustrations, black and white
  • Sari: Chapman & Hall/CRC Computational Biology Series
  • Ilmumisaeg: 28-May-2020
  • Kirjastus: Chapman & Hall/CRC
  • ISBN-10: 1498772595
  • ISBN-13: 9781498772594
Teised raamatud teemal:
  • Formaat: Hardback, 202 pages, kõrgus x laius: 234x156 mm, kaal: 530 g, 1 Tables, black and white; 40 Illustrations, black and white
  • Sari: Chapman & Hall/CRC Computational Biology Series
  • Ilmumisaeg: 28-May-2020
  • Kirjastus: Chapman & Hall/CRC
  • ISBN-10: 1498772595
  • ISBN-13: 9781498772594
Teised raamatud teemal:
"This will be the first book to systematically describe the process and provide corresponding methods for analysing data generated from genetic- and epigenetic-studies. The overall subject is on large data analysis. Specifically, the book aims to provideapipe line for genetic and epigenetic data analysis starting from raw genome- and epigenome-scale data. It includes methods to pre-process genetic and epigenetic data in the genome-scale, methods for data mining to identify potentially informative factors, and methods for subsequent analyses after data mining, e.g., factor/variable selections, network construction, and testing for differential networks"--

Analyzing high-dimensional gene expression and DNA methylation data with R is the first practical book that shows a ``pipeline" of analytical methods with concrete examples starting from raw gene expression and DNA methylation data at the genome scale. Methods on quality control, data pre-processing, data mining, and further assessments are presented in the book, and R programs based on simulated data and real data are included. Codes with example data are all reproducible.

Features:

·         Provides a sequence of analytical tools for genome-scale gene expression data and DNA methylation data, starting from quality control and pre-processing of raw genome-scale data.

·         Organized by a parallel presentation with explanation on statistical methods and corresponding R packages/functions in quality control, pre-processing, and data analyses (e.g., clustering and networks).

·         Includes source codes with simulated and real data to reproduce the results. Readers are expected to gain the ability to independently analyze genome-scaled expression and methylation data and detect potential biomarkers.

 

This book is ideal for students majoring in statistics, biostatistics, and bioinformatics and researchers with an interest in high dimensional genetic and epigenetic studies.

Arvustused

'A big asset of the book, which makes it remarkable contribution and ideal reference book for students of statistics, biostatistics, bioinformatics as well as applied workers/researchers interested in exploring high-dimensional genetic and epigenetic, is the well-illustrated applications and reproducible R codes for thoroughly analysing gene expression and DNA methylation data sets at the genome scale along with the pipeline for analytical methods.'

-Anoop Chaturvedi, University of Allahabad, Prayagraj, India

"I would recommend this brief but consistent practical volume, especially to students with a statistical background, interested in high-dimensional genetic and epigenetic studies."

-Anca Vitcu, International Society for Clinical Biostatistics, 72, 2021

Preface xi
Chapter 1 Introduction
1(4)
1.1 Pipelines To Analyze "Omics" Data
1(2)
1.2 Rna-Seq Gene Expression In S2-Drsc Cells
3(1)
1.3 Mic Roar Ray Gene Expression In Yeast Cells And In Prostate Samples
3(1)
1.4 Dna Methylation In Normal And Colon/Rectal Adenocarcinoma Samples
4(1)
Chapter 2 Genome-Scale Gene Expression Data
5(12)
2.1 Microarray Gene Expression Data
5(8)
2.1.1 Data Generation
5(2)
2.1.2 Preprocessing And Quality Control Of Microarray Data
7(6)
2.2 Data From Next Generation Sequencing
13(4)
2.2.1 Data Generation
14(1)
2.2.2 Preprocessing And Quality Control Of Bulk Rna-Seq Data
14(3)
Chapter 3 Genome-Scale Epigenetic Data
17(22)
3.1 Data Generation
17(1)
3.2 Quality Control And Preprocessing Of Dna Methylation Data
18(4)
3.2.1 The Control Probe Adjustment And Reduction Of Global Correlation Pipeline (Cpacor)
18(3)
3.2.2 Quantile Normalization With Combat
21(1)
3.3 Cell Type Composition Inferences
22(9)
3.3.1 Reference-Based Methods
23(4)
3.3.2 Reference-Free Methods
27(4)
3.4 Appendix - Modified Programs In The Cpacor With An Application
31(8)
Chapter 4 Screening Genome-Scale Genetic And Epigenetic Data
39(18)
4.1 Screening Via Training And Testing Samples
40(1)
4.2 Screening Incorporating Surrogate Variables
41(5)
4.3 Sure Independence Screening
46(4)
4.3.1 Correlation Learning
47(3)
4.4 Non- And Semi-Parametric Screening Techniques
50(7)
4.4.1 Random Forest
50(3)
4.4.2 Support Vector Machine
53(4)
Chapter 5 Cluster Analysis In Data Mining
57(44)
5.1 Non-Parametric Cluster Analysis Methods
57(28)
5.1.1 Distances
58(1)
5.1.2 Partitioning-Based Methods
59(5)
5.1.3 Hierarchical Clustering
64(5)
5.1.4 Hybrids Of Partitioning-Based And Hierarchical Clustering
69(6)
5.1.5 Examples - Clustering To Detect Gene Expression Patterns
75(10)
5.2 Cluster Analyses In Linear Regressions
85(6)
5.3 Bicluster Analyses
91(5)
5.4 Joint Cluster Analysis
96(5)
Chapter 6 Methods To Select Genetic And Epigenetic Factors Based On Linear Associations
101(24)
6.1 Frequentist Approaches
102(6)
6.1.1 Elastic Net
102(3)
6.1.2 Adaptive Lasso
105(1)
6.1.3 Smoothly Clipped Absolute Deviation (Scad)
106(2)
6.2 Bayesian Approaches
108(10)
6.2.1 Zellner's 9-Prior
109(2)
6.2.2 Extension Of Zellner's 5-Prior To Multi-Components G-Prior
111(2)
6.2.3 The Spike-And-Slab Prior
113(5)
6.3 Examples - Selecting Important Epigenetic Factors
118(7)
Chapter 7 Non- And Semi-Parametric Methods To Select Genetic And Epigenetic Factors
125(20)
7.1 Variable Selection Based On Splines
126(2)
7.2 Overview Of The Anova-Based Approach
128(1)
7.3 Variable Selection Built Upon Reproducing Kernels
129(3)
7.4 Examples
132(13)
7.4.1 Selecting Important Epigenetic Factors
132(7)
7.4.2 Selecting Variables With Known Underlying Truth
139(6)
Chapter 8 Network Construction And Analyses
145(20)
8.1 Undirected Networks
145(7)
8.1.1 The Two-Stage Graphs Selection Method
146(1)
8.1.2 The Ggmselect Package And Gene Expression Examples
147(5)
8.2 Correlation Networks
152(6)
8.3 Bayesian Networks
158(3)
8.4 Network Comparisons
161(4)
8.4.1 Comparing Undirected Networks
162(1)
8.4.2 Comparing Bayesian Networks
163(2)
Bibliography 165(18)
Index 183
Hongmei Zhang is a Biostatistician at the University of Memphis. She has been working with gene expression and DNA methylation data and her methodological research interest is to develop corresponding statistical methods. She has been teaching courses in this field for a number of years.