Muutke küpsiste eelistusi

Primer to Analysis of Genomic Data Using R [Pehme köide]

  • Formaat: Paperback / softback, 270 pages, kõrgus x laius: 235x155 mm, kaal: 4918 g, 74 Illustrations, color; 5 Illustrations, black and white; XVI, 270 p. 79 illus., 74 illus. in color., 1 Paperback / softback
  • Sari: Use R!
  • Ilmumisaeg: 09-Jun-2015
  • Kirjastus: Springer International Publishing AG
  • ISBN-10: 331914474X
  • ISBN-13: 9783319144740
Teised raamatud teemal:
  • Pehme köide
  • Hind: 57,96 €*
  • * hind on lõplik, st. muud allahindlused enam ei rakendu
  • Tavahind: 68,19 €
  • Säästad 15%
  • Raamatu kohalejõudmiseks kirjastusest kulub orienteeruvalt 2-4 nädalat
  • Kogus:
  • Lisa ostukorvi
  • Tasuta tarne
  • Tellimisaeg 2-4 nädalat
  • Lisa soovinimekirja
  • Formaat: Paperback / softback, 270 pages, kõrgus x laius: 235x155 mm, kaal: 4918 g, 74 Illustrations, color; 5 Illustrations, black and white; XVI, 270 p. 79 illus., 74 illus. in color., 1 Paperback / softback
  • Sari: Use R!
  • Ilmumisaeg: 09-Jun-2015
  • Kirjastus: Springer International Publishing AG
  • ISBN-10: 331914474X
  • ISBN-13: 9783319144740
Teised raamatud teemal:
Through this book, researchers and students will learn to use R for analysis of large-scale genomic data and how to create routines to automate analytical steps. The philosophy behind the book is to start with real world raw datasets and perform all the analytical steps needed to reach final results. Though theory plays an important role, this is a practical book for graduate and undergraduate courses in bioinformatics and genomic analysis or for use in lab sessions. How to handle and manage high-throughput genomic data, create automated workflows and speed up analyses in R is also taught. A wide range of R packages useful for working with genomic data are illustrated with practical examples.





The key topics covered are association studies, genomic prediction, estimation of population genetic parameters and diversity, gene expression analysis, functional annotation of results using publically available databases and how to work efficiently in R with large genomic datasets. Important principles are demonstrated and illustrated through engaging examples which invite the reader to work with the provided datasets. Some methods that are discussed in this volume include: signatures of selection, population parameters (LD, FST, FIS, etc); use of a genomic relationship matrix for population diversity studies; use of SNP data for parentage testing; snpBLUP and gBLUP for genomic prediction. Step-by-step, all the R code required for a genome-wide association study is shown: starting from raw SNP data, how to build databases to handle and manage the data, quality control and filtering measures, association testing and evaluation of results, through to identification and functional annotation of candidate genes. Similarly, gene expression analyses are shown using microarray and RNAseq data.





At a time when genomic data is decidedly big, the skills from this book are critical. In recent years R has become the de facto< tool for analysis of gene expression data, in addition to its prominent role in analysis of genomic data. Benefits to using R include the integrated development environment for analysis, flexibility and control of the analytic workflow. Included topics are core components of advanced undergraduate and graduate classes in bioinformatics, genomics and statistical genetics. This book is also designed to be used by students in computer science and statistics who want to learn the practical aspects of genomic analysis without delving into algorithmic details. The datasets used throughout the book may be downloaded from the publishers website.





 

Arvustused

The book is timely and practical, not only through its approach on data analysis, but also due to the numerous examples and further reading indications (including R packages and books) at the end of each chapter. The targeted audience consists of undergraduates and graduates with some experience in bioinformatics analyses. the style of the book can accommodate also researchers with a computing or biological background. (Irina Ioana Mohorianu, zbMATH 1327.92002, 2016)

1 R Basics 1(28)
1.1 Why R?
1(2)
1.2 Installing R
3(2)
1.3 Packages and Bioconductor
5(2)
1.4 R 32-Bit or 64-Bit?
7(1)
1.5 Getting a Handle on R
7(4)
1.6 Importing and Manipulating Data
11(7)
1.7 Plots and Descriptive Statistics
18(6)
1.8 Saving Results
24(2)
1.9 Some Help on Help
26(1)
1.10 Where to Go from Here?
27(2)
2 Simple Marker Association Tests 29(44)
2.1 Introduction to Markers
29(3)
2.1.1 Microsatellites
31(1)
2.2 Case-Control and Family-Based Association Studies
32(1)
2.3 Discrete and Quantitative Traits
33(1)
2.4 Additive, Dominant, and Recessive Models
34(1)
2.5 A Worked Out Example
34(37)
2.6 Useful R Books and Packages
71(2)
3 Genome Wide Association Studies 73(32)
3.1 From Microsatellites and Linkage Analysis to SNP and Genome Wide Association Studies
73(2)
3.1.1 Single Nucleotide Polymorphism
74(1)
3.1.2 Genome Wide Association Studies
74(1)
3.2 Experimental Design
75(1)
3.3 Platforms
76(1)
3.4 Preprocessing and Quality Control
77(18)
3.4.1 Storing and Handling Data
77(6)
3.4.2 Quality Control
83(12)
3.5 Single SNP Analysis
95(7)
3.6 Multiple Testing
102(1)
3.7 What Next
103(1)
3.8 Useful R Packages
103(2)
4 Populations and Genetic Architecture 105(58)
4.1 Beyond Genome Wide Association Studies
105(1)
4.2 Matrix Algebra
106(3)
4.2.1 Loops and Vectorization
106(3)
4.3 Matrix Operations in R
109(2)
4.4 SNP Best Linear Unbiased Prediction
111(13)
4.5 Genomic Prediction
124(11)
4.5.1 Prediction with snpBLUP
126(6)
4.5.2 Prediction with gBLUP
132(3)
4.6 Population Genetics
135(17)
4.6.1 Signatures of Selection
136(4)
4.6.2 Other Population Estimates
140(1)
4.6.3 Genetic Distances
141(11)
4.7 Parentage Testing
152(8)
4.8 Useful R Books and Packages
160(3)
5 Gene Expression Analysis 163(38)
5.1 Introduction to Gene Expression Analysis
163(4)
5.1.1 Platforms for Expression Profiling
164(3)
5.2 Experimental Design
167(1)
5.3 Gene Expression Data
168(1)
5.4 Preprocessing and Quality Control
169(16)
5.4.1 Importing Gene Expression Data into R
169(3)
5.4.2 Quality Control
172(7)
5.4.3 Preprocessing
179(6)
5.5 Analysis of Differential Expression
185(14)
5.5.1 Multiple Testing
192(1)
5.5.2 Differential Expression of RNA-Seq
193(6)
5.6 Useful R Packages
199(2)
6 Databases and Functional Information 201(20)
6.1 Introduction to Databases
201(1)
6.2 Gene Annotation
202(10)
6.3 Gene Ontology
212(6)
6.4 Pathway Analysis, Physical Mapping, and Protein Domains
218(2)
6.5 Useful R Packages
220(1)
7 Extending R 221(34)
7.1 Large Data-Large Problems
221(1)
7.2 Improving Read and Write Operations in R
222(4)
7.3 Byte-Code Compiler
226(1)
7.4 Managing Memory
227(3)
7.5 Parallel Computation
230(6)
7.6 External Interfaces in R
236(9)
7.6.1 Linking R to C++
241(4)
7.7 Using R Inside Other Applications
245(2)
7.8 Reporting in R
247(4)
7.9 Summary
251(1)
7.10 Useful R Books and Packages
252(3)
8 Final Comments 255(2)
8.1 The Future: Polishing the Crystal Ball
256(1)
A Example QC Report for GWAS Data 257(8)
References 265
Cedric Gondro is Associate Professor of computational genetics at the University of New England. He has extensive experience in analysis of livestock projects using data from various genomic platforms. His main research interests are in the development of computational methods for optimization of biological problems; statistical and functional analysis methods for high throughput genomic data (expression arrays, SNP chips, sequence data); estimation of population genetic parameters using genome-wide data; and simulation of biological systems.