Muutke küpsiste eelistusi

Probability and Statistics for Data Science [Kõva köide]

(New York University)
  • Formaat: Hardback, 624 pages, kaal: 1426 g, Worked examples or Exercises
  • Ilmumisaeg: 03-Jul-2025
  • Kirjastus: Cambridge University Press
  • ISBN-10: 1009180088
  • ISBN-13: 9781009180085
Teised raamatud teemal:
  • Formaat: Hardback, 624 pages, kaal: 1426 g, Worked examples or Exercises
  • Ilmumisaeg: 03-Jul-2025
  • Kirjastus: Cambridge University Press
  • ISBN-10: 1009180088
  • ISBN-13: 9781009180085
Teised raamatud teemal:
This accessible book for graduate students and data scientists provides a solid background in probabilistic and statistical concepts relevant to data science. Emphasis is placed on practice, with examples throughout using real-world data that readers can implement from Python code available on the book's website.

This self-contained guide introduces two pillars of data science, probability theory and statistics, side by side, illuminating the connections between probabilistic concepts and the statistical techniques they employ, such as the relationship between nonparametric and parametric models and random variables. Other topics covered include hypothesis testing, principal component analysis, correlation, and regression. Examples throughout the book draw from real-world datasets, quickly demonstrating concepts in practice and confronting readers with fundamental challenges in data science, such as overfitting, the curse of dimensionality, and causal inference. Code in Python reproducing these examples is available on the book's website, along with videos, slides, and solutions to exercises. This accessible book is ideal for undergraduate and graduate students, data science practitioners, and others interested in the theoretical concepts underlying data science methods.

Arvustused

'Fernandez-Granda's Probability and Statistics for Data Science is a comprehensive yet approachable treatment of the fundamentals required of all aspiring Data Scientists-whether they be in academia, industry or elsewhere. The language is clear and precise, and it is one of the best-organized treatments of this material I have ever seen. With lucid examples and helpful exercises, it deserves to be the leading text for these topics among undergraduate and graduate students in this technical, fast-moving discipline. Instructors take note!' Arthur Spirling, Princeton University 'If you're mathematically inclined and want to master the foundations of data science in one go, this book is for you. It covers a broad range of essential modern topics - including nonparametric methods, causal inference, latent variable models, Bayesian approaches, and a thorough introduction to machine learning - all illustrated with an abundance of figures and real-world data examples. Highly recommended.' David Rosenberg, Office of the CTO, Bloomberg

Muu info

A self-contained introduction to probability and statistics for data science with examples involving real-world datasets.
Preface; Book Website; Introduction and Overview;
1. Probability;
2.
Discrete variables;
3. Continuous variables;
4. Multiple discrete variables;
5. Multiple continuous variables;
6. Discrete and continuous variables;
7.
Averaging;
8. Correlation;
9. Estimation of population parameters;
10.
Hypothesis testing;
11. Principal component analysis and low-rank models;
12.
Regression and classification; A. Datasets; References; Index.
Carlos Fernandez-Granda is Associate Professor of Mathematics and Data Science at New York University, where he has taught probability and statistics to data science students since 2015. The goal of his research is to design and analyze data science methodology, with a focus on machine learning, artificial intelligence, and their application to medicine, climate science, biology, and other scientific domains.