Klienditugi: 7440010 (E-R 10-18)

Abi | Registreeri | Logi sisse

E-raamat: Humanities Data Analysis: Case Studies with Python

4.25/5 (8 hinnangut Goodreads-ist)

Allen Riddell, Mike Kestemont, Folgert Karsdorp

Formaat: 360 pages
Ilmumisaeg: 12-Jan-2021
Kirjastus: Princeton University Press
Keel: eng
ISBN-13: 9780691200330

Teised raamatud teemal:

Formaat - PDF+DRM
Hind: 49,73 €*
* hind on lõplik, st. muud allahindlused enam ei rakendu
Lisa ostukorvi
Lisa soovinimekirja
See e-raamat on mõeldud ainult isiklikuks kasutamiseks. E-raamatuid ei saa tagastada.

Formaat: 360 pages
Ilmumisaeg: 12-Jan-2021
Kirjastus: Princeton University Press
Keel: eng
ISBN-13: 9780691200330

Teised raamatud teemal:

DRM piirangud

Kopeerimine (copy/paste):

ei ole lubatud
Printimine:

ei ole lubatud
Kasutamine:

Digitaalõiguste kaitse (DRM)
Kirjastus on väljastanud selle e-raamatu krüpteeritud kujul, mis tähendab, et selle lugemiseks peate installeerima spetsiaalse tarkvara. Samuti peate looma endale Adobe ID Rohkem infot siin. E-raamatut saab lugeda 1 kasutaja ning alla laadida kuni 6'de seadmesse (kõik autoriseeritud sama Adobe ID-ga).

Vajalik tarkvara
Mobiilsetes seadmetes (telefon või tahvelarvuti) lugemiseks peate installeerima selle tasuta rakenduse: PocketBook Reader (iOS / Android)

PC või Mac seadmes lugemiseks peate installima Adobe Digital Editionsi (Seeon tasuta rakendus spetsiaalselt e-raamatute lugemiseks. Seda ei tohi segamini ajada Adober Reader'iga, mis tõenäoliselt on juba teie arvutisse installeeritud )

Seda e-raamatut ei saa lugeda Amazon Kindle's.

"The use of quantitative methods in the humanities and related social sciences has increased considerably in recent years, allowing researchers to discover patterns in a vast range of source materials. Despite this growth, there are few resources addressed to students and scholars who wish to take advantage of these powerful tools. Humanities Data Analysis offers the first intermediate-level guide to quantitative data analysis for humanities students and scholars using the Python programming language. This practical textbook, which assumes a basic knowledge of Python, teaches readers the necessary skills for conducting humanities research in the rapidly developing digital environment. The book begins with an overview of the place of data science in the humanities, and proceeds to cover data carpentry: the essential techniques for gathering, cleaning, representing, and transforming textual and tabular data. Then, drawing from real-world, publicly available data sets that cover a variety of scholarly domains, the book delves into detailed case studies. Focusing on textual data analysis, the authors explore such diverse topics as network analysis, genre theory, onomastics, literacy, author attribution, mapping, stylometry, topic modeling, and time series analysis. Exercises and resources for further reading are provided at the end of each chapter. An ideal resource for humanities students and scholars aiming to take their Python skills to the next level, Humanities Data Analysis illustrates the benefits that quantitative methods can bring to complex research questions. Appropriate for advanced undergraduates, graduate students, and scholars with a basic knowledge of Python. Applicable to many humanities disciplines, including history, literature, and sociology. Offers real-world case studies using publicly available data sets. Provides exercises at the end of each chapter for students to test acquired skills. Emphasizes visual storytelling via data visualizations"--

A practical guide to data-intensive humanities research using the Python programming language

The use of quantitative methods in the humanities and related social sciences has increased considerably in recent years, allowing researchers to discover patterns in a vast range of source materials. Despite this growth, there are few resources addressed to students and scholars who wish to take advantage of these powerful tools. Humanities Data Analysis offers the first intermediate-level guide to quantitative data analysis for humanities students and scholars using the Python programming language. This practical textbook, which assumes a basic knowledge of Python, teaches readers the necessary skills for conducting humanities research in the rapidly developing digital environment.

The book begins with an overview of the place of data science in the humanities, and proceeds to cover data carpentry: the essential techniques for gathering, cleaning, representing, and transforming textual and tabular data. Then, drawing from real-world, publicly available data sets that cover a variety of scholarly domains, the book delves into detailed case studies. Focusing on textual data analysis, the authors explore such diverse topics as network analysis, genre theory, onomastics, literacy, author attribution, mapping, stylometry, topic modeling, and time series analysis. Exercises and resources for further reading are provided at the end of each chapter.

An ideal resource for humanities students and scholars aiming to take their Python skills to the next level, Humanities Data Analysis illustrates the benefits that quantitative methods can bring to complex research questions.

Appropriate for advanced undergraduates, graduate students, and scholars with a basic knowledge of Python
Applicable to many humanities disciplines, including history, literature, and sociology
Offers real-world case studies using publicly available data sets
Provides exercises at the end of each chapter for students to test acquired skills
Emphasizes visual storytelling via data visualizations

Preface

I Data Analysis Essentials

(164)

Chapter 1 Introduction

(29)

1.1 Quantitative Data Analysis and the Humanities

(2)

1.2 Overview of the Book

(1)

1.3 Related Books

(1)

1.4 How to Use This Book

(6)

1.4.1 What you should know

(4)

1.4.2 Packages and data

(1)

1.4.3 Exercises

(1)

1.5 An Exploratory Data Analysis of the United States' Culinary History

(1)

1.6 Cooking with Tabular Data

(4)

1.7 Taste Trends in Culinary US History

(8)

1.8 America's Culinary Melting Pot

(4)

1.9 Further Reading

(2)

Chapter 2 Parsing and Manipulating Structured Data

(46)

2.1 Introduction

(1)

2.2 Plain Text

(3)

2.3 CSV

(4)

2.4 PDF

(3)

2.5 JSON

(3)

2.6 XML

(11)

2.6.1 Parsing XML

(3)

2.6.2 Creating XML

(5)

2.6.3 TEI

(1)

2.7 HTML

(8)

2.7.1 Retrieving HTML from the web

(1)

2.8 Extracting Character Interaction Networks

(9)

2.9 Conclusion and Further Reading

(4)

Chapter 3 Exploring Texts Using the Vector Space Model

(48)

3.1 Introduction

(1)

3.2 From Texts to Vectors

(11)

3.2.1 Text preprocessing

(9)

3.3 Mapping Genres

(21)

3.3.1 Computing distances between documents

(10)

3.3.2 Nearest neighbors

107

(4)

3.4 Further Reading

111

(2)

3.5 Appendix: Vectorizing Texts with NumPy

113

(13)

3.5.1 Constructing arrays

113

(4)

3.5.2 Indexing and slicing arrays

117

(3)

3.5.3 Aggregating functions

120

(2)

3.5.4 Array broadcasting

122

(4)

Chapter 4 Processing Tabular Data

126

(39)

4.1 Loading, Inspecting, and Summarizing Tabular Data

127

(9)

4.1.1 Reading tabular data with Pandas

130

(6)

4.2 Mapping Cultural Change

136

(13)

4.2.1 Turnover in naming practices

136

(10)

4.2.2 Visualizing turnovers

146

(3)

4.3 Changing Naming Practices

149

(13)

4.3.1 Increasing name diversity

150

(3)

4.3.2 A bias for names ending in w?

153

(5)

4.3.3 Unisex names in the United States

158

(4)

4.4 Conclusions and Further Reading

162

(3)

II Advanced Data Analysis

165

(158)

Chapter 5 Statistics Essentials: Who Reads Novels?

169

(32)

5.1 Introduction

169

(1)

5.2 Statistics

170

(1)

5.3 Summarizing Location and Dispersion

171

(4)

5.3.1 Data: Novel reading in the United States

171

(4)

5.4 Location

175

(4)

5.5 Dispersion

179

(9)

5.5.1 Variation in categorical values

184

(4)

5.6 Measuring Association

188

(9)

5.6.1 Measuring association between numbers

188

(4)

5.6.2 Measuring association between categories

192

(3)

5.6.3 Mutual information

195

(2)

5.7 Conclusion

197

(1)

5.8 Further Reading

198

(3)

Chapter 6 Introduction to Probability

201

(28)

6.1 Uncertainty and Thomas Pynchon

202

(1)

6.2 Probability

203

(5)

6.2.1 Probability and degree of belief

205

(3)

6.3 Example: Bayes's Rule and Authorship Attribution

208

(17)

6.3.1 Random variables and probability distributions

213

(12)

6.4 Further Reading

225

(2)

6.5 Appendix

227

(2)

6.5.1 Bayes's rule

227

(1)

6.5.2 Fitting a negative binomial distribution

228

(1)

Chapter 7 Narrating with Maps

229

(19)

7.1 Introduction

229

(1)

7.2 Data Preparations

230

(3)

7.3 Projections and Basemaps

233

(3)

7.4 Plotting Battles

236

(2)

7.5 Mapping the Development of the War

238

(6)

7.6 Further Reading

244

(4)

Chapter 8 Stylometry and the Voice of Hildegard

248

(37)

8.1 Introduction

248

(2)

8.2 Authorship Attribution

250

(12)

8.2.1 Burrows's Delta

252

(2)

8.2.2 Function words

254

(3)

8.2.3 Computing document distances with Delta

257

(3)

8.2.4 Authorship attribution evaluation

260

(2)

8.3 Hierarchical Agglomerative Clustering

262

(4)

8.4 Principal Component Analysis

266

(14)

8.4.1 Applying PCA

268

(3)

8.4.2 The intuition behind PCA

271

(3)

8.4.3 Loadings

274

(6)

8.5 Conclusions

280

(1)

8.6 Further Reading

280

(5)

Chapter 9 A Topic Model of United States Supreme Court Opinions, 1900-2000

285

(38)

9.1 Introduction

285

(2)

9.2 Mixture Models: Artwork Dimensions in the Tate Galleries

287

(7)

9.3 Mixed-Membership Model of Texts

294

(23)

9.3.1 Parameter estimation

300

(4)

9.3.2 Checking an unsupervised model

304

(5)

9.3.3 Modeling different word senses

309

(4)

9.3.4 Exploring trends over time in the Supreme Court

313

(4)

9.4 Conclusion

317

(1)

9.5 Further Reading

318

(2)

9.6 Appendix: Mapping Between Our Topic Model and Lauderdale and Clark (2014)

320

(3)

Epilogue: Good Enough Practices

323

(2)

Bibliography

325

(8)

Index

333

Folgert Karsdorp is a postdoctoral researcher at the Meertens Institute of the Royal Netherlands Academy of Arts and Sciences. Mike Kestemont is assistant professor of literature at the University of Antwerp. Twitter @Mike_Kestemont Allen Riddell is assistant professor of information science at Indiana University.

Lisainfo e-raamatute kohta

Püsilink: https://www.kriso.ee/db/97806912003302e.html

Märksõnad:

E-raamat: Humanities Data Analysis: Case Studies with Python

DRM piirangud

Kopeerimine (copy/paste):

Printimine:

Kasutamine:

Konto & seaded

Otsing

Otsingu andmebaas

Filtreeri tulemusi

Teemad E-raamatute teemad

Vali ostukorv