Preface |
|
xii | |
Acknowledgments |
|
xv | |
About the Authors |
|
xvi | |
Part I Digital Texts, Digital Social Science |
|
1 | (32) |
|
1 Social Science and the Digital Text Revolution |
|
|
2 | (14) |
|
|
3 | (2) |
|
Risks and Rewards of Text Mining for the Social Sciences |
|
|
5 | (1) |
|
Social Data From Digital Environments |
|
|
6 | (4) |
|
|
10 | (2) |
|
|
12 | (1) |
|
Participant Consent, Privacy, and Anonymity |
|
|
12 | (1) |
|
Prompted and Unprompted Data |
|
|
13 | (1) |
|
Organization of This Volume |
|
|
13 | (3) |
|
2 Research Design Strategies |
|
|
16 | (17) |
|
|
18 | (1) |
|
|
18 | (1) |
|
|
18 | (1) |
|
|
18 | (1) |
|
Strategies for Document Selection and Sampling |
|
|
19 | (3) |
|
|
19 | (1) |
|
|
20 | (2) |
|
Types of Inferential Logic |
|
|
22 | (5) |
|
|
23 | (1) |
|
|
24 | (1) |
|
|
25 | (2) |
|
Approaches to Research Design |
|
|
27 | (7) |
|
Analysis of Discourse Positions |
|
|
27 | (1) |
|
|
28 | (1) |
|
Critical Discourse Analysis |
|
|
28 | (1) |
|
|
29 | (1) |
|
Foucauldian Intertextuality |
|
|
30 | (1) |
|
Analysis of Texts as Social Information |
|
|
31 | (2) |
Part II Text Mining Fundamentals |
|
33 | (40) |
|
3 Web Crawling and Scraping |
|
|
34 | (8) |
|
|
36 | (1) |
|
|
37 | (2) |
|
Process Steps in Crawling |
|
|
37 | (1) |
|
|
38 | (1) |
|
|
38 | (1) |
|
|
39 | (2) |
|
Software for Web Crawling and Scraping |
|
|
41 | (1) |
|
|
42 | (10) |
|
|
43 | (3) |
|
|
45 | (1) |
|
|
46 | (1) |
|
Linguistic Inquiry and Word Count |
|
|
46 | (2) |
|
|
48 | (1) |
|
|
48 | (3) |
|
|
51 | (1) |
|
Downloadable Lexical Resources and Application Program Interfaces |
|
|
51 | (1) |
|
|
52 | (10) |
|
|
54 | (1) |
|
|
55 | (1) |
|
Stemming and Lemmatization |
|
|
55 | (1) |
|
|
56 | (3) |
|
|
59 | (1) |
|
|
60 | (1) |
|
|
60 | (1) |
|
Collocation Identification |
|
|
60 | (1) |
|
|
61 | (1) |
|
|
61 | (1) |
|
Word Sense Disambiguation |
|
|
61 | (1) |
|
Software for Text Processing |
|
|
61 | (1) |
|
|
62 | (11) |
|
Feature Representation and Weighting |
|
|
65 | (1) |
|
|
65 | (1) |
|
Supervised Learning Algorithms |
|
|
66 | (5) |
|
|
67 | (1) |
|
|
68 | (1) |
|
|
69 | (2) |
|
Evaluation of Supervised Learning |
|
|
71 | (1) |
|
Software for Supervised Learning |
|
|
71 | (2) |
Part III Text Analysis Methods From The Humanities And Social Sciences |
|
73 | (32) |
|
7 Thematic Analysis, Qualitative Data Analysis Software, and Visualization |
|
|
74 | (14) |
|
|
75 | (2) |
|
Qualitative Data Analysis Software |
|
|
77 | (6) |
|
|
83 | (3) |
|
|
84 | (1) |
|
Word Trees and Phrase Nets |
|
|
84 | (1) |
|
|
85 | (1) |
|
|
86 | (1) |
|
Software for Thematic Analysis, Qualitative Data Analysis, and Visualization |
|
|
86 | (2) |
|
|
88 | (8) |
|
|
90 | (2) |
|
Structural Approaches to Narrative |
|
|
90 | (1) |
|
Functionalist Approaches to Narrative |
|
|
91 | (1) |
|
Sociological Approaches to Narrative |
|
|
92 | (1) |
|
Mixed Methods of Narrative Analysis |
|
|
92 | (1) |
|
Automated Methods of Narrative Analysis |
|
|
93 | (1) |
|
|
93 | (1) |
|
Software for Narrative Analysis |
|
|
94 | (2) |
|
|
96 | (9) |
|
|
98 | (1) |
|
Qualitative Metaphor Analysis |
|
|
99 | (2) |
|
|
99 | (1) |
|
|
99 | (1) |
|
|
100 | (1) |
|
|
100 | (1) |
|
|
101 | (1) |
|
Mixed Methods of Metaphor Analysis |
|
|
101 | (2) |
|
|
101 | (1) |
|
|
102 | (1) |
|
|
102 | (1) |
|
Automated Metaphor Identification Methods |
|
|
103 | (1) |
|
Software for Metaphor Analysis |
|
|
103 | (2) |
Part IV Text Mining Methods From Computer Science |
|
105 | (58) |
|
10 Word and Text Relatedness |
|
|
106 | (10) |
|
|
107 | (1) |
|
Corpus-Based and Knowledge-Based Measures of Relatedness |
|
|
108 | (6) |
|
Corpus-Based Measures of Word Relatedness |
|
|
108 | (2) |
|
Knowledge-Based Measures of Word Relatedness |
|
|
110 | (2) |
|
Measures of Text Relatedness |
|
|
112 | (2) |
|
Software and Data Sets for Word and Text Relatedness |
|
|
114 | (2) |
|
|
116 | (14) |
|
A Brief History of Text Classification |
|
|
118 | (1) |
|
Applications of Text Classification |
|
|
119 | (3) |
|
|
119 | (1) |
|
|
120 | (1) |
|
Sentiment Analysis/Opinion Mining |
|
|
120 | (1) |
|
|
120 | (2) |
|
|
122 | (1) |
|
|
122 | (1) |
|
Representing Texts for Supervised Text Classification |
|
|
122 | (2) |
|
Feature Weighting and Selection |
|
|
123 | (1) |
|
Text Classification Algorithms |
|
|
124 | (2) |
|
|
124 | (1) |
|
|
125 | (1) |
|
Bootstrapping in Text Classification |
|
|
126 | (1) |
|
Evaluation of Text Classification |
|
|
127 | (1) |
|
Software and Data Sets for Text Classification |
|
|
127 | (3) |
|
12 Information Extraction |
|
|
130 | (6) |
|
|
132 | (1) |
|
|
133 | (1) |
|
Web Information Extraction |
|
|
134 | (1) |
|
|
135 | (1) |
|
Software and Data Sets for Information Extraction and Text Mining |
|
|
135 | (1) |
|
|
136 | (12) |
|
|
138 | (1) |
|
Components of an Information Retrieval System |
|
|
138 | (2) |
|
Information Retrieval Models |
|
|
140 | (2) |
|
|
142 | (2) |
|
Evaluation of Information Retrieval Models |
|
|
144 | (1) |
|
Web-Based Information Retrieval |
|
|
145 | (2) |
|
Software and Data Sets for Information Retrieval |
|
|
147 | (1) |
|
|
148 | (8) |
|
|
150 | (1) |
|
|
151 | (1) |
|
|
152 | (1) |
|
|
153 | (1) |
|
Software and Data Sets for Sentiment Analysis |
|
|
154 | (2) |
|
|
156 | (7) |
|
|
160 | (1) |
|
|
160 | (1) |
|
|
161 | (1) |
|
Software for Topic Modeling |
|
|
161 | (2) |
Part V Conclusions |
|
163 | (5) |
|
16 Text Mining, Text Analysis, and the Future of Social Science |
|
|
164 | (4) |
|
Social and Computer Science Collaboration |
|
|
166 | (2) |
References |
|
168 | (15) |
Index |
|
183 | |