|
|
xi | |
Foreword |
|
xiii | |
Introduction |
|
xvi | |
|
|
|
|
3 | (1) |
|
|
1.1 Introduction: Defining Language Variation |
|
|
3 | (2) |
|
1.2 Types of Linguistic Variables |
|
|
5 | (2) |
|
1.3 Dimensions of Variation |
|
|
7 | (8) |
|
|
15 | (2) |
|
2 Phonetic Variation in Dialects |
|
|
17 | (1) |
|
|
|
17 | (5) |
|
|
22 | (1) |
|
|
23 | (1) |
|
|
24 | (1) |
|
|
24 | (3) |
|
3 Similar Languages, Varieties, and Dialects: Status and Variation |
|
|
27 | (24) |
|
|
|
|
27 | (1) |
|
3.2 Language: More Than Communication |
|
|
28 | (1) |
|
|
28 | (4) |
|
3.4 Creoles as a Class of Natural Languages |
|
|
32 | (2) |
|
3.5 Introducing the Corpus of Spoken Bequia Creole English |
|
|
34 | (2) |
|
3.6 Transforming Spoken Word into Categorical Data |
|
|
36 | (2) |
|
3.7 Feature Interrelationship |
|
|
38 | (2) |
|
3.8 Graphical Models to Visualise Interactions |
|
|
40 | (2) |
|
3.9 Feature Relations at a Marginalised Level |
|
|
42 | (1) |
|
3.10 Feature Interrelationship within Communities |
|
|
43 | (1) |
|
3.11 Community Distinction |
|
|
44 | (1) |
|
3.12 Speaker-Specific Creole Frequency |
|
|
45 | (3) |
|
3.13 Conclusions: Principled Methods for Exploring Systematic Dialects within Languages |
|
|
48 | (3) |
|
|
51 | (45) |
|
|
|
|
51 | (7) |
|
4.2 How to Measure Intelligibility |
|
|
58 | (9) |
|
4.3 Extra-linguistic and Para-linguistic Factors Influencing Intelligibility |
|
|
67 | (3) |
|
4.4 Linguistic Determinants of Intelligibility |
|
|
70 | (13) |
|
4.5 Relationship between Intelligibility and Language Trees |
|
|
83 | (4) |
|
4.6 Conclusions, Discussion, and Desiderata for Future Research |
|
|
87 | (9) |
|
5 Dialectology for Computational Linguists |
|
|
96 | (25) |
|
|
|
|
|
|
96 | (1) |
|
|
96 | (2) |
|
|
98 | (2) |
|
5.4 Edit Distance on Phonetic Transcriptions |
|
|
100 | (4) |
|
5.5 Geography of Distributions |
|
|
104 | (4) |
|
|
108 | (3) |
|
5.7 Emerging Opportunities and Issues |
|
|
111 | (1) |
|
|
112 | (9) |
|
Part II Methods and Resources |
|
|
|
6 Data Collection and Representation for Similar Languages, Varieties and Dialects |
|
|
121 | (17) |
|
|
|
6.1 Representing Language Variability in Corpora |
|
|
122 | (2) |
|
6.2 Types of Micro-Variation and the Corresponding Data Collection Procedures |
|
|
124 | (9) |
|
6.3 Privacy and Linguistic Micro-Variation |
|
|
133 | (1) |
|
|
134 | (4) |
|
7 Adaptation of Morphosyntactic Taggers |
|
|
138 | (29) |
|
|
|
138 | (7) |
|
7.2 Model Transfer Methods |
|
|
145 | (4) |
|
7.3 Normalization and Other Data Transfer Methods |
|
|
149 | (7) |
|
7.4 Tagger Adaptation and Multilingual Models |
|
|
156 | (2) |
|
|
158 | (9) |
|
8 Sharing Dependency Parsers between Similar Languages |
|
|
167 | (20) |
|
|
|
167 | (5) |
|
8.2 A New Hope? Notable Exceptions |
|
|
172 | (6) |
|
8.3 To Conclude: A Glimpse of the Future |
|
|
178 | (9) |
|
Part III Applications and Language Specific Issues |
|
|
|
9 Dialect and Similar Language Identification |
|
|
187 | (17) |
|
|
|
187 | (2) |
|
9.2 A Supervised Text Classification Problem |
|
|
189 | (1) |
|
|
190 | (2) |
|
|
192 | (3) |
|
|
195 | (3) |
|
9.6 Conclusion and Future Perspectives |
|
|
198 | (6) |
|
10 Dialect Variation on Social Media |
|
|
204 | (15) |
|
|
|
204 | (2) |
|
10.2 Social Media for Dialect Research |
|
|
206 | (3) |
|
|
209 | (1) |
|
10.4 Patterns in Social Media |
|
|
210 | (3) |
|
|
213 | (6) |
|
11 Machine Translation between Similar Languages |
|
|
219 | (35) |
|
|
|
|
219 | (1) |
|
11.2 Models and Approaches |
|
|
219 | (4) |
|
11.3 Character-Level Machine Translation |
|
|
223 | (5) |
|
11.4 Closely Related Languages as MT Pivots |
|
|
228 | (6) |
|
|
234 | (4) |
|
|
238 | (5) |
|
|
243 | (1) |
|
11.8 Applications and Future Directions |
|
|
244 | (2) |
|
|
246 | (8) |
|
12 Automatic Spoken Dialect Identification |
|
|
254 | (25) |
|
Pedro A. Torres-Carrasquillo |
|
|
|
|
254 | (1) |
|
|
255 | (5) |
|
12.3 Resources for Dialect Identification |
|
|
260 | (5) |
|
|
265 | (4) |
|
12.5 Standarized Evaluations and Recent Performance |
|
|
269 | (3) |
|
12.6 Challenges and Future Outlook |
|
|
272 | (7) |
|
13 Arabic Dialect Processing |
|
|
279 | (25) |
|
|
|
279 | (1) |
|
13.2 Arabic and Its Variants |
|
|
279 | (3) |
|
13.3 Challenges of Arabic Dialect Processing |
|
|
282 | (3) |
|
13.4 Arabic Dialect Resources |
|
|
285 | (4) |
|
13.5 Arabic Dialect Processing Tools and Applications |
|
|
289 | (4) |
|
13.6 Conclusion and Outlook |
|
|
293 | (11) |
|
14 Computational Processing of Varieties of Chinese: Comparable Corpus-Driven Approaches to Light Verb Variation |
|
|
304 | |
|
|
|
|
|
|
|
304 | (6) |
|
14.2 Computational Approaches to Language Variations in Chinese and Other Languages |
|
|
310 | (6) |
|
14.3 Classification of Varieties of Mandarin Chinese |
|
|
316 | (3) |
|
|
319 | |