Muutke küpsiste eelistusi

E-raamat: Term Variation in Specialised Corpora: Characterisation, automatic discovery and applications

(University of Nantes - LINA)
Teised raamatud teemal:
  • Formaat - PDF+DRM
  • Hind: 102,50 €*
  • * hind on lõplik, st. muud allahindlused enam ei rakendu
  • Lisa ostukorvi
  • Lisa soovinimekirja
  • See e-raamat on mõeldud ainult isiklikuks kasutamiseks. E-raamatuid ei saa tagastada.
Teised raamatud teemal:

DRM piirangud

  • Kopeerimine (copy/paste):

    ei ole lubatud

  • Printimine:

    ei ole lubatud

  • Kasutamine:

    Digitaalõiguste kaitse (DRM)
    Kirjastus on väljastanud selle e-raamatu krüpteeritud kujul, mis tähendab, et selle lugemiseks peate installeerima spetsiaalse tarkvara. Samuti peate looma endale  Adobe ID Rohkem infot siin. E-raamatut saab lugeda 1 kasutaja ning alla laadida kuni 6'de seadmesse (kõik autoriseeritud sama Adobe ID-ga).

    Vajalik tarkvara
    Mobiilsetes seadmetes (telefon või tahvelarvuti) lugemiseks peate installeerima selle tasuta rakenduse: PocketBook Reader (iOS / Android)

    PC või Mac seadmes lugemiseks peate installima Adobe Digital Editionsi (Seeon tasuta rakendus spetsiaalselt e-raamatute lugemiseks. Seda ei tohi segamini ajada Adober Reader'iga, mis tõenäoliselt on juba teie arvutisse installeeritud )

    Seda e-raamatut ei saa lugeda Amazon Kindle's. 

This book addresses term variation which has been a very important topic in terminology, computational terminology and natural language processing for up to twenty years. This book presents the first complete inventory of term variants and the linguistic procedures that lead to their formation. It also takes into account issues raised by multilingual applications and presents ways to detect variants in five different languages: French, English, German, Spanish and Russian.
The book provides insights into the following issues: What is a variant? What are the main linguistic mechanisms involved in the transformation of base terms into variants? How can variants be automatically detected in texts? Should variation be taken into account in natural language processing applications?
This book is targeted at terminologists and linguists interested in term variation as well as researchers in natural language processing and computer science that must handle term variants in different kinds of applications.
Acknowledgements xi
Chapter 1 Introduction 1(10)
1.1 Preliminary example
3(1)
1.2 Variants and terminological analysis
4(2)
1.3 The automatic detection of variants
6(1)
1.4 Variants and applications
7(1)
1.5 Typographical conventions
7(4)
Part I. Characterisation
Chapter 2 Definitions
11(22)
2.1 Term
11(3)
2.2 Derivation
14(1)
2.3 Compounding
15(6)
2.3.1 Morphological compounds
16(2)
2.3.2 Border between derivation and compounding
18(1)
2.3.3 Syntagmatic compounds
19(1)
2.3.4 Border between morphological and syntagmatic compounds
20(1)
2.4 Borrowing
21(1)
2.5 Term patterns
22(7)
2.5.1 Simple term patterns
23(1)
2.5.2 Morphological compound patterns
23(1)
2.5.3 Syntagmatic compound patterns
24(3)
2.5.4 Frequency of term patterns
27(2)
2.6 Term variants
29(2)
2.6.1 The definition of variant
29(1)
2.6.2 Denominative variants
30(1)
2.6.3 Conceptual variants
30(1)
2.7 Border between terms and variants
31(2)
Chapter 3 Conceptualisation of terminological variants
33(34)
3.1 Description of variants
34(3)
3.1.1 Organisation of variants
34(2)
3.1.2 Mechanisms and linguistic operations
36(1)
3.1.3 Properties of variants
37(1)
3.2 Denominative variants
37(9)
3.2.1 Synonymic substitution
38(2)
3.2.2 Simplification
40(3)
3.2.3 Exemplification
43(2)
3.2.4 Competing patterns
45(1)
3.3 Conceptual variants
46(6)
3.3.1 Expansion
46(5)
3.3.2 Anaphorical reduction
51(1)
3.4 Linguistic variants
52(10)
3.4.1 Graphics and spelling
52(3)
3.4.2 Inflection
55(1)
3.4.3 Derivation
56(1)
3.4.4 Fullback-compounding
56(1)
3.4.5 Modification
57(1)
3.4.6 Coordination, disjunction and enumeration
58(4)
3.5 Variants of register
62(1)
3.5.1 Variation of scientification/popularisation
62(1)
3.5.2 Variants of position
63(1)
3.6 Borders between categories of variants
63(4)
3.6.1 Denominative and linguistic variants
64(1)
3.6.2 Denominative and conceptual variants
64(1)
3.6.3 Conceptual and linguistic variants
64(3)
Chapter 4 Semantics of conceptual variants
67(16)
4.1 Structuring terms
67(6)
4.1.1 Conceptual and semantic relations
68(1)
4.1.2 Classic semantic relations
68(2)
4.1.3 Collocation
70(1)
4.1.4 Lexical functions
70(3)
4.2 Fundamental relations between term and variant
73(2)
4.2.1 Synonymy
73(1)
4.2.2 Hierarchical relations
74(1)
4.3 Complex relations between term and variant
75(4)
4.3.1 Result
76(1)
4.3.2 Plurality
77(1)
4.3.3 Spatiality
77(1)
4.3.4 Temporality
78(1)
4.3.5 Quality
78(1)
4.4 Other relations between term and variant
79(4)
4.4.1 Predication
79(1)
4.4.2 Instance
79(4)
Part II. Automatic discovery
Chapter 5 Primitive exploration of variants using comparable corpora
83(26)
5.1 Comparable corpora
83(4)
5.1.1 Corpus
83(1)
5.1.2 Properties
84(2)
5.1.3 Collecting comparable corpora
86(1)
5.1.4 Comparability
87(1)
5.2 Comparable corpora used in this study
87(2)
5.3 Looking for variants
89(15)
5.3.1 Implementation
89(1)
5.3.2 N-gram massive data
90(1)
5.3.3 Unigrams
91(8)
5.3.4 Skip-grams
99(2)
5.3.5 Categories of variants facing data
101(3)
5.4 Comparison according to communication levels
104(5)
5.4.1 Unigrams
104(1)
5.4.2 Skip-grams
105(4)
Chapter 6 Processing methods for the detection of variants from corpora
109(16)
6.1 Linguistic-based methods
110(4)
6.1.1 Morphological analysis
110(1)
6.1.2 Syntagmatic and paradigmatic analysis
111(1)
6.1.3 Syntactic analysis
112(1)
6.1.4 Distributional analysis
113(1)
6.2 Algorithms on strings
114(6)
6.2.1 Distance computed from common substrings
115(2)
6.2.2 Edit distances
117(3)
6.3 Statistical methods
120(1)
6.4 Typology of variant occurrences
121(3)
6.4.1 Isolated variant occurrences
121(1)
6.4.2 Inter-mixed term and variant occurrences
122(1)
6.4.3 Separated term and variant occurrences
123(1)
6.5 Relationship between processing methods and types of occurrences
124(1)
Chapter 7 Grammar of variants
125(22)
7.1 Specifications and properties
125(7)
7.1.1 Expressivity of the syntagmatic rules
126(3)
7.1.2 Core operations
129(1)
7.1.3 Ambiguity of the syntactic analysis
130(2)
7.2 Generic grammar of recognition of variants
132(4)
7.2.1 Competing structures
133(1)
7.2.2 Augmented/reduced structures
134(1)
7.2.3 Contextual structures
135(1)
7.2.4 Function words
135(1)
7.2.5 Ad-hoc rules
135(1)
7.3 Variant grammars for specific languages
136(2)
7.4 Cross-lingual observations
138(8)
7.4.1 Coverage
138(2)
7.4.2 Precision
140(6)
7.5 Summary of observations
146(1)
Chapter 8 Synonymic variants
147(24)
8.1 Distributional analysis
147(5)
8.1.1 Modelling of the distributional methods
148(3)
8.1.2 Observations in specialised domains
151(1)
8.2 Compositional method
152(1)
8.3 Semi-compositional method
153(1)
8.4 Cross-lingual and cross-method observations
154(11)
8.4.1 Reference lists of synonyms
155(2)
8.4.2 Experimental setup parameters
157(1)
8.4.3 Evaluation measures
158(1)
8.4.4 Results
159(6)
8.5 Towards the detection of antonymic variants
165(6)
Part III. Applications and tools
Chapter 9 Terminology extraction
171(34)
9.1 The core of terminology extraction
172(1)
9.2 Collecting candidate terms
173(4)
9.2.1 Patterns
173(1)
9.2.2 Generic rules
173(3)
9.2.3 Borders
176(1)
9.2.4 Lexical expansion
176(1)
9.3 Filtering and sorting candidate terms
177(9)
9.3.1 Frequency
178(1)
9.3.2 Association measures
178(1)
9.3.3 Specificity measures
179(3)
9.3.4 Filtering by removing nested terms
182(1)
9.3.5 Contextual filtering
183(2)
9.3.6 Supervised learning methods
185(1)
9.4 Evaluation
186(4)
9.4.1 References
187(2)
9.4.2 Measures
189(1)
9.5 Comparing term extraction without and with variant recognition
190(1)
9.6 Experimental setting
190(11)
9.6.1 Corpora
190(2)
9.6.2 Our integrated terminology extraction
192(2)
9.6.3 Comparison protocol
194(1)
9.6.4 Maximum recall
194(1)
9.6.5 Observations with a posteriori RTL
195(4)
9.6.6 Observations with a priori RTL
199(2)
9.7 Summary of observations
201(4)
Chapter 10 End-user applications and tools
205(20)
10.1 Machine-aided indexing and FASTR
205(1)
10.2 Thematic cartography and TermWatch
206(2)
10.3 TermSuite
208(17)
10.3.1 Architecture
208(1)
10.3.2 Token Regex
208(4)
10.3.3 Compost
212(4)
10.3.4 Variant grouping
216(3)
10.3.5 Ranking by termhood
219(1)
10.3.6 Performance
220(1)
10.3.7 Release
221(4)
Part IV. Conclusions
Chapter 11 Term variants and their discovery
225(22)
11.1 Summary of the present study
225(4)
11.1.1 A unified typology of term variants
225(1)
11.1.2 A variety of methods for the discovery of variants
226(2)
11.1.3 A terminology-resource building application
228(1)
11.2 Remaining issues and direction for further research
229(2)
11.2.1 Semantic analysis of variations
229(1)
11.2.2 Distributional analysis at the morpheme level
230(1)
11.2.3 Recognition of other variants
231(1)
11.3 Implications for related studies
231(4)
11.3.1 Variants and paraphrases
231(1)
11.3.2 Variants and translation
232(3)
Bibliography 235(12)
Appendix A: Notation 247(2)
A.1 Examples
247(1)
A.2 Specialised domains
247(1)
A.3 Specialised corpora
247(2)
Appendix B: Multext categories 249(2)
Appendix C: Search with AntConc 251(6)
C.1 Parameters
251(1)
C.2 Collection of n-grams
251(2)
C.3 Results of n-grams
253(4)
Appendix D: GGRV 257(14)
D.1 French
257(3)
D.2 English
260(3)
D.3 Spanish
263(3)
D.4 German
266(2)
D.5 Russian
268(3)
Index 271