Klienditugi: 7440010 (E-R 10-18)

Abi | Registreeri | Logi sisse

E-raamat: Term Variation in Specialised Corpora: Characterisation, automatic discovery and applications

Beatrice Daille (University of Nantes - LINA)

Formaat: 284 pages
Sari: Terminology and Lexicography Research and Practice 19
Ilmumisaeg: 07-Aug-2017
Kirjastus: John Benjamins Publishing Co
Keel: eng
ISBN-13: 9789027265357

Teised raamatud teemal:

Lexicography

Formaat - PDF+DRM
Hind: 102,50 €*
* hind on lõplik, st. muud allahindlused enam ei rakendu
Lisa ostukorvi
Lisa soovinimekirja
See e-raamat on mõeldud ainult isiklikuks kasutamiseks. E-raamatuid ei saa tagastada.

Formaat: 284 pages
Sari: Terminology and Lexicography Research and Practice 19
Ilmumisaeg: 07-Aug-2017
Kirjastus: John Benjamins Publishing Co
Keel: eng
ISBN-13: 9789027265357

Teised raamatud teemal:

Lexicography

DRM piirangud

Kopeerimine (copy/paste):

ei ole lubatud
Printimine:

ei ole lubatud
Kasutamine:

Digitaalõiguste kaitse (DRM)
Kirjastus on väljastanud selle e-raamatu krüpteeritud kujul, mis tähendab, et selle lugemiseks peate installeerima spetsiaalse tarkvara. Samuti peate looma endale Adobe ID Rohkem infot siin. E-raamatut saab lugeda 1 kasutaja ning alla laadida kuni 6'de seadmesse (kõik autoriseeritud sama Adobe ID-ga).

Vajalik tarkvara
Mobiilsetes seadmetes (telefon või tahvelarvuti) lugemiseks peate installeerima selle tasuta rakenduse: PocketBook Reader (iOS / Android)

PC või Mac seadmes lugemiseks peate installima Adobe Digital Editionsi (Seeon tasuta rakendus spetsiaalselt e-raamatute lugemiseks. Seda ei tohi segamini ajada Adober Reader'iga, mis tõenäoliselt on juba teie arvutisse installeeritud )

Seda e-raamatut ei saa lugeda Amazon Kindle's.

This book addresses term variation which has been a very important topic in terminology, computational terminology and natural language processing for up to twenty years. This book presents the first complete inventory of term variants and the linguistic procedures that lead to their formation. It also takes into account issues raised by multilingual applications and presents ways to detect variants in five different languages: French, English, German, Spanish and Russian.
The book provides insights into the following issues: What is a variant? What are the main linguistic mechanisms involved in the transformation of base terms into variants? How can variants be automatically detected in texts? Should variation be taken into account in natural language processing applications?
This book is targeted at terminologists and linguists interested in term variation as well as researchers in natural language processing and computer science that must handle term variants in different kinds of applications.

Acknowledgements

Chapter 1 Introduction

(10)

1.1 Preliminary example

(1)

1.2 Variants and terminological analysis

(2)

1.3 The automatic detection of variants

(1)

1.4 Variants and applications

(1)

1.5 Typographical conventions

(4)

Part I. Characterisation

Chapter 2 Definitions

(22)

2.1 Term

(3)

2.2 Derivation

(1)

2.3 Compounding

(6)

2.3.1 Morphological compounds

(2)

2.3.2 Border between derivation and compounding

(1)

2.3.3 Syntagmatic compounds

(1)

2.3.4 Border between morphological and syntagmatic compounds

(1)

2.4 Borrowing

(1)

2.5 Term patterns

(7)

2.5.1 Simple term patterns

(1)

2.5.2 Morphological compound patterns

(1)

2.5.3 Syntagmatic compound patterns

(3)

2.5.4 Frequency of term patterns

(2)

2.6 Term variants

(2)

2.6.1 The definition of variant

(1)

2.6.2 Denominative variants

(1)

2.6.3 Conceptual variants

(1)

2.7 Border between terms and variants

(2)

Chapter 3 Conceptualisation of terminological variants

(34)

3.1 Description of variants

(3)

3.1.1 Organisation of variants

(2)

3.1.2 Mechanisms and linguistic operations

(1)

3.1.3 Properties of variants

(1)

3.2 Denominative variants

(9)

3.2.1 Synonymic substitution

(2)

3.2.2 Simplification

(3)

3.2.3 Exemplification

(2)

3.2.4 Competing patterns

(1)

3.3 Conceptual variants

(6)

3.3.1 Expansion

(5)

3.3.2 Anaphorical reduction

(1)

3.4 Linguistic variants

(10)

3.4.1 Graphics and spelling

(3)

3.4.2 Inflection

(1)

3.4.3 Derivation

(1)

3.4.4 Fullback-compounding

(1)

3.4.5 Modification

(1)

3.4.6 Coordination, disjunction and enumeration

(4)

3.5 Variants of register

(1)

3.5.1 Variation of scientification/popularisation

(1)

3.5.2 Variants of position

(1)

3.6 Borders between categories of variants

(4)

3.6.1 Denominative and linguistic variants

(1)

3.6.2 Denominative and conceptual variants

(1)

3.6.3 Conceptual and linguistic variants

(3)

Chapter 4 Semantics of conceptual variants

(16)

4.1 Structuring terms

(6)

4.1.1 Conceptual and semantic relations

(1)

4.1.2 Classic semantic relations

(2)

4.1.3 Collocation

(1)

4.1.4 Lexical functions

(3)

4.2 Fundamental relations between term and variant

(2)

4.2.1 Synonymy

(1)

4.2.2 Hierarchical relations

(1)

4.3 Complex relations between term and variant

(4)

4.3.1 Result

(1)

4.3.2 Plurality

(1)

4.3.3 Spatiality

(1)

4.3.4 Temporality

(1)

4.3.5 Quality

(1)

4.4 Other relations between term and variant

(4)

4.4.1 Predication

(1)

4.4.2 Instance

(4)

Part II. Automatic discovery

Chapter 5 Primitive exploration of variants using comparable corpora

(26)

5.1 Comparable corpora

(4)

5.1.1 Corpus

(1)

5.1.2 Properties

(2)

5.1.3 Collecting comparable corpora

(1)

5.1.4 Comparability

(1)

5.2 Comparable corpora used in this study

(2)

5.3 Looking for variants

(15)

5.3.1 Implementation

(1)

5.3.2 N-gram massive data

(1)

5.3.3 Unigrams

(8)

5.3.4 Skip-grams

(2)

5.3.5 Categories of variants facing data

101

(3)

5.4 Comparison according to communication levels

104

(5)

5.4.1 Unigrams

104

(1)

5.4.2 Skip-grams

105

(4)

Chapter 6 Processing methods for the detection of variants from corpora

109

(16)

6.1 Linguistic-based methods

110

(4)

6.1.1 Morphological analysis

110

(1)

6.1.2 Syntagmatic and paradigmatic analysis

111

(1)

6.1.3 Syntactic analysis

112

(1)

6.1.4 Distributional analysis

113

(1)

6.2 Algorithms on strings

114

(6)

6.2.1 Distance computed from common substrings

115

(2)

6.2.2 Edit distances

117

(3)

6.3 Statistical methods

120

(1)

6.4 Typology of variant occurrences

121

(3)

6.4.1 Isolated variant occurrences

121

(1)

6.4.2 Inter-mixed term and variant occurrences

122

(1)

6.4.3 Separated term and variant occurrences

123

(1)

6.5 Relationship between processing methods and types of occurrences

124

(1)

Chapter 7 Grammar of variants

125

(22)

7.1 Specifications and properties

125

(7)

7.1.1 Expressivity of the syntagmatic rules

126

(3)

7.1.2 Core operations

129

(1)

7.1.3 Ambiguity of the syntactic analysis

130

(2)

7.2 Generic grammar of recognition of variants

132

(4)

7.2.1 Competing structures

133

(1)

7.2.2 Augmented/reduced structures

134

(1)

7.2.3 Contextual structures

135

(1)

7.2.4 Function words

135

(1)

7.2.5 Ad-hoc rules

135

(1)

7.3 Variant grammars for specific languages

136

(2)

7.4 Cross-lingual observations

138

(8)

7.4.1 Coverage

138

(2)

7.4.2 Precision

140

(6)

7.5 Summary of observations

146

(1)

Chapter 8 Synonymic variants

147

(24)

8.1 Distributional analysis

147

(5)

8.1.1 Modelling of the distributional methods

148

(3)

8.1.2 Observations in specialised domains

151

(1)

8.2 Compositional method

152

(1)

8.3 Semi-compositional method

153

(1)

8.4 Cross-lingual and cross-method observations

154

(11)

8.4.1 Reference lists of synonyms

155

(2)

8.4.2 Experimental setup parameters

157

(1)

8.4.3 Evaluation measures

158

(1)

8.4.4 Results

159

(6)

8.5 Towards the detection of antonymic variants

165

(6)

Part III. Applications and tools

Chapter 9 Terminology extraction

171

(34)

9.1 The core of terminology extraction

172

(1)

9.2 Collecting candidate terms

173

(4)

9.2.1 Patterns

173

(1)

9.2.2 Generic rules

173

(3)

9.2.3 Borders

176

(1)

9.2.4 Lexical expansion

176

(1)

9.3 Filtering and sorting candidate terms

177

(9)

9.3.1 Frequency

178

(1)

9.3.2 Association measures

178

(1)

9.3.3 Specificity measures

179

(3)

9.3.4 Filtering by removing nested terms

182

(1)

9.3.5 Contextual filtering

183

(2)

9.3.6 Supervised learning methods

185

(1)

9.4 Evaluation

186

(4)

9.4.1 References

187

(2)

9.4.2 Measures

189

(1)

9.5 Comparing term extraction without and with variant recognition

190

(1)

9.6 Experimental setting

190

(11)

9.6.1 Corpora

190

(2)

9.6.2 Our integrated terminology extraction

192

(2)

9.6.3 Comparison protocol

194

(1)

9.6.4 Maximum recall

194

(1)

9.6.5 Observations with a posteriori RTL

195

(4)

9.6.6 Observations with a priori RTL

199

(2)

9.7 Summary of observations

201

(4)

Chapter 10 End-user applications and tools

205

(20)

10.1 Machine-aided indexing and FASTR

205

(1)

10.2 Thematic cartography and TermWatch

206

(2)

10.3 TermSuite

208

(17)

10.3.1 Architecture

208

(1)

10.3.2 Token Regex

208

(4)

10.3.3 Compost

212

(4)

10.3.4 Variant grouping

216

(3)

10.3.5 Ranking by termhood

219

(1)

10.3.6 Performance

220

(1)

10.3.7 Release

221

(4)

Part IV. Conclusions

Chapter 11 Term variants and their discovery

225

(22)

11.1 Summary of the present study

225

(4)

11.1.1 A unified typology of term variants

225

(1)

11.1.2 A variety of methods for the discovery of variants

226

(2)

11.1.3 A terminology-resource building application

228

(1)

11.2 Remaining issues and direction for further research

229

(2)

11.2.1 Semantic analysis of variations

229

(1)

11.2.2 Distributional analysis at the morpheme level

230

(1)

11.2.3 Recognition of other variants

231

(1)

11.3 Implications for related studies

231

(4)

11.3.1 Variants and paraphrases

231

(1)

11.3.2 Variants and translation

232

(3)

Bibliography

235

(12)

Appendix A: Notation

247

(2)

A.1 Examples

247

(1)

A.2 Specialised domains

247

(1)

A.3 Specialised corpora

247

(2)

Appendix B: Multext categories

249

(2)

Appendix C: Search with AntConc

251

(6)

C.1 Parameters

251

(1)

C.2 Collection of n-grams

251

(2)

C.3 Results of n-grams

253

(4)

Appendix D: GGRV

257

(14)

D.1 French

257

(3)

D.2 English

260

(3)

D.3 Spanish

263

(3)

D.4 German

266

(2)

D.5 Russian

268

(3)

Index

271

Lisainfo e-raamatute kohta

Püsilink: https://www.kriso.ee/db/97890272653572e.html

Märksõnad:

Language and languages - Variation

E-raamat: Term Variation in Specialised Corpora: Characterisation, automatic discovery and applications

DRM piirangud

Kopeerimine (copy/paste):

Printimine:

Kasutamine:

Konto & seaded

Otsing

Otsingu andmebaas

Filtreeri tulemusi

Teemad E-raamatute teemad

Vali ostukorv