Klienditugi: 7440010 (E-R 10-18)

Abi | Registreeri | Logi sisse

Multiword Expressions Acquisition: A Generic and Open Framework 2015 ed. [Kõva köide]

Carlos Ramisch

Formaat: Hardback, 230 pages, kõrgus x laius: 235x155 mm, kaal: 4912 g, 17 Illustrations, black and white; XIV, 230 p. 17 illus., 1 Hardback
Sari: Theory and Applications of Natural Language Processing
Ilmumisaeg: 08-Oct-2014
Kirjastus: Springer International Publishing AG
ISBN-10: 3319092065
ISBN-13: 9783319092065

Teised raamatud teemal:

Natural language & machine translation
Computational linguistics - (Hetkel poes: 1 nimetust)
Artificial intelligence - (Hetkel poes: 4 nimetust)

Kõva köide
Hind: 95,02 €*
* hind on lõplik, st. muud allahindlused enam ei rakendu
Tavahind: 111,79 €
Säästad 15%
Raamatu kohalejõudmiseks kirjastusest kulub orienteeruvalt 2-4 nädalat
Kogus:
- - 1
  - 2
  - 3
  - 4
  - 5
  - 6
  - 7
  - 8
  - 9
  - 10
Lisa ostukorvi
Tasuta tarne
Tellimisaeg 2-4 nädalat
Lisa soovinimekirja

Formaat: Hardback, 230 pages, kõrgus x laius: 235x155 mm, kaal: 4912 g, 17 Illustrations, black and white; XIV, 230 p. 17 illus., 1 Hardback
Sari: Theory and Applications of Natural Language Processing
Ilmumisaeg: 08-Oct-2014
Kirjastus: Springer International Publishing AG
ISBN-10: 3319092065
ISBN-13: 9783319092065

Teised raamatud teemal:

Natural language & machine translation
Computational linguistics - (Hetkel poes: 1 nimetust)
Artificial intelligence - (Hetkel poes: 4 nimetust)

Püsilink: https://www.kriso.ee/db/9783319092065.html

Märksõnad:

?This book is an excellent introduction to multiword expressions. It provides a unique, comprehensive and up-to-date overview of this exciting topic in computational linguistics. The first part describes the diversity and richness of multiword expressions, including many examples in several languages. These constructions are not only complex and arbitrary, but also much more frequent than one would guess, making them a real nightmare for natural language processing applications.

The second part introduces a new generic framework for automatic acquisition of multiword expressions from texts. Furthermore, it describes the accompanying free software tool, the mwetoolkit, which comes in handy when looking for expressions in texts (regardless of the language). Evaluation is greatly emphasized, underlining the fact that results depend on parameters like corpus size, language, MWE type, etc. The last part contains solid experimental results and evaluates the mwetoolkit, demonstrating its usefulness for computer-assisted lexicography and machine translation.

This is the first book to cover the whole pipeline of multiword expression acquisition in a single volume. It is addresses the needs of students and researchers in computational and theoretical linguistics, cognitive sciences, artificial intelligence and computer science. Its good balance between computational and linguistic views make it the perfect starting point for anyone interested in multiword expressions, language and text processing in general.

Arvustused

The motivating idea behind this work is to explore and compare approaches to MWE, involving various tools as well as human resources. Much information is given to enable other researchers to investigate MWEs. The book contains a vast amount of information. An extensive bibliography follows each chapter. There are helpful appendices, including a list of standard part of speech tags. (Alice Davison, Computing Reviews, September, 2015)

1 Introduction

(22)

1.1 Motivations

(8)

1.1.1 What Are Multiword Expressions?

(3)

1.1.2 Why Do They Matter?

(3)

1.1.3 What Happens If We Ignore Them?

(2)

1.2 A New Framework for MWE Treatment

(5)

1.2.1 Hypotheses

(1)

1.2.2 Goals

(1)

1.2.3 Guiding Principles

(3)

1.3
Chapters Outline

(2)

1.4 Summary

(7)

References

(6)

Part I Multiword Expressions: A Tough Nut to Crack

2 Definitions and Characteristics

(30)

2.1 A Brief History

(5)

2.1.1 Theoretical Linguistics

(2)

2.1.2 Computational Linguistics

(2)

2.2 Defining MWEs

(6)

2.2.1 What Is a Word?

(1)

2.2.2 What Is a MWE?

(2)

2.2.3 A Note on Terminology

(3)

2.3 Characteristics and Characterisations

(11)

2.3.1 The Compositionality Continuum

(2)

2.3.2 Derived MWE Properties

(3)

2.3.3 Existing MWE Typologies

(2)

2.3.4 A Simplified Typology

(4)

2.4 A Snapshot of the Research Field

(1)

2.5 Summary

(7)

References

(6)

3 State of the Art in MWE Processing

(52)

3.1 Elementary Notions

(17)

3.1.1 Linguistic Processing: Analysis

(3)

3.1.2 Word Frequency Distributions

(3)

3.1.3 N-Grams, Language Models and Suffix Arrays

(3)

3.1.4 Lexical Association Measures

(7)

3.2 Methods for Automatic MWE Acquisition

(10)

3.2.1 Monolingual Methods

(3)

3.2.2 Bi- and Multilingual Methods

(2)

3.2.3 Existing Tools

(4)

3.3 Other Tasks Related to MWE Processing

(11)

3.3.1 Interpretation

(3)

3.3.2 Disambiguation

(1)

3.3.3 Representation

(2)

3.3.4 Applications

(5)

3.4 Summary

(14)

References

(12)

Part II MWE Acquisition

4 Evaluation of MWE Acquisition

105

(22)

4.1 Evaluation Context

106

(8)

4.1.1 Evaluation Axes

106

(3)

4.1.2 Evaluation Measures

109

(2)

4.1.3 Annotation

111

(3)

4.2 Acquisition Contexts

114

(5)

4.2.1 Characteristics of Target Constructions

115

(1)

4.2.2 Characteristics of Corpora

116

(3)

4.2.3 Existing Resources

119

(1)

4.3 Discussion

119

(2)

4.4 Summary

121

(6)

References

122

(5)

5 A New Framework for MWE Acquisition

127

(32)

5.1 The mwetoolkit Framework

127

(14)

5.1.1 General Architecture

128

(2)

5.1.2 Modules

130

(8)

5.1.3 Discussion

138

(3)

5.2 A Toy Experiment

141

(4)

5.2.1 Candidate Extraction

141

(1)

5.2.2 Candidate Filtering

142

(2)

5.2.3 Results

144

(1)

5.3 Comparison with Related Approaches

145

(7)

5.3.1 Related Approaches

145

(1)

5.3.2 Comparison Setup

146

(1)

5.3.3 Results

147

(5)

5.4 Summary

152

(7)

References

154

(5)

Part III Applications

6 Application 1: Lexicography

159

(22)

6.1 A Dictionary of Nominal Compounds in Greek

159

(7)

6.1.1 Greek Nominal Compounds

160

(2)

6.1.2 Automatic Acquisition Setup

162

(1)

6.1.3 Results

163

(3)

6.2 A Dictionary of Complex Predicates in Portuguese

166

(10)

6.2.1 Portuguese Complex Predicates

167

(2)

6.2.2 Automatic Acquisition Setup

169

(2)

6.2.3 Results

171

(5)

6.3 Summary

176

(5)

References

178

(3)

7 Application 2: Machine Translation

181

(20)

7.1 A Brief Introduction to SMT

183

(3)

7.2 Evaluation of Phrasal Verb Translation

186

(11)

7.2.1 English Phrasal Verbs

187

(2)

7.2.2 Translation Setup

189

(3)

7.2.3 Results

192

(5)

7.3 Summary

197

(4)

References

197

(4)

8 Conclusions

201

(6)

References

204

(3)

A Extended List of Translation Examples

207

(2)

B Resources Used in the Experiments

209

(2)

B.1 Data

209

(1)

B.1.1 Monolingual Corpora

209

(1)

B.1.2 Multilingual Corpora

209

(1)

B.2 Software

210

(1)

B.2.1 Analysis Tools

210

(1)

C The mwetoolkit: Documentation

211

(12)

C.1 Design Choices

211

(1)

C.2 Installing the mwetoolkit

212

(1)

C.2.1 Windows

212

(1)

C.2.2 Linux and Mac OS

212

(1)

C.2.3 Mac OS Dependencies

213

(1)

C.2.4 Testing Your Installation

213

(1)

C.3 Getting Started

213

(3)

C.3.1 An Example

214

(2)

C.4 Defining Patterns for Extraction

216

(3)

C.4.1 Literal Matches

216

(1)

C.4.2 Repetitions and Optional Elements

216

(1)

C.4.3 Ignoring Parts of the Match

217

(1)

C.4.4 Backpatterns

218

(1)

C.4.5 Syntactic Patterns

218

(1)

C.5 Preprocessing a Corpus Using TreeTagger

219

(1)

C.5.1 Installing TreeTagger

219

(1)

C.5.2 Converting TreeTagger's Output to XML

219

(1)

C.6 Preprocessing a Corpus Using RASP

220

(1)

C.6.1 Installing RASP

220

(1)

C.6.2 Converting RASP'S Output to XML

220

(1)

C.7 Examples of XML Files

220

(1)

C.8 Developers

221

(2)

D Tagsets for POS and Syntax

223

(6)

D.1 Generic POS Tagset

223

(1)

D.2 RASP English POS Tagset

223

(3)

D.3 RASP English Grammatical Relations

226

(1)

D.4 TreeTagger English POS Tagset

227

(2)

E Detailed Lexicon Descriptions

229

E.1 Sentiment Verbs Extracted from Brazilian WordNet

229

(1)

E.2 Sentiment Nouns

230

Carlos Ramisch is a researcher and lecturer at the Aix-Marseille University (France). He holds a double PhD in computer science from Grenoble University (France) and UFRGS (Brazil). His research interests are multiword expressions, semantics and multilingualism. Carlos coordinated many events, including the MWE workshops (2010, 2011, 2013) and the ACM TSLP special issue. He is the creator and developer of the mwetoolkit.

Multiword Expressions Acquisition: A Generic and Open Framework 2015 ed. [Kõva köide]

Arvustused

Konto & seaded

Otsing

Otsingu andmebaas

Filtreeri tulemusi

Teemad Ingliskeelsed raamatud

Vali ostukorv