
E-book: Collaborative Annotation for Reliable Natural Language Processing: Technical and Sociological Aspects

  • Format: EPUB+DRM
  • Publication date: 14-Jun-2016
  • Publisher: ISTE Ltd and John Wiley & Sons Inc
  • Language: English
  • ISBN-13: 9781119307655
  • Price: 171,60 €*
  • * the price is final, i.e. no further discounts apply
  • This e-book is intended for personal use only. E-books cannot be returned.

DRM restrictions

  • Copying (copy/paste):

    not allowed

  • Printing:

    not allowed

  • Usage:

    Digital rights management (DRM)
    The publisher has issued this e-book in encrypted form, which means that you must install dedicated software to read it. You also need to create an Adobe ID. More information here. The e-book can be read by 1 user and downloaded to up to 6 devices (all authorized with the same Adobe ID).

    Required software
    To read on mobile devices (phone or tablet), install this free app: PocketBook Reader (iOS / Android)

    To read on a PC or Mac, install Adobe Digital Editions (this is a free application designed specifically for reading e-books; it should not be confused with Adobe Reader, which is probably already installed on your computer).

    This e-book cannot be read on an Amazon Kindle.

This book presents a unique opportunity for constructing a consistent image of collaborative manual annotation for Natural Language Processing (NLP). NLP has witnessed two major evolutions in the past 25 years: firstly, the extraordinary success of machine learning, which is now, for better or for worse, overwhelmingly dominant in the field, and secondly, the multiplication of evaluation campaigns and shared tasks. Both rely on manually annotated corpora, for training and for evaluating the systems.

These corpora have progressively become the hidden pillars of our domain, providing food for our hungry machine learning algorithms and reference for evaluation. Annotation is now the place where linguistics hides in NLP. However, manual annotation has largely been ignored for some time, and it has taken a while even for annotation guidelines to be recognized as essential.

Although some efforts have been made lately to address some of the issues raised by manual annotation, little research has yet been done on the subject. This book aims to provide some useful insights into it.

Manual corpus annotation is now at the heart of NLP, and is still largely unexplored. There is a need for manual annotation engineering (in the sense of a precisely formalized process), and this book aims to provide a first step towards a holistic methodology, with a global view on annotation.
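The evaluation the description refers to rests on comparing annotations against each other, and the standard chance-corrected agreement measure for two annotators (which the table of contents' "Beyond kappas" section takes as its starting point) is Cohen's kappa. A minimal sketch; the annotators and labels below are purely illustrative:

```python
from collections import Counter

def cohens_kappa(ann_a, ann_b):
    """Cohen's kappa: chance-corrected agreement between two annotators."""
    assert len(ann_a) == len(ann_b) and ann_a
    n = len(ann_a)
    # Observed agreement: fraction of items labeled identically.
    p_o = sum(a == b for a, b in zip(ann_a, ann_b)) / n
    # Expected chance agreement, from each annotator's label distribution.
    freq_a, freq_b = Counter(ann_a), Counter(ann_b)
    p_e = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    return (p_o - p_e) / (1 - p_e)

# Two hypothetical annotators assigning POS-like tags to six tokens.
a = ["N", "V", "N", "N", "ADJ", "V"]
b = ["N", "V", "N", "V", "ADJ", "V"]
print(round(cohens_kappa(a, b), 3))  # → 0.739
```

A kappa of 1 means perfect agreement and 0 means agreement no better than chance; interpreting intermediate values is exactly where the book's discussion of "giving meaning to the metrics" comes in.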

 

Preface
List of Acronyms
Introduction
Chapter 1 Annotating Collaboratively
1.1 The annotation process (re)visited
1.1.1 Building consensus
1.1.2 Existing methodologies
1.1.3 Preparatory work
1.1.4 Pre-campaign
1.1.5 Annotation
1.1.6 Finalization
1.2 Annotation complexity
1.2.1 Example overview
1.2.2 What to annotate?
1.2.3 How to annotate?
1.2.4 The weight of the context
1.2.5 Visualization
1.2.6 Elementary annotation tasks
1.3 Annotation tools
1.3.1 To be or not to be an annotation tool
1.3.2 Much more than prototypes
1.3.3 Addressing the new annotation challenges
1.3.4 The impossible dream tool
1.4 Evaluating the annotation quality
1.4.1 What is annotation quality?
1.4.2 Understanding the basics
1.4.3 Beyond kappas
1.4.4 Giving meaning to the metrics
1.5 Conclusion
Chapter 2 Crowdsourcing Annotation
2.1 What is crowdsourcing and why should we be interested in it?
2.1.1 A moving target
2.1.2 A massive success
2.2 Deconstructing the myths
2.2.1 Crowdsourcing is a recent phenomenon
2.2.2 Crowdsourcing involves a crowd (of non-experts)
2.2.3 "Crowdsourcing involves (a crowd of) non-experts"
2.3 Playing with a purpose
2.3.1 Using the players' innate capabilities and world knowledge
2.3.2 Using the players' school knowledge
2.3.3 Using the players' learning capacities
2.4 Acknowledging crowdsourcing specifics
2.4.1 Motivating the participants
2.4.2 Producing quality data
2.5 Ethical issues
2.5.1 Game ethics
2.5.2 What's wrong with Amazon Mechanical Turk?
2.5.3 A charter to rule them all
Conclusion
Appendix
Glossary
Bibliography
Index
Karën Fort is Associate Professor at University Paris-Sorbonne (Paris 4), working in the STIH (meaning, text, computer science, history) team. Her current research interests include collaborative manual annotation, crowdsourcing and ethics.