Klienditugi: 7440010 (E-R 10-18)

Abi | Registreeri | Logi sisse

E-raamat: Multilingual Information Retrieval: From Research To Practice

3.00/5 (6 hinnangut Goodreads-ist)

Paul Clough, Martin Braschler, Carol Peters

Formaat: PDF+DRM
Ilmumisaeg: 05-Jan-2012
Kirjastus: Springer-Verlag Berlin and Heidelberg GmbH & Co. K
Keel: eng
ISBN-13: 9783642230080

Teised raamatud teemal:

Formaat - PDF+DRM
Hind: 55,56 €*
* hind on lõplik, st. muud allahindlused enam ei rakendu
Lisa ostukorvi
Lisa soovinimekirja
See e-raamat on mõeldud ainult isiklikuks kasutamiseks. E-raamatuid ei saa tagastada.

Formaat: PDF+DRM
Ilmumisaeg: 05-Jan-2012
Kirjastus: Springer-Verlag Berlin and Heidelberg GmbH & Co. K
Keel: eng
ISBN-13: 9783642230080

Teised raamatud teemal:

DRM piirangud

Kopeerimine (copy/paste):

ei ole lubatud
Printimine:

ei ole lubatud
Kasutamine:

Digitaalõiguste kaitse (DRM)
Kirjastus on väljastanud selle e-raamatu krüpteeritud kujul, mis tähendab, et selle lugemiseks peate installeerima spetsiaalse tarkvara. Samuti peate looma endale Adobe ID Rohkem infot siin. E-raamatut saab lugeda 1 kasutaja ning alla laadida kuni 6'de seadmesse (kõik autoriseeritud sama Adobe ID-ga).

Vajalik tarkvara
Mobiilsetes seadmetes (telefon või tahvelarvuti) lugemiseks peate installeerima selle tasuta rakenduse: PocketBook Reader (iOS / Android)

PC või Mac seadmes lugemiseks peate installima Adobe Digital Editionsi (Seeon tasuta rakendus spetsiaalselt e-raamatute lugemiseks. Seda ei tohi segamini ajada Adober Reader'iga, mis tõenäoliselt on juba teie arvutisse installeeritud )

Seda e-raamatut ei saa lugeda Amazon Kindle's.

Summarizing almost ten years of intensive research into multi- and cross-lingual information access, this book is a comprehensive description of the technologies involved in designing and developing systems for multilingual information retrieval.

We are living in a multilingual world and the diversity in languages which are used to interact with information access systems has generated a wide variety of challenges to be addressed by computer and information scientists. The growing amount of non-English information accessible globally and the increased worldwide exposure of enterprises also necessitates the adaptation of Information Retrieval (IR) methods to new, multilingual settings.

Peters, Braschler and Clough present a comprehensive description of the technologies involved in designing and developing systems for Multilingual Information Retrieval (MLIR). They provide readers with broad coverage of the various issues involved in creating systems to make accessible digitally stored materials regardless of the language(s) they are written in. Details on Cross-Language Information Retrieval (CLIR) are also covered that helps readers to understand how to develop retrieval systems that cross language boundaries. Their work is divided into six chapters and accompanies the reader step-by-step through the various stages involved in building, using and evaluating MLIR systems. The book concludes with some examples of recent applications that utilise MLIR technologies. Some of the techniques described have recently started to appear in commercial search systems, while others have the potential to be part of future incarnations.

The book is intended for graduate students, scholars, and practitioners with a basic understanding of classical text retrieval methods. It offers guidelines and information on all aspects that need to be taken into consideration when building MLIR systems, while avoiding too many 'hands-on details' that could rapidly become obsolete. Thus it bridges the gap between the material covered by most of the classical IR textbooks and the novel requirements related to the acquisition and dissemination of information in whatever language it is stored.

Arvustused

From the reviews:

In this book the Reader may find a lot of information from experience of running such large evaluation campaigns. The book contents comprise six chapters that follow a conference paper structure. I recommend it to academia as a resource providing background knowledge in multilingual information retrieval. (Jolanta Mizera-Pietraszko, Informer, November, 2012)

"A valuable and comprehensive handbook. The book is containing pointers to many useful resources" Prasenjit Majumder, DAIICT, India

1 Introduction

(16)

1.1 The Growth of the Digital Universe

(4)

1.2 The Terminology

(1)

1.3 A Brief History

(8)

1.3.1 Enabling Technologies and Standards

(1)

1.3.2 Publicly-Funded Research Initiatives

(2)

1.3.3 Conferences and Evaluation Campaigns

(1)

1.3.4 Commercial Products

(3)

1.4 The Current Research Challenges

(2)

References

(2)

2 Within-Language Information Retrieval

(40)

2.1 Introduction

(3)

2.2 The Retrieval Problem and Its Consequences

(2)

2.3 Implementation of a Within-Language Information Retrieval System

(2)

2.4 Indexing Phase

(12)

2.4.1 Pre-processing (Step 1)

(1)

2.4.2 Language Identification (Step 2)

(1)

2.4.3 Document Formation (Step 3)

(1)

2.4.4 Segmentation, Tokenisation, Parsing (Step 4)

(3)

2.4.5 Feature Normalisation (Step 5)

(4)

2.4.6 Enrichment (Step 6)

(1)

2.5 Matching Phase

(16)

2.5.1 `Bag of Words' Paradigm

(2)

2.5.2 Inverted Index

(1)

2.5.3 Basic Matching Algorithm

(2)

2.5.4 Vector Space Model

(2)

2.5.5 The tf.idf-Cosine Weighting Scheme

(3)

2.5.6 Relevance Feedback

(2)

2.5.7 Probabilistic Weighting Schemes

(2)

2.5.8 Ranking Using Language Models

(1)

2.5.9 Off-Page Information: Page Rank

(1)

2.6 Summary and Future Directions

(1)

2.7 Suggested Reading

(1)

References

(3)

3 Cross-Language Information Retrieval

(28)

3.1 Introduction

(1)

3.2 Implementation of Cross-Language Information Retrieval

(4)

3.2.1 Query Translation and Document Translation

(1)

3.2.2 No Translation

(1)

3.2.3 Different Types of Translation Resources

(1)

3.2.4 Term Ambiguity

(1)

3.3 Translation Approaches for Cross-Language Information Retrieval

(11)

3.3.1 Machine-Readable Dictionaries

(3)

3.3.2 Statistical Approaches

(4)

3.3.3 Pre-translation and Post-translation Query Expansion

(2)

3.3.4 Machine Translation

(1)

3.3.5 Combination Approaches

(2)

3.4 Handling Many Languages

(7)

3.4.1 CLIR Flows

(4)

3.4.2 Merging Across Languages

(2)

3.4.3 Document Translation

(1)

3.4.4 Indirect Translation

(1)

3.5 Summary and Future Directions

(2)

3.6 Suggested Reading

(1)

References

(2)

4 Interaction and User Interfaces

(44)

4.1 Information Seeking and User Interaction

(4)

4.2 Users' Information Needs and Search Tasks

(3)

4.3 Users' Language Skills and Cultural Differences

(2)

4.4 Supporting Multilingual User Interaction

(20)

4.4.1 Query Formulation and Translation

(7)

4.4.2 Document Selection and Examination

102

(6)

4.4.3 Query Reformulation

108

(1)

4.4.4 Browsing and Visualisation

109

(5)

4.5 Designing Multilingual Search User Interfaces

114

(9)

4.5.1 User-Centred Design

116

(2)

4.5.2 Intemationalisation and Localisation

118

(3)

4.5.3 Case Study: CLIR in Google's Web Search

121

(2)

4.6 Summary and Future Directions

123

(1)

4.7 Suggested Reading

124

(1)

References

125

(4)

5 Evaluation for Multilingual Information Retrieval Systems

129

(42)

5.1 Introduction

129

(1)

5.2 System-Oriented Evaluation

130

(26)

5.2.1 The Cranfield Tradition

131

(1)

5.2.2 Evaluation Campaigns

132

(1)

5.2.3 Building a Test Collection

133

(8)

5.2.4 Promoting Research into Multilingual and Multimedia System Development via Evaluation

141

(2)

5.2.5 Alternative Methodologies for Test Collection Construction

143

(2)

5.2.6 Performance Measures

145

(9)

5.2.7 Statistical Significance Testing

154

(1)

5.2.8 System Effectiveness and User Satisfaction

155

(1)

5.3 User-Oriented Evaluation

156

(5)

5.3.1 Experimental Design

157

(2)

5.3.2 Evaluating Interactive CLIR Systems at CLEF

159

(1)

5.3.3 Alternative Performance Measures

160

(1)

5.4 Evaluating Your Own System

161

(3)

5.5 Summary and Future Directions

164

(1)

5.6 Suggested Reading

165

(1)

References

166

(5)

6 Applications of Multilingual Information Access

171

(38)

6.1 Introduction

171

(1)

6.2 Beyond Multilingual Textual Document Retrieval

172

(16)

6.2.1 Image Retrieval

173

(4)

6.2.2 Speech Retrieval

177

(3)

6.2.3 Video Retrieval

180

(3)

6.2.4 Question Answering

183

(5)

6.3 Multilingual Information Access in Practice

188

(13)

6.3.1 Web Search

189

(2)

6.3.2 Digital Libraries and Cultural Heritage

191

(4)

6.3.3 Medicine and Healthcare

195

(1)

6.3.4 Government and Law

196

(3)

6.3.5 Business and Commerce

199

(2)

6.4 Summing Up

201

(1)

References

202

(7)

Glossary of Acronyms

209

(4)

Index

213

Carol Peters is a researcher at the Italian National Research Councils Istituto di Scienza e Tecnologie dell'Informazione. Her main research activities concern the development of multilingual access mechanisms for digital libraries and evaluation methodologies for cross-language information retrieval systems. She was leader of the EU Sixth Framework project MultiMatch and coordinated the Cross-Language Evaluation Forum (CLEF) during its first ten years of activity. In 2009, in recognition of her work for CLEF, she was awarded the Tony Kent Strix Award.

Martin Braschler is a lecturer at the Zurich University of Applied Sciences in Winterthur. His main research interests are in the field of information retrieval evaluation, cross-language information retrieval, and natural language processing. He served as the technical coordinator of the cross-language track at the American TREC series of evaluation campaigns from 1997-1999, and was technical coordinator of the CLEF evaluation campaigns in Europe from their start in 2000 until 2004. Having previously served as head of research and innovation at Eurospider Information Technology AG, Zurich, Switzerland, a vendor of information retrieval solutions, until 2004, he has actively been involved in the transfer of state-of-the-art information retrieval technology to the commercial marketplace.

Paul Clough is a senior lecturer in the Information School at the University of Sheffield. He has worked on both technical and user-oriented aspects of information retrieval in areas that include Cross-Language IR (CLIR), Geographic IR (GIR), image retrieval and personalisation. An important area of his work has been on evaluating IR systems where he co-founded and helped co-ordinate the ImageCLEF evaluation campaign 2003-2010 and is currently involved in organising a TREC task on evaluating query sessions (the Session Track). He has been a Principal Investigator for Sheffield in 4EU-funded projects (MultiMatch, Memoir, TrebleCLEF and PATHS), an AHRC-funded studentship on recommender systems and a project funded by the UK National Archives on improving information access.

Lisainfo e-raamatute kohta

Püsilink: https://www.kriso.ee/db/97836422300802e.html

Märksõnad:

E-raamat: Multilingual Information Retrieval: From Research To Practice

DRM piirangud

Kopeerimine (copy/paste):

Printimine:

Kasutamine:

Arvustused

Konto & seaded

Otsing

Otsingu andmebaas

Filtreeri tulemusi

Teemad E-raamatute teemad

Vali ostukorv