Muutke küpsiste eelistusi

E-raamat: Multilingual Information Retrieval: From Research To Practice

  • Formaat: PDF+DRM
  • Ilmumisaeg: 05-Jan-2012
  • Kirjastus: Springer-Verlag Berlin and Heidelberg GmbH & Co. K
  • Keel: eng
  • ISBN-13: 9783642230080
  • Formaat - PDF+DRM
  • Hind: 55,56 €*
  • * hind on lõplik, st. muud allahindlused enam ei rakendu
  • Lisa ostukorvi
  • Lisa soovinimekirja
  • See e-raamat on mõeldud ainult isiklikuks kasutamiseks. E-raamatuid ei saa tagastada.
  • Formaat: PDF+DRM
  • Ilmumisaeg: 05-Jan-2012
  • Kirjastus: Springer-Verlag Berlin and Heidelberg GmbH & Co. K
  • Keel: eng
  • ISBN-13: 9783642230080

DRM piirangud

  • Kopeerimine (copy/paste):

    ei ole lubatud

  • Printimine:

    ei ole lubatud

  • Kasutamine:

    Digitaalõiguste kaitse (DRM)
    Kirjastus on väljastanud selle e-raamatu krüpteeritud kujul, mis tähendab, et selle lugemiseks peate installeerima spetsiaalse tarkvara. Samuti peate looma endale  Adobe ID Rohkem infot siin. E-raamatut saab lugeda 1 kasutaja ning alla laadida kuni 6'de seadmesse (kõik autoriseeritud sama Adobe ID-ga).

    Vajalik tarkvara
    Mobiilsetes seadmetes (telefon või tahvelarvuti) lugemiseks peate installeerima selle tasuta rakenduse: PocketBook Reader (iOS / Android)

    PC või Mac seadmes lugemiseks peate installima Adobe Digital Editionsi (Seeon tasuta rakendus spetsiaalselt e-raamatute lugemiseks. Seda ei tohi segamini ajada Adober Reader'iga, mis tõenäoliselt on juba teie arvutisse installeeritud )

    Seda e-raamatut ei saa lugeda Amazon Kindle's. 

Summarizing almost ten years of intensive research into multi- and cross-lingual information access, this book is a comprehensive description of the technologies involved in designing and developing systems for multilingual information retrieval.



We are living in a multilingual world and the diversity in languages which are used to interact with information access systems has generated a wide variety of challenges to be addressed by computer and information scientists. The growing amount of non-English information accessible globally and the increased worldwide exposure of enterprises also necessitates the adaptation of Information Retrieval (IR) methods to new, multilingual settings.

Peters, Braschler and Clough present a comprehensive description of the technologies involved in designing and developing systems for Multilingual Information Retrieval (MLIR). They provide readers with broad coverage of the various issues involved in creating systems to make accessible digitally stored materials regardless of the language(s) they are written in. Details on Cross-Language Information Retrieval (CLIR) are also covered that helps readers to understand how to develop retrieval systems that cross language boundaries. Their work is divided into six chapters and accompanies the reader step-by-step through the various stages involved in building, using and evaluating MLIR systems. The book concludes with some examples of recent applications that utilise MLIR technologies. Some of the techniques described have recently started to appear in commercial search systems, while others have the potential to be part of future incarnations.

The book is intended for graduate students, scholars, and practitioners with a basic understanding of classical text retrieval methods. It offers guidelines and information on all aspects that need to be taken into consideration when building MLIR systems, while avoiding too many 'hands-on details' that could rapidly become obsolete. Thus it bridges the gap between the material covered by most of the classical IR textbooks and the novel requirements related to the acquisition and dissemination of information in whatever language it is stored.

Arvustused

From the reviews:

In this book the Reader may find a lot of information from experience of running such large evaluation campaigns. The book contents comprise six chapters that follow a conference paper structure. I recommend it to academia as a resource providing background knowledge in multilingual information retrieval. (Jolanta Mizera-Pietraszko, Informer, November, 2012)

"A valuable and  comprehensive handbook. The book is containing pointers to many useful resources" Prasenjit Majumder, DAIICT, India

1 Introduction
1(16)
1.1 The Growth of the Digital Universe
1(4)
1.2 The Terminology
5(1)
1.3 A Brief History
5(8)
1.3.1 Enabling Technologies and Standards
6(1)
1.3.2 Publicly-Funded Research Initiatives
7(2)
1.3.3 Conferences and Evaluation Campaigns
9(1)
1.3.4 Commercial Products
10(3)
1.4 The Current Research Challenges
13(2)
References
15(2)
2 Within-Language Information Retrieval
17(40)
2.1 Introduction
17(3)
2.2 The Retrieval Problem and Its Consequences
20(2)
2.3 Implementation of a Within-Language Information Retrieval System
22(2)
2.4 Indexing Phase
24(12)
2.4.1 Pre-processing (Step 1)
25(1)
2.4.2 Language Identification (Step 2)
26(1)
2.4.3 Document Formation (Step 3)
27(1)
2.4.4 Segmentation, Tokenisation, Parsing (Step 4)
28(3)
2.4.5 Feature Normalisation (Step 5)
31(4)
2.4.6 Enrichment (Step 6)
35(1)
2.5 Matching Phase
36(16)
2.5.1 `Bag of Words' Paradigm
36(2)
2.5.2 Inverted Index
38(1)
2.5.3 Basic Matching Algorithm
39(2)
2.5.4 Vector Space Model
41(2)
2.5.5 The tf.idf-Cosine Weighting Scheme
43(3)
2.5.6 Relevance Feedback
46(2)
2.5.7 Probabilistic Weighting Schemes
48(2)
2.5.8 Ranking Using Language Models
50(1)
2.5.9 Off-Page Information: Page Rank
51(1)
2.6 Summary and Future Directions
52(1)
2.7 Suggested Reading
53(1)
References
54(3)
3 Cross-Language Information Retrieval
57(28)
3.1 Introduction
57(1)
3.2 Implementation of Cross-Language Information Retrieval
58(4)
3.2.1 Query Translation and Document Translation
58(1)
3.2.2 No Translation
59(1)
3.2.3 Different Types of Translation Resources
60(1)
3.2.4 Term Ambiguity
61(1)
3.3 Translation Approaches for Cross-Language Information Retrieval
62(11)
3.3.1 Machine-Readable Dictionaries
62(3)
3.3.2 Statistical Approaches
65(4)
3.3.3 Pre-translation and Post-translation Query Expansion
69(2)
3.3.4 Machine Translation
71(1)
3.3.5 Combination Approaches
71(2)
3.4 Handling Many Languages
73(7)
3.4.1 CLIR Flows
73(4)
3.4.2 Merging Across Languages
77(2)
3.4.3 Document Translation
79(1)
3.4.4 Indirect Translation
79(1)
3.5 Summary and Future Directions
80(2)
3.6 Suggested Reading
82(1)
References
83(2)
4 Interaction and User Interfaces
85(44)
4.1 Information Seeking and User Interaction
85(4)
4.2 Users' Information Needs and Search Tasks
89(3)
4.3 Users' Language Skills and Cultural Differences
92(2)
4.4 Supporting Multilingual User Interaction
94(20)
4.4.1 Query Formulation and Translation
95(7)
4.4.2 Document Selection and Examination
102(6)
4.4.3 Query Reformulation
108(1)
4.4.4 Browsing and Visualisation
109(5)
4.5 Designing Multilingual Search User Interfaces
114(9)
4.5.1 User-Centred Design
116(2)
4.5.2 Intemationalisation and Localisation
118(3)
4.5.3 Case Study: CLIR in Google's Web Search
121(2)
4.6 Summary and Future Directions
123(1)
4.7 Suggested Reading
124(1)
References
125(4)
5 Evaluation for Multilingual Information Retrieval Systems
129(42)
5.1 Introduction
129(1)
5.2 System-Oriented Evaluation
130(26)
5.2.1 The Cranfield Tradition
131(1)
5.2.2 Evaluation Campaigns
132(1)
5.2.3 Building a Test Collection
133(8)
5.2.4 Promoting Research into Multilingual and Multimedia System Development via Evaluation
141(2)
5.2.5 Alternative Methodologies for Test Collection Construction
143(2)
5.2.6 Performance Measures
145(9)
5.2.7 Statistical Significance Testing
154(1)
5.2.8 System Effectiveness and User Satisfaction
155(1)
5.3 User-Oriented Evaluation
156(5)
5.3.1 Experimental Design
157(2)
5.3.2 Evaluating Interactive CLIR Systems at CLEF
159(1)
5.3.3 Alternative Performance Measures
160(1)
5.4 Evaluating Your Own System
161(3)
5.5 Summary and Future Directions
164(1)
5.6 Suggested Reading
165(1)
References
166(5)
6 Applications of Multilingual Information Access
171(38)
6.1 Introduction
171(1)
6.2 Beyond Multilingual Textual Document Retrieval
172(16)
6.2.1 Image Retrieval
173(4)
6.2.2 Speech Retrieval
177(3)
6.2.3 Video Retrieval
180(3)
6.2.4 Question Answering
183(5)
6.3 Multilingual Information Access in Practice
188(13)
6.3.1 Web Search
189(2)
6.3.2 Digital Libraries and Cultural Heritage
191(4)
6.3.3 Medicine and Healthcare
195(1)
6.3.4 Government and Law
196(3)
6.3.5 Business and Commerce
199(2)
6.4 Summing Up
201(1)
References
202(7)
Glossary of Acronyms 209(4)
Index 213
Carol Peters is a researcher at the Italian National Research Councils Istituto di Scienza e Tecnologie dell'Informazione. Her main research activities concern the development of multilingual access mechanisms for digital libraries and evaluation methodologies for cross-language information retrieval systems. She was leader of the EU Sixth Framework project MultiMatch and coordinated the Cross-Language Evaluation Forum (CLEF) during its first ten years of activity. In 2009, in recognition of her work for CLEF, she was awarded the Tony Kent Strix Award.

 

Martin Braschler is a lecturer at the Zurich University of Applied Sciences in Winterthur. His main research interests are in the field of information retrieval evaluation, cross-language information retrieval, and natural language processing. He served as the technical coordinator of the cross-language track at the American TREC series of evaluation campaigns from 1997-1999, and was technical coordinator of the CLEF evaluation campaigns in Europe from their start in 2000 until 2004. Having previously served as head of research and innovation at Eurospider Information Technology AG, Zurich, Switzerland, a vendor of information retrieval solutions, until 2004, he has actively been involved in the transfer of state-of-the-art information retrieval technology to the commercial marketplace.

 

Paul Clough is a senior lecturer in the Information School at the University of Sheffield. He has worked on both technical and user-oriented aspects of information retrieval in areas that include Cross-Language IR (CLIR), Geographic IR (GIR), image retrieval and personalisation. An important area of his work has been on evaluating IR systems where he co-founded and helped co-ordinate the ImageCLEF evaluation campaign 2003-2010 and is currently involved in organising a TREC task on evaluating query sessions (the Session Track). He has been a Principal Investigator for Sheffield in 4EU-funded projects (MultiMatch, Memoir, TrebleCLEF and PATHS), an AHRC-funded studentship on recommender systems and a project funded by the UK National Archives on improving information access.