Illustrations |
|
xvii | |
Preface |
|
xix | |
Acknowledgments |
|
xxi | |
Chapter 1 Introduction |
|
1 | (26) |
|
Digital Information and Information Users |
|
|
1 | (2) |
|
|
1 | (1) |
|
|
2 | (1) |
|
|
2 | (1) |
|
Digital Libraries and Digital Collections |
|
|
3 | (5) |
|
Digital Libraries Defined |
|
|
3 | (2) |
|
Digital Library Architecture |
|
|
5 | (1) |
|
|
5 | (1) |
|
Metadata and Metadata Records |
|
|
6 | (1) |
|
|
7 | (1) |
|
Services for Digital Collections |
|
|
8 | (2) |
|
|
8 | (1) |
|
|
9 | (1) |
|
Digital Reference Services or Virtual Reference Services |
|
|
9 | (1) |
|
Social Media and Interaction |
|
|
10 | (1) |
|
World Languages and Linguistics |
|
|
10 | (3) |
|
|
10 | (1) |
|
|
11 | (1) |
|
Linguistics: The Discipline of Languages |
|
|
12 | (1) |
|
|
12 | (1) |
|
|
12 | (1) |
|
|
13 | (1) |
|
|
13 | (1) |
|
Multilingual Information Access |
|
|
13 | (5) |
|
Information Access and Its Language Barriers |
|
|
14 | (1) |
|
|
15 | (1) |
|
MLIA for Digital Collections |
|
|
16 | (2) |
|
|
18 | (5) |
|
Data Modeling and Database Design |
|
|
18 | (1) |
|
Full-Text Information Retrieval |
|
|
19 | (1) |
|
Natural Language Processing |
|
|
19 | (2) |
|
|
21 | (1) |
|
User Studies and User Interfaces |
|
|
22 | (1) |
|
Summary and Suggested Readings |
|
|
23 | (1) |
|
|
24 | (1) |
|
|
24 | (3) |
Chapter 2 Cross-Language Information Retrieval |
|
27 | (26) |
|
Information Retrieval and Its Challenges |
|
|
27 | (2) |
|
|
29 | (5) |
|
|
30 | (1) |
|
|
30 | (1) |
|
|
31 | (2) |
|
|
33 | (1) |
|
Cross-Language Information Retrieval |
|
|
34 | (5) |
|
|
35 | (1) |
|
Transformation Strategies |
|
|
35 | (2) |
|
Translation Resources and Approaches |
|
|
37 | (2) |
|
Dictionaries, Ontologies, and Thesauri |
|
|
37 | (1) |
|
Parallel, Comparable, and Combined Monolingual Corpora |
|
|
38 | (1) |
|
|
39 | (1) |
|
CLIR Translation Problems and Solutions |
|
|
39 | (3) |
|
|
40 | (1) |
|
Untranslatable Search Keys |
|
|
40 | (1) |
|
Phrase Identification and Translation |
|
|
41 | (1) |
|
Noise in Translation Resources |
|
|
41 | (1) |
|
|
42 | (3) |
|
|
42 | (1) |
|
|
43 | (2) |
|
|
45 | (2) |
|
|
45 | (1) |
|
|
46 | (1) |
|
Summary and Suggested Readings |
|
|
47 | (1) |
|
|
48 | (1) |
|
|
48 | (5) |
Chapter 3 Machine Translation Research and Practice |
|
53 | (36) |
|
|
53 | (3) |
|
|
53 | (1) |
|
|
54 | (2) |
|
|
56 | (7) |
|
|
56 | (3) |
|
|
56 | (1) |
|
|
57 | (1) |
|
Transfer-Based Translation |
|
|
58 | (1) |
|
|
59 | (2) |
|
|
61 | (1) |
|
|
62 | (1) |
|
Machine Translation between English and East Asian Languages |
|
|
63 | (11) |
|
Characteristics of East Asian Languages |
|
|
63 | (3) |
|
|
63 | (2) |
|
|
65 | (1) |
|
|
66 | (1) |
|
Chinese and Japanese Text Segmentation |
|
|
66 | (7) |
|
n-Gram-Based Text Segmentation |
|
|
66 | (1) |
|
Chinese Word Segmentation |
|
|
67 | (4) |
|
Dictionary-Based Approaches |
|
|
68 | (1) |
|
Statistical and Learning-Based Approaches |
|
|
69 | (1) |
|
|
70 | (1) |
|
Use of Chinese Linguistic Knowledge in Word Segmentation |
|
|
70 | (1) |
|
Japanese Word Segmentation |
|
|
71 | (1) |
|
|
72 | (1) |
|
Character-Based Tagging Approach |
|
|
72 | (1) |
|
Evaluation of Word Segmentation |
|
|
72 | (1) |
|
MT Systems and Approaches |
|
|
73 | (1) |
|
|
74 | (4) |
|
|
75 | (1) |
|
Human Evaluation Measures |
|
|
75 | (1) |
|
Automatic Evaluation Measures |
|
|
76 | (1) |
|
|
76 | (2) |
|
|
76 | (1) |
|
Determination of Evaluation Measures |
|
|
77 | (1) |
|
Reference Translation Creation |
|
|
77 | (1) |
|
|
77 | (1) |
|
Evaluation Platform Development |
|
|
78 | (1) |
|
Evaluation Instruction and Progress |
|
|
78 | (1) |
|
Result Analysis and Decision Making |
|
|
78 | (1) |
|
MT for Information Service |
|
|
78 | (3) |
|
|
79 | (1) |
|
Use Scenarios in Library and Information Science |
|
|
80 | (1) |
|
Summary and Suggested Readings |
|
|
81 | (1) |
|
|
82 | (1) |
|
|
83 | (6) |
Chapter 4 Machine Translation for Digital Collections |
|
89 | (42) |
|
MRT Project: Metadata Records Machine Translation and Evaluation |
|
|
89 | (23) |
|
|
90 | (1) |
|
|
90 | (1) |
|
Metadata Records Extraction and MT |
|
|
90 | (3) |
|
Metadata Records Extraction |
|
|
90 | (3) |
|
|
93 | (1) |
|
HeMT: The Platform for Human Translation and MT Evaluation |
|
|
93 | (10) |
|
|
93 | (1) |
|
|
94 | (2) |
|
|
94 | (1) |
|
Multilingual Lexicon Management |
|
|
94 | (1) |
|
|
95 | (1) |
|
|
95 | (1) |
|
|
95 | (1) |
|
Evaluation Result Visualization |
|
|
95 | (1) |
|
|
96 | (2) |
|
|
98 | (1) |
|
|
98 | (1) |
|
|
99 | (4) |
|
|
100 | (1) |
|
|
101 | (1) |
|
Pretest and Posttest Questionnaires |
|
|
102 | (1) |
|
Test Results and Discussion |
|
|
103 | (1) |
|
Evaluation Tasks and Measures |
|
|
103 | (3) |
|
|
103 | (1) |
|
|
104 | (2) |
|
Evaluator Recruitment and Training |
|
|
106 | (2) |
|
|
108 | (1) |
|
|
108 | (2) |
|
Adequacy, Fluency, and the Best System |
|
|
109 | (1) |
|
|
109 | (1) |
|
|
110 | (1) |
|
Comments from the Evaluators |
|
|
110 | (1) |
|
|
111 | (1) |
|
|
111 | (1) |
|
Building Your Own MT System |
|
|
112 | (9) |
|
Procedures for Building an MT System |
|
|
112 | (3) |
|
Determining an MT Strategy |
|
|
112 | (1) |
|
Selecting an Open-Source MT Platform |
|
|
113 | (1) |
|
Analyzing Available Linguistic Resources |
|
|
113 | (1) |
|
Installing and Configuring the MT Platform |
|
|
113 | (1) |
|
Developing or Purchasing Desired Linguistic Resources |
|
|
114 | (1) |
|
|
114 | (1) |
|
Testing and Adjusting the MT System |
|
|
114 | (1) |
|
Documenting the MT System |
|
|
114 | (1) |
|
MRT Project: Constructing a Multi-Engine MT System |
|
|
115 | (6) |
|
|
115 | (1) |
|
Moses: The Statistical MT Platform |
|
|
116 | (1) |
|
Available Linguistic Resources |
|
|
117 | (1) |
|
Moses Installation and Configuration |
|
|
117 | (1) |
|
Parallel Corpus Preparation and Monolingual Corpus Acquisition |
|
|
118 | (1) |
|
MEMT Strategies and Experimentation |
|
|
118 | (1) |
|
|
119 | (1) |
|
Thoughts on Future Improvement |
|
|
120 | (1) |
|
Appendix: Data Dictionary for HeMT Database |
|
|
121 | (6) |
|
|
121 | (1) |
|
|
122 | (1) |
|
|
122 | (1) |
|
|
123 | (1) |
|
|
123 | (1) |
|
|
124 | (1) |
|
|
124 | (1) |
|
|
125 | (1) |
|
|
125 | (1) |
|
|
126 | (1) |
|
|
127 | (1) |
|
Summary and Suggested Readings |
|
|
127 | (1) |
|
|
127 | (4) |
Chapter 5 Multilingual Systems and Interfaces |
|
131 | (26) |
|
Multilingual Digital Libraries and Collections |
|
|
131 | (15) |
|
Multilingual Digital Libraries Defined |
|
|
131 | (3) |
|
Multilingual Systems on the Internet |
|
|
131 | (1) |
|
Multilingual Digital Collections |
|
|
132 | (1) |
|
Multilingual Digital Libraries |
|
|
132 | (1) |
|
A Framework for Analyzing Multilingual Digital Libraries |
|
|
133 | (1) |
|
The Perseus Digital Library |
|
|
134 | (3) |
|
|
134 | (1) |
|
|
135 | (1) |
|
|
135 | (1) |
|
|
135 | (1) |
|
|
136 | (1) |
|
|
136 | (1) |
|
|
137 | (1) |
|
The World Digital Library |
|
|
137 | (4) |
|
|
137 | (1) |
|
|
138 | (1) |
|
|
138 | (1) |
|
|
138 | (1) |
|
|
139 | (1) |
|
|
140 | (1) |
|
|
140 | (1) |
|
|
141 | (3) |
|
|
141 | (1) |
|
|
141 | (1) |
|
|
141 | (1) |
|
|
142 | (1) |
|
|
143 | (1) |
|
|
143 | (1) |
|
|
144 | (1) |
|
Characteristics of Multilingual Digital Libraries |
|
|
144 | (1) |
|
Challenges and Opportunities to Multilingual Digital Libraries |
|
|
145 | (1) |
|
Developing Multilingual Services for Digital Collections |
|
|
146 | (8) |
|
|
146 | (2) |
|
|
146 | (1) |
|
|
147 | (1) |
|
|
147 | (1) |
|
|
147 | (1) |
|
User Testing and Modification |
|
|
148 | (1) |
|
|
148 | (2) |
|
|
148 | (1) |
|
Metadata Records Translation-Based |
|
|
149 | (1) |
|
|
150 | (2) |
|
|
150 | (1) |
|
|
151 | (1) |
|
|
152 | (1) |
|
|
152 | (5) |
|
|
153 | (1) |
|
|
153 | (1) |
|
Summary and Suggested Readings |
|
|
154 | (1) |
|
|
155 | (1) |
|
|
155 | (2) |
Chapter 6 Beyond Retrieval: Knowledge Discovery and Future Directions |
|
157 | (24) |
|
Knowledge Discovery from Multilingual Texts |
|
|
157 | (14) |
|
Multilingual Information Extraction |
|
|
157 | (5) |
|
Information Extraction Tasks |
|
|
158 | (1) |
|
Entity Identification and Extraction |
|
|
158 | (1) |
|
Scenario or Relationship Detection |
|
|
158 | (1) |
|
Event Identification and Extraction |
|
|
158 | (1) |
|
General Information Extraction Processes |
|
|
159 | (1) |
|
Automatic Metadata Generation |
|
|
159 | (1) |
|
Multilingual Information Extraction and Its Challenges |
|
|
160 | (2) |
|
Evaluating Multilingual Information Extraction |
|
|
162 | (1) |
|
Multilingual Question Answering |
|
|
162 | (7) |
|
|
162 | (1) |
|
|
163 | (2) |
|
Question Answering Approaches |
|
|
165 | (1) |
|
Multilingual Question Answering Research |
|
|
165 | (3) |
|
NTCIR Multilingual QA Tracks |
|
|
166 | (2) |
|
CLEF Multilingual QA Tracks |
|
|
168 | (1) |
|
Evaluating Multilingual Question Answering |
|
|
168 | (1) |
|
Challenges and Future Development |
|
|
169 | (1) |
|
Other Knowledge Discovery Tasks |
|
|
169 | (2) |
|
Multilingual Text Summarization |
|
|
169 | (1) |
|
Rapid Prototyping of Cross-Language Tools for Low-Resource Languages |
|
|
170 | (1) |
|
Cross-Lingual Link Discovery |
|
|
170 | (1) |
|
Multilingual Social Network Analysis |
|
|
171 | (1) |
|
Multilingual Recommender Systems |
|
|
171 | (1) |
|
Multilingual News Systems |
|
|
171 | (1) |
|
MLIA Challenges and Future Directions |
|
|
171 | (2) |
|
Developing and Sharing Knowledge Resources and Tools |
|
|
172 | (1) |
|
Integrating Different Approaches |
|
|
173 | (1) |
|
Involving Users into the MLIA Procedures |
|
|
173 | (1) |
|
Developing and Evaluating Empirical MLIA Systems |
|
|
173 | (1) |
|
Preparing for Multilingual Information Access |
|
|
173 | (2) |
|
Studying Related Knowledge and Skills |
|
|
174 | (1) |
|
Keep an Open Mind to New Technologies |
|
|
174 | (1) |
|
Seek Collaboration with MLIA Researchers |
|
|
174 | (1) |
|
Summary and Suggested Readings |
|
|
175 | (1) |
|
|
175 | (1) |
|
|
176 | (5) |
Chapter 7 Related Internet Language Resources and Tools |
|
181 | (16) |
|
Dictionaries and Parallel Corpora |
|
|
181 | (4) |
|
Monolingual and Multilingual Dictionaries |
|
|
182 | (2) |
|
|
182 | (1) |
|
|
182 | (1) |
|
|
182 | (1) |
|
|
183 | (1) |
|
Japanese Language Learning Tools on the Web |
|
|
183 | (1) |
|
|
183 | (1) |
|
|
183 | (1) |
|
Parallel Corpora and Comparative Corpora |
|
|
184 | (1) |
|
|
184 | (1) |
|
|
184 | (1) |
|
Chinese-English Parallel Corpora |
|
|
184 | (1) |
|
|
184 | (1) |
|
|
185 | (1) |
|
|
185 | (1) |
|
|
185 | (1) |
|
Open-Source Systems and Tools |
|
|
185 | (6) |
|
Digital Library Management Systems |
|
|
185 | (2) |
|
|
186 | (1) |
|
|
186 | (1) |
|
|
186 | (1) |
|
|
186 | (1) |
|
|
187 | (1) |
|
|
187 | (1) |
|
|
187 | (1) |
|
|
187 | (1) |
|
|
188 | (1) |
|
|
188 | (1) |
|
|
188 | (1) |
|
|
188 | (1) |
|
Information Retrieval Systems |
|
|
188 | (1) |
|
|
188 | (1) |
|
|
189 | (1) |
|
|
189 | (1) |
|
Machine Translation Services and Platforms |
|
|
189 | (2) |
|
|
189 | (1) |
|
|
190 | (1) |
|
|
190 | (1) |
|
|
190 | (1) |
|
|
190 | (1) |
|
|
190 | (1) |
|
|
191 | (1) |
|
|
191 | (1) |
|
|
191 | (3) |
|
Professional Associations |
|
|
191 | (1) |
|
ACM Special Interest Group on Information Retrieval (SIGIR) |
|
|
191 | (1) |
|
The Linguistic Society of America |
|
|
191 | (1) |
|
The Association for Computational Linguistics |
|
|
192 | (1) |
|
Association for Computational Linguistics and Chinese Language Processing |
|
|
192 | (1) |
|
Asian Federation of Natural Language Processing (AFNLP) |
|
|
192 | (1) |
|
|
192 | (1) |
|
Research Groups in Information Retrieval |
|
|
192 | (1) |
|
Industrial Research Groups |
|
|
193 | (1) |
|
|
193 | (1) |
|
Linguistic Data Consortium (LDC) |
|
|
193 | (1) |
|
European Language Resources Association |
|
|
194 | (1) |
|
|
194 | (1) |
|
Research and Publication Sites |
|
|
194 | (3) |
|
|
194 | (1) |
|
|
194 | (1) |
|
Other Repositories and Archives |
|
|
195 | (1) |
|
|
195 | (1) |
|
|
195 | (1) |
|
|
195 | (1) |
|
Publications from Large-Scale Evaluations |
|
|
195 | (1) |
|
|
195 | (2) |
Acronym List |
|
197 | (4) |
Glossary |
|
201 | (4) |
Index |
|
205 | |