"Graph theory and the fields of natural language processing and information retrieval are well-studied disciplines. Traditionally, these areas have been perceived as distinct, with different algorithms, different applications, and different potential end-users. However, recent research has shown that these disciplines are intimately connected, with a large variety of natural language processing and information retrieval applications finding efficient solutions within graph-theoretical frameworks. This book extensively covers the use of graph-based algorithms for natural language processing and information retrieval. It brings together topics as diverse as lexical semantics, text summarization, text mining, ontology construction, text classification, and information retrieval, which are connected by the common underlying theme of the use of graph-theoretical methods for text and information processing tasks. Readers will come away with a firm understanding of the major methods and applications in natural language processing and information retrieval that rely on graph-based representations and algorithms"--
"This book extensively covers the use of graph-based algorithms for natural language processing and information retrieval"--
Provided by publisher.
Arvustused
'For the first time, a computational framework that unifies many algorithms and representations from the fields of natural language processing and information retrieval. This book is a comprehensive introduction to both theory and practice.' Giorgio Satta, University of Padua 'The book is highly recommended to be read not only by upper-level undergraduate and graduate students, but also by experts who are looking for a brief overview of this area. The book aims to enable the readers to gain sufficient understanding of graph-based approaches used in information retrieval and to recognize opportunities for advancing the state of art in natural language processing problems by applications of graph theory.' Korhan Gunel, Zentralblatt MATH
Muu info
This book extensively covers the use of graph-based algorithms for natural language processing and information retrieval.
Introduction |
|
1 | (10) |
|
|
3 | (1) |
|
|
4 | (3) |
|
|
7 | (4) |
|
Part I Introduction to Graph Theory |
|
|
|
1 Notations, Properties, and Representations |
|
|
11 | (9) |
|
1.1 Graph Terminology and Notations |
|
|
11 | (2) |
|
|
13 | (1) |
|
|
14 | (1) |
|
1.4 Representing Graphs as Matrices |
|
|
15 | (1) |
|
1.5 Using Matrices to Compute Graph Properties |
|
|
16 | (1) |
|
1.6 Representing Graphs as Linked Lists |
|
|
17 | (1) |
|
1.7 Eigenvalues and Eigenvectors |
|
|
18 | (2) |
|
|
20 | (33) |
|
2.1 Depth-First Graph Traversal |
|
|
20 | (2) |
|
2.2 Breadth-First Graph Traversal |
|
|
22 | (1) |
|
2.3 Minimum Spanning Trees |
|
|
23 | (3) |
|
2.4 Shortest-Path Algorithms |
|
|
26 | (3) |
|
|
29 | (2) |
|
|
31 | (1) |
|
2.7 Dimensionality Reduction |
|
|
32 | (2) |
|
2.8 Stochastic Processes on Graphs |
|
|
34 | (4) |
|
|
38 | (2) |
|
|
40 | (1) |
|
2.11 Spreading Activation |
|
|
41 | (1) |
|
2.12 Electrical Interpretation of Random Walks |
|
|
42 | (2) |
|
|
44 | (1) |
|
2.14 Linear Algebra Methods for Computing Harmonic Functions |
|
|
45 | (1) |
|
2.15 Method of Relaxations |
|
|
46 | (1) |
|
|
47 | (6) |
|
|
|
|
53 | (25) |
|
|
53 | (1) |
|
|
54 | (1) |
|
|
54 | (3) |
|
|
57 | (1) |
|
|
58 | (3) |
|
3.6 Preferential Attachment |
|
|
61 | (1) |
|
|
62 | (1) |
|
3.8 Clustering Coefficient |
|
|
62 | (1) |
|
|
63 | (2) |
|
|
65 | (2) |
|
|
67 | (1) |
|
|
67 | (1) |
|
3.13 Closeness Centrality |
|
|
68 | (1) |
|
3.14 Betweenness Centrality |
|
|
69 | (1) |
|
|
70 | (2) |
|
3.16 Dynamic Processes: Percolation |
|
|
72 | (2) |
|
3.17 Strong and Weak Ties |
|
|
74 | (2) |
|
|
76 | (1) |
|
|
76 | (2) |
|
|
78 | (13) |
|
4.1 Co-Occurrence Networks |
|
|
78 | (2) |
|
4.2 Syntactic Dependency Networks |
|
|
80 | (1) |
|
|
81 | (4) |
|
|
85 | (6) |
|
Part III Graph-Based Information Retrieval |
|
|
|
5 Link Analysis for the World Wide Web |
|
|
91 | (15) |
|
|
91 | (1) |
|
|
92 | (3) |
|
|
95 | (1) |
|
|
95 | (2) |
|
5.5 Combining Page Rank with Content Analysis |
|
|
97 | (1) |
|
5.6 Topic-Sensitive Link Analysis |
|
|
97 | (3) |
|
5.7 Query-Dependent Link Analysis |
|
|
100 | (1) |
|
5.8 Hyperlinked-Induced Topic Search |
|
|
101 | (2) |
|
5.9 Document Reranking with Induced Links |
|
|
103 | (3) |
|
|
106 | (17) |
|
6.1 Graph-Based Clustering |
|
|
108 | (3) |
|
|
111 | (2) |
|
|
113 | (1) |
|
6.4 The Kernighan--Lin Method |
|
|
114 | (1) |
|
6.5 Betweenness-Based Clustering |
|
|
115 | (2) |
|
|
117 | (2) |
|
6.7 Text Clustering Using Random Walks |
|
|
119 | (4) |
|
Part IV Graph-Based Natural Language Processing |
|
|
|
|
123 | (17) |
|
|
123 | (2) |
|
|
125 | (1) |
|
|
126 | (3) |
|
|
129 | (2) |
|
7.5 Word-Sense Disambiguation |
|
|
131 | (3) |
|
|
134 | (1) |
|
7.7 Sentiment and Subjectivity |
|
|
135 | (5) |
|
|
140 | (9) |
|
8.1 Part-of-Speech Tagging |
|
|
140 | (1) |
|
|
141 | (3) |
|
8.3 Prepositional-Phrase Attachment |
|
|
144 | (2) |
|
8.4 Co-Reference Resolution |
|
|
146 | (3) |
|
|
149 | (30) |
|
|
149 | (1) |
|
9.2 Semi-supervised Passage Retrieval |
|
|
150 | (4) |
|
|
154 | (2) |
|
|
156 | (5) |
|
|
161 | (1) |
|
|
162 | (3) |
|
|
165 | (1) |
|
9.8 Cross-Language Information Retrieval |
|
|
166 | (3) |
|
9.9 Information Extraction |
|
|
169 | (2) |
|
|
171 | (3) |
|
|
174 | (5) |
Bibliography |
|
179 | (12) |
Index |
|
191 | |
Rada Mihalcea is an Associate Professor in the Department of Computer Science and Engineering at the University of North Texas, where she leads the Language and Information Technologies research group. In 2009, she received the Presidential Early Career Award for Scientists and Engineers, awarded by President Barack Obama. She served on the editorial board of several journals, including Computational Linguistics, the Journal of Natural Language Engineering and Language Resources and Evaluations, and she co-chaired the Empirical Methods in Natural Language Processing conference in 2009 and the Association for Computational Linguistics conference in 2011. She has been published in IEEE Intelligent Systems, the Journal of Natural Language Engineering, the Journal of Machine Translation, Computational Intelligence, the International Journal of Semantic Computing and Artificial Intelligence Magazine. Dragomir Radev is a Professor in the School of Information, the Department of Electrical Engineering and Computer Science, and the Department of Linguistics at the University of Michigan, where he is leader of the Computational Linguistics and Information Retrieval research group (CLAIR). He has more than 100 publications in conferences and journals such as Communications of the ACM, the Journal of Artificial Intelligence Research, Bioinformatics, Computational Linguistics, Information Processing and Management and the American Journal of Political Science, among others. He is on the editorial boards of Information Retrieval, the Journal of Natural Language Engineering and the Journal of Artificial Intelligence Research. Radev is an ACM distinguished scientist as well as the coach of the US high school team in computational linguistics.