Preface |
|
xiii | |
|
|
xvii | |
|
Data mining meets grid computing: Time to dance? |
|
|
1 | (16) |
|
|
|
|
|
|
|
|
2 | (1) |
|
|
3 | (3) |
|
Complex data mining problems |
|
|
3 | (1) |
|
|
4 | (2) |
|
|
6 | (3) |
|
Grid computing challenges |
|
|
9 | (1) |
|
Data mining grid - mining grid data |
|
|
9 | (3) |
|
Data mining grid: a grid facilitating large-scale data mining |
|
|
9 | (2) |
|
Mining grid data: analyzing grid systems with data mining techniques |
|
|
11 | (1) |
|
|
12 | (1) |
|
Summary of Chapters in this Volume |
|
|
13 | (4) |
|
Data analysis services in the knowledge grid |
|
|
17 | (20) |
|
|
|
|
|
|
17 | (1) |
|
|
18 | (2) |
|
|
20 | (9) |
|
The Knowledge Grid architecture |
|
|
21 | (3) |
|
|
24 | (5) |
|
|
29 | (2) |
|
Design of Knowledge Grid applications |
|
|
31 | (3) |
|
|
31 | (1) |
|
UML application modelling |
|
|
32 | (1) |
|
Applications and experiments |
|
|
33 | (1) |
|
|
34 | (3) |
|
GridMiner: An advanced support for e-science analytics |
|
|
37 | (20) |
|
|
|
|
|
37 | (2) |
|
Rationale behind the design and development of GridMiner |
|
|
39 | (1) |
|
|
40 | (1) |
|
Knowledge discovery process and its support by the GridMiner |
|
|
41 | (9) |
|
Phases of knowledge discovery |
|
|
42 | (3) |
|
|
45 | (1) |
|
|
46 | (1) |
|
Data mining services and OLAP |
|
|
47 | (2) |
|
|
49 | (1) |
|
|
50 | (2) |
|
|
52 | (1) |
|
High-level data mining model |
|
|
52 | (1) |
|
Data mining query language |
|
|
52 | (1) |
|
Distributed mining of data streams |
|
|
52 | (1) |
|
|
53 | (4) |
|
ADaM services: Scientific data mining in the service-oriented architecture paradigm |
|
|
57 | (14) |
|
|
|
|
|
|
|
|
|
58 | (1) |
|
|
58 | (2) |
|
|
60 | (1) |
|
Mining in a service-oriented architecture |
|
|
61 | (1) |
|
|
62 | (4) |
|
Implementation architecture |
|
|
63 | (1) |
|
|
64 | (1) |
|
|
64 | (2) |
|
|
66 | (3) |
|
|
67 | (1) |
|
|
68 | (1) |
|
|
69 | (2) |
|
Mining for misconfigured machines in grid systems |
|
|
71 | (20) |
|
|
|
|
|
|
71 | (2) |
|
Preliminaries and related work |
|
|
73 | (2) |
|
System misconfiguration detection |
|
|
73 | (1) |
|
|
74 | (1) |
|
Acquiring, pre-processing and storing data |
|
|
75 | (2) |
|
Data sources and acquisition |
|
|
75 | (1) |
|
|
75 | (1) |
|
|
76 | (1) |
|
|
77 | (3) |
|
|
77 | (1) |
|
|
78 | (1) |
|
|
78 | (2) |
|
Correctness and termination |
|
|
80 | (1) |
|
|
80 | (2) |
|
|
82 | (6) |
|
|
82 | (1) |
|
|
83 | (2) |
|
|
85 | (3) |
|
Conclusions and future work |
|
|
88 | (3) |
|
FAEHIM: Federated Analysis Environment for Heterogeneous Intelligent Mining |
|
|
91 | (14) |
|
|
|
|
91 | (2) |
|
Requirements of a distributed knowledge discovery framework |
|
|
93 | (1) |
|
Knowledge discovery specific requirements |
|
|
93 | (1) |
|
Distributed framework specific requirements |
|
|
94 | (1) |
|
Workflow-based knowledge discovery |
|
|
94 | (1) |
|
|
95 | (1) |
|
Data mining service framework |
|
|
96 | (3) |
|
Distributed data mining services |
|
|
99 | (1) |
|
|
100 | (1) |
|
|
101 | (1) |
|
|
101 | (3) |
|
Evaluating the framework accuracy |
|
|
102 | (1) |
|
Evaluating the running time of the framework |
|
|
103 | (1) |
|
|
104 | (1) |
|
Scalable and privacy preserving distributed data analysis over a service-oriented platform |
|
|
105 | (14) |
|
|
|
105 | (1) |
|
A service-oriented solution |
|
|
106 | (1) |
|
|
107 | (2) |
|
Types of distributed data analysis |
|
|
107 | (1) |
|
A brief review of distributed data analysis |
|
|
108 | (1) |
|
Data mining services and data analysis management systems |
|
|
108 | (1) |
|
Model-based scalable, privacy preserving, distributed data analysis |
|
|
109 | (2) |
|
Hierarchical local data abstractions |
|
|
109 | (1) |
|
Learning global models from local abstractions |
|
|
110 | (1) |
|
Modelling distributed data mining and workflow processes |
|
|
111 | (1) |
|
|
111 | (1) |
|
|
112 | (1) |
|
|
112 | (2) |
|
Performance of running distributed data analysis on BPEL |
|
|
112 | (1) |
|
Issues specific to service-oriented distributed data analysis |
|
|
113 | (1) |
|
Compatibility of Web services development tools |
|
|
114 | (1) |
|
Further research directions |
|
|
114 | (2) |
|
Optimizing BPEL4WS process execution |
|
|
114 | (1) |
|
Improved support of data analysis process management |
|
|
115 | (1) |
|
Improved support of data privacy preservation |
|
|
115 | (1) |
|
|
116 | (3) |
|
Building and using analytical workflows in Discovery Net |
|
|
119 | (22) |
|
|
|
|
|
|
119 | (2) |
|
|
120 | (1) |
|
|
121 | (5) |
|
|
121 | (1) |
|
Workflow representation in DPML |
|
|
122 | (1) |
|
|
123 | (1) |
|
|
123 | (1) |
|
Multiple execution models |
|
|
123 | (1) |
|
|
124 | (1) |
|
Streaming and batch transfer of data elements |
|
|
124 | (1) |
|
|
125 | (1) |
|
|
125 | (1) |
|
Architecture for Discovery Net |
|
|
126 | (5) |
|
Motivation for a new server architecture |
|
|
126 | (1) |
|
Management of hosting environments |
|
|
127 | (1) |
|
|
127 | (1) |
|
Collaborative workflow platform |
|
|
127 | (1) |
|
|
127 | (2) |
|
Activity service definition layer |
|
|
129 | (1) |
|
|
130 | (1) |
|
Collaboration and execution services |
|
|
130 | (1) |
|
|
130 | (1) |
|
Prototyping and production clients |
|
|
130 | (1) |
|
|
131 | (2) |
|
Example of a workflow study |
|
|
133 | (3) |
|
|
133 | (1) |
|
|
133 | (1) |
|
Service for transforming event data into patient annotations |
|
|
134 | (1) |
|
Service for defining exclusions |
|
|
134 | (1) |
|
Service for defining exposures |
|
|
135 | (1) |
|
Service for building the classification model |
|
|
135 | (1) |
|
|
135 | (1) |
|
|
136 | (1) |
|
|
136 | (5) |
|
Building workflows that traverse the bioinformatics data landscape |
|
|
141 | (24) |
|
|
|
|
|
|
|
141 | (2) |
|
The bioinformatics data landscape |
|
|
143 | (1) |
|
The bioinformatics experiment landscape |
|
|
143 | (2) |
|
Taverna for bioinformatics experiments |
|
|
145 | (3) |
|
Three-tiered enactment in Taverna |
|
|
146 | (1) |
|
The open-typing data models |
|
|
147 | (1) |
|
Building workflows in Taverna |
|
|
148 | (2) |
|
Designing a SCUFL workflow |
|
|
149 | (1) |
|
|
150 | (9) |
|
|
152 | (1) |
|
Current approaches and issues |
|
|
153 | (1) |
|
|
154 | (2) |
|
Candidate genes involved in trypanosomiasis resistance |
|
|
156 | (1) |
|
Workflows and the systematic approach |
|
|
157 | (2) |
|
|
159 | (6) |
|
Specification of distributed data mining workflows with DataMiningGrid |
|
|
165 | (14) |
|
|
|
|
165 | (2) |
|
DataMiningGrid environment |
|
|
167 | (2) |
|
|
167 | (1) |
|
|
167 | (1) |
|
|
167 | (1) |
|
|
168 | (1) |
|
Operations for workflow construction |
|
|
169 | (2) |
|
|
169 | (1) |
|
|
169 | (1) |
|
|
170 | (1) |
|
|
170 | (1) |
|
|
170 | (1) |
|
|
171 | (1) |
|
|
171 | (1) |
|
|
171 | (2) |
|
|
173 | (2) |
|
Evaluation criteria and experimental methodology |
|
|
173 | (1) |
|
|
173 | (2) |
|
Classifier comparison scenario |
|
|
175 | (1) |
|
|
175 | (1) |
|
Discussion and related work |
|
|
175 | (1) |
|
|
176 | (1) |
|
|
176 | (3) |
|
Anteater: Service-oriented data mining |
|
|
179 | (22) |
|
|
|
|
|
179 | (2) |
|
|
181 | (2) |
|
|
183 | (6) |
|
|
185 | (1) |
|
Global persistent storage |
|
|
185 | (1) |
|
|
186 | (1) |
|
|
187 | (2) |
|
Parallel algorithms for data mining |
|
|
189 | (6) |
|
|
189 | (4) |
|
|
193 | (2) |
|
|
195 | (1) |
|
|
196 | (1) |
|
|
197 | (1) |
|
Conclusions and future work |
|
|
198 | (3) |
|
DMGA: A generic brokering-based Data Mining Grid Architecture |
|
|
201 | (20) |
|
|
|
|
|
|
|
201 | (1) |
|
|
202 | (2) |
|
|
204 | (2) |
|
|
206 | (2) |
|
|
208 | (1) |
|
Brokering-based data mining grid architecture |
|
|
209 | (1) |
|
Use cases: Apriori, ID3 and J4.8 algorithms |
|
|
210 | (6) |
|
Horizontal composition use case: Apriori |
|
|
210 | (3) |
|
Vertical composition use cases: ID3 and J4.8 |
|
|
213 | (3) |
|
|
216 | (1) |
|
|
217 | (4) |
|
Grid-based data mining with the Environmental Scenario Search Engine (ESSE) |
|
|
221 | (26) |
|
|
|
|
|
|
|
Environmental data source: NCEP/NCAR reanalysis data set |
|
|
222 | (1) |
|
|
223 | (8) |
|
|
224 | (2) |
|
|
226 | (1) |
|
|
227 | (2) |
|
Relative importance of parameters |
|
|
229 | (1) |
|
Fuzzy search optimization |
|
|
229 | (2) |
|
|
231 | (6) |
|
Database schema optimization |
|
|
231 | (2) |
|
|
233 | (2) |
|
|
235 | (1) |
|
|
235 | (2) |
|
|
237 | (6) |
|
Global air temperature trends |
|
|
238 | (1) |
|
Statistics of extreme weather events |
|
|
239 | (1) |
|
|
239 | (4) |
|
|
243 | (4) |
|
Data pre-processing using OGSA-DAI |
|
|
247 | (16) |
|
|
|
|
247 | (1) |
|
Data pre-processing for grid-enabled data mining |
|
|
248 | (1) |
|
Using OGSA-DAI to support data mining applications |
|
|
248 | (7) |
|
OGSA-DAI's activity framework |
|
|
249 | (4) |
|
OGSA-DAI workflows for data management and pre-processing |
|
|
253 | (2) |
|
Data pre-processing scenarios in data mining applications |
|
|
255 | (3) |
|
Calculating a data summary |
|
|
255 | (1) |
|
Discovering association rules in protein unfolding simulations |
|
|
256 | (1) |
|
Mining distributed medical databases |
|
|
257 | (1) |
|
State-of-the-art solutions for grid data management |
|
|
258 | (1) |
|
|
259 | (1) |
|
|
259 | (1) |
|
|
260 | (3) |
Index |
|
263 | |