Preface |
|
xxiii | |
Acknowledgements |
|
xxix | |
|
Cluster and Complex System Design Issues |
|
|
1 | (30) |
|
Business Reasons for Clustered Systems |
|
|
2 | (3) |
|
|
2 | (1) |
|
|
3 | (2) |
|
Failures in Complex Systems |
|
|
5 | (9) |
|
|
7 | (2) |
|
|
9 | (1) |
|
|
10 | (1) |
|
|
10 | (1) |
|
|
10 | (2) |
|
|
12 | (1) |
|
Reconfiguration Around Faults |
|
|
13 | (1) |
|
|
14 | (1) |
|
|
14 | (6) |
|
|
15 | (1) |
|
Complexity and Reliability |
|
|
15 | (1) |
|
Synchronization Techniques |
|
|
16 | (1) |
|
Microprocessor Cache Coherency |
|
|
16 | (1) |
|
Kernel-Level Synchronization |
|
|
17 | (1) |
|
Application-Level Synchronization |
|
|
18 | (1) |
|
Synchronization Consistency Failures |
|
|
18 | (1) |
|
|
18 | (1) |
|
Locks and Lock Management |
|
|
19 | (1) |
|
|
20 | (1) |
|
|
20 | (2) |
|
|
20 | (1) |
|
|
21 | (1) |
|
|
22 | (1) |
|
|
22 | (4) |
|
Cost and Latency Trade-Off |
|
|
23 | (1) |
|
|
24 | (1) |
|
|
25 | (1) |
|
|
26 | (3) |
|
|
27 | (1) |
|
|
28 | (1) |
|
|
28 | (1) |
|
Failures in Clustered Systems |
|
|
29 | (1) |
|
|
29 | (1) |
|
|
29 | (1) |
|
|
30 | (1) |
|
|
30 | (1) |
|
Enterprise Cluster Computing Building Blocks |
|
|
31 | (20) |
|
Data Repositories and Infrastructure Services |
|
|
32 | (6) |
|
|
32 | (1) |
|
|
32 | (1) |
|
|
33 | (1) |
|
|
33 | (1) |
|
|
34 | (1) |
|
Oracle 8i OPS and Oracle 9i RAC |
|
|
34 | (2) |
|
|
36 | (1) |
|
|
36 | (1) |
|
|
36 | (1) |
|
|
37 | (1) |
|
|
37 | (1) |
|
Business Logic and Application Service |
|
|
38 | (4) |
|
Packaged Business Solutions |
|
|
38 | (1) |
|
|
39 | (3) |
|
User Access Services: Web Farms |
|
|
42 | (3) |
|
|
45 | (4) |
|
|
45 | (1) |
|
|
45 | (1) |
|
High-Performance Computing |
|
|
46 | (1) |
|
|
46 | (1) |
|
Sun Cluster Runtime Environment |
|
|
47 | (1) |
|
Sun MPI Communications Library |
|
|
47 | (1) |
|
|
47 | (1) |
|
Sun Scalable Scientific Subroutine Library |
|
|
47 | (1) |
|
Prism Parallel Development Environment |
|
|
48 | (1) |
|
|
49 | (1) |
|
Technologies for Building Distributed Applications |
|
|
49 | (2) |
|
|
49 | (1) |
|
|
50 | (1) |
|
Sun Cluster 3.0 Architecture |
|
|
51 | (64) |
|
|
52 | (5) |
|
Enterprise Infrastructure |
|
|
52 | (1) |
|
Service Point Architecture |
|
|
53 | (1) |
|
|
53 | (1) |
|
High Availability Versus Disaster Recovery |
|
|
53 | (3) |
|
Data Deletion and Corruption Recovery |
|
|
56 | (1) |
|
|
57 | (4) |
|
|
58 | (1) |
|
|
59 | (1) |
|
|
60 | (1) |
|
|
61 | (30) |
|
|
62 | (1) |
|
|
62 | (1) |
|
|
63 | (1) |
|
|
64 | (1) |
|
Cluster Device Connectivity |
|
|
65 | (1) |
|
|
66 | (1) |
|
Primary and Secondary I/O Paths |
|
|
66 | (1) |
|
|
67 | (1) |
|
|
68 | (1) |
|
|
69 | (1) |
|
|
69 | (1) |
|
|
70 | (1) |
|
|
71 | (1) |
|
Read and Write Implementation |
|
|
71 | (2) |
|
|
73 | (1) |
|
File and Attribute Caches |
|
|
73 | (2) |
|
|
75 | (2) |
|
Application Binaries, Data, and Logs |
|
|
77 | (1) |
|
|
78 | (1) |
|
Node Separation Performance Impact |
|
|
79 | (1) |
|
|
80 | (1) |
|
Global Networking Service |
|
|
81 | (2) |
|
Packet Distribution Mechanisms |
|
|
83 | (1) |
|
|
84 | (1) |
|
Client Connection Recovery After a GIN Node Failure |
|
|
84 | (1) |
|
|
85 | (1) |
|
|
86 | (1) |
|
|
87 | (1) |
|
|
87 | (2) |
|
|
89 | (1) |
|
Cluster Configuration Control |
|
|
89 | (1) |
|
|
89 | (1) |
|
|
90 | (1) |
|
Amnesia and Temporally Split Configurations |
|
|
90 | (1) |
|
|
91 | (8) |
|
|
91 | (1) |
|
Failure Handling and Outage Time |
|
|
92 | (1) |
|
|
92 | (1) |
|
Public Network Monitoring |
|
|
93 | (1) |
|
|
94 | (2) |
|
Process Monitoring Facility |
|
|
96 | (1) |
|
|
96 | (1) |
|
|
96 | (1) |
|
|
97 | (1) |
|
|
97 | (1) |
|
|
98 | (1) |
|
|
98 | (1) |
|
|
99 | (8) |
|
Data Services and Application Agents |
|
|
99 | (1) |
|
Agent Application Program Interfaces |
|
|
100 | (1) |
|
|
100 | (1) |
|
|
100 | (2) |
|
|
102 | (2) |
|
|
104 | (1) |
|
Resource Group Manager Daemon |
|
|
105 | (1) |
|
|
106 | (1) |
|
|
107 | (8) |
|
|
107 | (1) |
|
|
108 | (1) |
|
Majority Voting and Quorum Principles |
|
|
108 | (1) |
|
CMM Reconfiguration Process |
|
|
109 | (1) |
|
SCSI-2 and SCSI-3 Command Set Support |
|
|
110 | (1) |
|
|
111 | (1) |
|
Uneven Cluster Partitions |
|
|
111 | (1) |
|
|
112 | (1) |
|
|
113 | (1) |
|
|
113 | (2) |
|
|
115 | (18) |
|
|
116 | (1) |
|
|
117 | (1) |
|
|
118 | (5) |
|
|
118 | (1) |
|
Consolidated Cluster Node Messages |
|
|
119 | (1) |
|
AnswerBook2 Documentation Server |
|
|
119 | (1) |
|
Sun Management Center Server |
|
|
120 | (1) |
|
Solaris Management Console |
|
|
121 | (1) |
|
|
122 | (1) |
|
|
123 | (1) |
|
|
124 | (1) |
|
Sun Explorer Data Collector |
|
|
124 | (1) |
|
|
125 | (1) |
|
|
126 | (1) |
|
|
126 | (2) |
|
|
128 | (1) |
|
|
128 | (1) |
|
Backup, Restore, and Recovery |
|
|
129 | (3) |
|
|
129 | (1) |
|
|
130 | (1) |
|
|
130 | (1) |
|
Directly Attached Tape Drives |
|
|
130 | (1) |
|
Web Start Flash Technology |
|
|
131 | (1) |
|
|
131 | (1) |
|
|
132 | (1) |
|
Case Study 1-File Server Cluster |
|
|
133 | (24) |
|
|
133 | (1) |
|
|
134 | (4) |
|
|
134 | (1) |
|
|
134 | (1) |
|
|
135 | (1) |
|
|
135 | (1) |
|
|
136 | (1) |
|
|
136 | (1) |
|
|
136 | (1) |
|
|
137 | (1) |
|
|
137 | (1) |
|
|
137 | (1) |
|
Performance, Sizing, and Capacity Planning |
|
|
138 | (1) |
|
|
138 | (1) |
|
|
138 | (5) |
|
|
139 | (1) |
|
|
139 | (1) |
|
|
139 | (2) |
|
|
141 | (1) |
|
|
142 | (1) |
|
|
142 | (1) |
|
Recommended Hardware Configuration |
|
|
143 | (12) |
|
|
143 | (2) |
|
|
145 | (1) |
|
|
146 | (1) |
|
Options Considered But Discounted |
|
|
146 | (1) |
|
|
147 | (1) |
|
|
147 | (1) |
|
|
148 | (1) |
|
Options Considered But Discounted |
|
|
148 | (1) |
|
Network and Interconnects |
|
|
149 | (1) |
|
|
149 | (1) |
|
Options Considered But Discounted |
|
|
149 | (1) |
|
|
150 | (1) |
|
|
151 | (1) |
|
Options Considered But Discounted |
|
|
151 | (1) |
|
Backup, Restore, and Recovery |
|
|
152 | (2) |
|
|
154 | (1) |
|
|
154 | (1) |
|
|
155 | (2) |
|
Case Study 2-Database Cluster |
|
|
157 | (40) |
|
|
157 | (1) |
|
Information Technology Organization |
|
|
158 | (1) |
|
|
159 | (2) |
|
|
161 | (1) |
|
|
162 | (2) |
|
|
163 | (1) |
|
|
163 | (1) |
|
|
164 | (3) |
|
|
165 | (1) |
|
|
165 | (1) |
|
|
165 | (1) |
|
|
166 | (1) |
|
|
166 | (1) |
|
|
166 | (1) |
|
|
166 | (1) |
|
|
167 | (8) |
|
|
169 | (1) |
|
|
170 | (1) |
|
|
170 | (1) |
|
|
170 | (1) |
|
|
171 | (1) |
|
Automatic Lock Remastering |
|
|
171 | (1) |
|
|
172 | (1) |
|
Local GCS Lock Mode Versus Global |
|
|
172 | (1) |
|
Cache Fusion Read-Read Example |
|
|
173 | (2) |
|
Recommended Hardware Configuration |
|
|
175 | (20) |
|
|
175 | (1) |
|
|
176 | (1) |
|
|
176 | (4) |
|
Options Considered But Discounted |
|
|
180 | (1) |
|
|
180 | (5) |
|
|
185 | (2) |
|
|
187 | (1) |
|
Options Considered But Discounted |
|
|
187 | (1) |
|
|
187 | (2) |
|
|
189 | (1) |
|
Options Considered But Discounted |
|
|
190 | (1) |
|
Environmental Requirements |
|
|
190 | (1) |
|
|
190 | (2) |
|
|
192 | (1) |
|
Ambient Relative Humidity |
|
|
192 | (1) |
|
Backup, Restore, and Recovery |
|
|
193 | (1) |
|
|
194 | (1) |
|
Options Considered But Discounted |
|
|
194 | (1) |
|
|
195 | (2) |
A. Sun Cluster 3.0 Design Checklists |
|
197 | (10) |
|
Business Case Considerations |
|
|
198 | (1) |
|
|
199 | (1) |
|
Top-Level Design Documentation |
|
|
200 | (1) |
|
|
200 | (1) |
|
|
201 | (1) |
|
|
202 | (1) |
|
|
203 | (1) |
|
Software Environment Design |
|
|
203 | (1) |
|
|
204 | (1) |
|
Systems Management Requirements |
|
|
204 | (1) |
|
|
205 | (2) |
B. Sun Cluster Technology History and Perspective |
|
207 | (14) |
|
SPARCcluster PDB 1.x and SPARCcluster HA 1.x History |
|
|
207 | (2) |
|
|
207 | (1) |
|
|
208 | (1) |
|
|
209 | (2) |
|
Sun Cluster 2.2 and 3.0 Feature Comparison |
|
|
211 | (10) |
|
|
211 | (1) |
|
|
212 | (1) |
|
|
212 | (2) |
|
|
214 | (1) |
|
|
215 | (1) |
|
Cluster Configuration Database |
|
|
215 | (2) |
|
|
217 | (1) |
|
|
218 | (1) |
|
|
219 | (1) |
|
|
219 | (1) |
|
|
220 | (1) |
C. Data Center Guidelines |
|
221 | (14) |
|
Hardware Platform Stability |
|
|
221 | (1) |
|
Server Consolidation in a Common Rack |
|
|
222 | (1) |
|
System Component Identification |
|
|
222 | (2) |
|
|
223 | (1) |
|
|
223 | (1) |
|
Component and Cable Labeling |
|
|
224 | (1) |
|
|
224 | (1) |
|
|
225 | (1) |
|
|
225 | (1) |
|
|
226 | (1) |
|
System Installation and Configuration Documentation |
|
|
227 | (2) |
|
|
229 | (1) |
|
Maintenance and Patch Strategy |
|
|
229 | (1) |
|
|
230 | (1) |
|
New Release Upgrade Process |
|
|
231 | (1) |
|
Support Agreement and Associated Response Time |
|
|
231 | (1) |
|
Backup-and-Restore Testing |
|
|
232 | (1) |
|
Cluster Recovery Procedures |
|
|
233 | (1) |
|
|
233 | (2) |
D. Tools |
|
235 | (14) |
|
|
236 | (4) |
|
|
238 | (1) |
|
|
238 | (2) |
|
Reliability Block Diagram Analysis |
|
|
240 | (1) |
|
Failure Modes and Effects Analysis |
|
|
241 | (4) |
|
|
242 | (2) |
|
|
244 | (1) |
|
|
245 | (4) |
|
|
245 | (1) |
|
|
246 | (3) |
Acronyms, Abbreviations, and Glossary |
|
249 | (10) |
Bibliography |
|
259 | (2) |
Index |
|
261 | |