| Preface |
|
xvii | |
| Author |
|
xxi | |
|
1 Biomedical Data Warehousing |
|
|
1 | (16) |
|
Nature of Biomedical Data |
|
|
1 | (3) |
|
Nature of Warehoused Data |
|
|
4 | (3) |
|
|
|
7 | (1) |
|
|
|
8 | (3) |
|
Data Queries, Reports, and Marts |
|
|
9 | (1) |
|
Data Issues and Hypotheses |
|
|
10 | (1) |
|
Performance and Control Data |
|
|
11 | (1) |
|
|
|
11 | (1) |
|
|
|
12 | (1) |
|
|
|
13 | (4) |
|
|
|
14 | (3) |
| Section I Alpha Version |
|
|
2 Dimensional Data Modeling |
|
|
17 | (22) |
|
Evolution of Data Warehouses |
|
|
17 | (5) |
|
|
|
18 | (1) |
|
|
|
19 | (3) |
|
|
|
22 | (7) |
|
Dimensions and Subdimensions |
|
|
24 | (2) |
|
|
|
26 | (3) |
|
Transposing Dimensional Schema |
|
|
29 | (3) |
|
|
|
32 | (1) |
|
|
|
33 | (6) |
|
|
|
34 | (3) |
|
|
|
37 | (2) |
|
3 Understanding Source Data |
|
|
39 | (36) |
|
Implicit versus Explicit Data |
|
|
40 | (2) |
|
|
|
42 | (11) |
|
|
|
44 | (3) |
|
|
|
47 | (6) |
|
|
|
48 | (2) |
|
BFO Spatiotemporal Regions |
|
|
50 | (1) |
|
|
|
51 | (1) |
|
|
|
52 | (1) |
|
|
|
52 | (1) |
|
|
|
52 | (1) |
|
|
|
53 | (4) |
|
IAO Material Information Bearer |
|
|
54 | (1) |
|
IAO Information Content Entity |
|
|
55 | (1) |
|
|
|
56 | (1) |
|
|
|
57 | (7) |
|
|
|
58 | (3) |
|
|
|
61 | (2) |
|
|
|
63 | (1) |
|
|
|
64 | (3) |
|
|
|
67 | (1) |
|
|
|
68 | (3) |
|
|
|
68 | (2) |
|
|
|
70 | (1) |
|
|
|
71 | (4) |
|
|
|
75 | (42) |
|
|
|
76 | (2) |
|
|
|
78 | (1) |
|
|
|
79 | (7) |
|
|
|
80 | (1) |
|
|
|
81 | (1) |
|
|
|
82 | (1) |
|
|
|
83 | (1) |
|
|
|
84 | (2) |
|
|
|
85 | (1) |
|
|
|
86 | (10) |
|
|
|
86 | (1) |
|
|
|
87 | (3) |
|
|
|
90 | (1) |
|
|
|
91 | (1) |
|
|
|
92 | (1) |
|
|
|
93 | (1) |
|
|
|
94 | (1) |
|
|
|
95 | (1) |
|
|
|
96 | (9) |
|
|
|
96 | (3) |
|
|
|
99 | (1) |
|
|
|
100 | (2) |
|
|
|
102 | (1) |
|
|
|
102 | (1) |
|
|
|
103 | (1) |
|
Unit of Measure Dimension |
|
|
103 | (2) |
|
|
|
105 | (1) |
|
|
|
105 | (1) |
|
|
|
106 | (1) |
|
|
|
106 | (3) |
|
|
|
107 | (2) |
|
|
|
109 | (1) |
|
|
|
109 | (1) |
|
|
|
109 | (1) |
|
|
|
109 | (8) |
|
5 Star Dimension Design Pattern |
|
|
117 | (32) |
|
|
|
117 | (1) |
|
Master Data: Definition Tables |
|
|
118 | (3) |
|
Slowly Changing Dimensions |
|
|
121 | (8) |
|
|
|
125 | (4) |
|
Source Keys: Context and Reference Tables |
|
|
129 | (5) |
|
Fact Participation: Group and Bridge Tables |
|
|
134 | (3) |
|
Interconnections: Hierarchy Tables |
|
|
137 | (5) |
|
|
|
140 | (2) |
|
|
|
142 | (1) |
|
|
|
143 | (6) |
|
|
|
149 | (48) |
|
|
|
149 | (1) |
|
Selecting and Preparing Sources |
|
|
150 | (6) |
|
Generating Surrogate Keys |
|
|
156 | (3) |
|
Simple Dimensions and Facts |
|
|
159 | (13) |
|
First Source: Patient Master Data |
|
|
159 | (3) |
|
Alternative G-B-H Processing |
|
|
162 | (2) |
|
Second Source: Patient Address Facts |
|
|
164 | (6) |
|
|
|
170 | (2) |
|
|
|
172 | (3) |
|
Complicated Dimensions and Facts |
|
|
175 | (13) |
|
Third Source: Basic Lab Results |
|
|
175 | (13) |
|
Finalizing Alpha Structures |
|
|
188 | (1) |
|
|
|
189 | (8) |
|
|
|
189 | (1) |
|
Dimension-Subdimension Counts |
|
|
190 | (1) |
|
|
|
190 | (1) |
|
|
|
191 | (6) |
| Section II Beta Version |
|
|
|
|
197 | (36) |
|
|
|
198 | (10) |
|
|
|
199 | (1) |
|
|
|
199 | (1) |
|
|
|
200 | (1) |
|
|
|
201 | (1) |
|
|
|
201 | (1) |
|
|
|
202 | (6) |
|
|
|
208 | (16) |
|
|
|
208 | (1) |
|
|
|
209 | (1) |
|
|
|
209 | (2) |
|
|
|
211 | (1) |
|
|
|
212 | (3) |
|
|
|
213 | (1) |
|
|
|
213 | (1) |
|
Codeset Translation Metadata |
|
|
214 | (1) |
|
|
|
215 | (1) |
|
|
|
215 | (9) |
|
First Alpha Source: Patient Master Data |
|
|
216 | (2) |
|
Second Alpha Source: Patient Address Facts |
|
|
218 | (4) |
|
Third Alpha Source: Basic Lab Results |
|
|
222 | (2) |
|
|
|
224 | (2) |
|
|
|
224 | (1) |
|
|
|
225 | (1) |
|
|
|
226 | (1) |
|
Reinitializing the Warehouse |
|
|
226 | (7) |
|
|
|
226 | (1) |
|
|
|
227 | (1) |
|
|
|
228 | (1) |
|
|
|
229 | (1) |
|
|
|
229 | (3) |
|
|
|
230 | (1) |
|
|
|
231 | (1) |
|
|
|
232 | (1) |
|
|
|
233 | (34) |
|
Source Mapping Challenges |
|
|
233 | (24) |
|
Coverage and Seamlessness |
|
|
234 | (4) |
|
|
|
238 | (5) |
|
|
|
243 | (11) |
|
|
|
244 | (3) |
|
|
|
247 | (3) |
|
|
|
250 | (4) |
|
|
|
254 | (3) |
|
|
|
257 | (2) |
|
|
|
259 | (8) |
|
|
|
260 | (3) |
|
|
|
263 | (1) |
|
|
|
264 | (3) |
|
9 Generalizing ETL Workflows |
|
|
267 | (42) |
|
Standardizing Source Data |
|
|
267 | (8) |
|
|
|
268 | (4) |
|
|
|
272 | (3) |
|
|
|
275 | (1) |
|
|
|
275 | (2) |
|
|
|
277 | (5) |
|
Source Data Consolidation |
|
|
282 | (1) |
|
External versus Internal Sourcing |
|
|
283 | (1) |
|
|
|
284 | (3) |
|
|
|
286 | (1) |
|
|
|
287 | (4) |
|
|
|
288 | (1) |
|
|
|
289 | (1) |
|
|
|
290 | (1) |
|
Checkpoint, Restart, and Bulk Loading |
|
|
290 | (1) |
|
|
|
291 | (8) |
|
|
|
292 | (2) |
|
General versus Functional Transformations |
|
|
294 | (2) |
|
Resolving "-All-" Entries |
|
|
296 | (1) |
|
Early versus Late Binding |
|
|
297 | (2) |
|
|
|
299 | (7) |
|
|
|
300 | (1) |
|
|
|
300 | (4) |
|
|
|
304 | (1) |
|
Assigning Master IDs to ETL Layers |
|
|
305 | (1) |
|
|
|
306 | (3) |
|
|
|
309 | (34) |
|
|
|
312 | (5) |
|
|
|
314 | (2) |
|
|
|
316 | (1) |
|
|
|
317 | (4) |
|
|
|
319 | (1) |
|
|
|
320 | (1) |
|
|
|
321 | (6) |
|
|
|
323 | (1) |
|
|
|
324 | (3) |
|
|
|
327 | (5) |
|
|
|
328 | (1) |
|
|
|
329 | (1) |
|
Transactional Redistribution |
|
|
330 | (1) |
|
|
|
331 | (1) |
|
|
|
332 | (1) |
|
|
|
332 | (1) |
|
|
|
332 | (2) |
|
|
|
334 | (1) |
|
|
|
334 | (1) |
|
|
|
335 | (1) |
|
|
|
335 | (1) |
|
|
|
336 | (7) |
|
|
|
339 | (1) |
|
|
|
340 | (3) |
|
|
|
343 | (34) |
|
|
|
343 | (9) |
|
|
|
347 | (1) |
|
|
|
348 | (4) |
|
|
|
352 | (4) |
|
|
|
356 | (3) |
|
|
|
359 | (1) |
|
|
|
360 | (1) |
|
Definition Change Processing |
|
|
361 | (6) |
|
Slowly Changing Dimensions |
|
|
361 | (3) |
|
Multiple Simultaneous Transactions |
|
|
364 | (3) |
|
Building SCD Transaction Sets |
|
|
367 | (5) |
|
Staging Existing Definitions |
|
|
367 | (1) |
|
Assigning Deep Row Numbers |
|
|
368 | (1) |
|
Distribute Deep Non-SCD Updates |
|
|
369 | (1) |
|
Assigning Relative Wide Row Numbers |
|
|
370 | (2) |
|
Applying Transactions to Dimensions |
|
|
372 | (3) |
|
|
|
373 | (1) |
|
Auto-Adopt Orphan Definitions |
|
|
374 | (1) |
|
|
|
374 | (1) |
|
Update Existing Definitions |
|
|
375 | (1) |
|
|
|
375 | (2) |
|
|
|
377 | (30) |
|
|
|
378 | (5) |
|
|
|
378 | (2) |
|
Generating Factless Facts |
|
|
380 | (2) |
|
|
|
382 | (1) |
|
|
|
383 | (8) |
|
|
|
384 | (2) |
|
|
|
386 | (1) |
|
|
|
387 | (2) |
|
|
|
389 | (1) |
|
Distribute Surrogates to Group IDs |
|
|
390 | (1) |
|
Distribute Group IDs to Bridges |
|
|
390 | (1) |
|
|
|
390 | (1) |
|
|
|
391 | (1) |
|
|
|
391 | (3) |
|
|
|
392 | (2) |
|
|
|
394 | (1) |
|
|
|
394 | (6) |
|
|
|
395 | (3) |
|
|
|
396 | (1) |
|
Assigning the Implicit UOM |
|
|
397 | (1) |
|
|
|
398 | (1) |
|
|
|
399 | (1) |
|
|
|
400 | (1) |
|
|
|
400 | (1) |
|
|
|
400 | (1) |
|
|
|
401 | (1) |
|
|
|
401 | (1) |
|
|
|
402 | (5) |
|
|
|
403 | (4) |
|
|
|
407 | (16) |
|
|
|
407 | (3) |
|
|
|
410 | (1) |
|
Verification and Validation |
|
|
411 | (8) |
|
|
|
412 | (13) |
|
|
|
412 | (1) |
|
|
|
413 | (2) |
|
|
|
415 | (1) |
|
|
|
416 | (1) |
|
|
|
416 | (1) |
|
|
|
417 | (1) |
|
|
|
418 | (1) |
|
|
|
419 | (4) |
| Section III Gamma Version |
|
|
14 Finalizing ETL Workflows |
|
|
423 | (28) |
|
Alternatively Sourced Keys |
|
|
425 | (6) |
|
Sourcing Compound Natural Keys |
|
|
425 | (2) |
|
Sourcing Warehouse Surrogates |
|
|
427 | (1) |
|
Alternatives in the Reference Pipe |
|
|
428 | (3) |
|
|
|
431 | (1) |
|
|
|
432 | (5) |
|
Value Trimming and Cleanup |
|
|
433 | (1) |
|
|
|
433 | (1) |
|
|
|
433 | (3) |
|
|
|
436 | (1) |
|
|
|
437 | (4) |
|
Undetermined Dimensionality |
|
|
441 | (4) |
|
|
|
445 | (1) |
|
|
|
446 | (1) |
|
|
|
447 | (3) |
|
Continuous Functional Evolution |
|
|
450 | (1) |
|
15 Establishing Data Controls |
|
|
451 | (30) |
|
Finalizing Warehouse Design |
|
|
451 | (11) |
|
|
|
451 | (1) |
|
|
|
452 | (1) |
|
Indexing and Partitioning |
|
|
453 | (1) |
|
|
|
454 | (3) |
|
|
|
457 | (5) |
|
Redaction Control Settings |
|
|
462 | (3) |
|
|
|
465 | (8) |
|
|
|
466 | (1) |
|
|
|
466 | (1) |
|
Unexpected and Undesired Values |
|
|
467 | (2) |
|
|
|
469 | (4) |
|
|
|
473 | (2) |
|
|
|
475 | (1) |
|
Implementing Dataset Controls |
|
|
476 | (3) |
|
|
|
479 | (2) |
|
|
|
481 | (30) |
|
|
|
481 | (5) |
|
|
|
486 | (10) |
|
|
|
487 | (1) |
|
New Dimension and Fact Tables |
|
|
488 | (4) |
|
|
|
490 | (1) |
|
|
|
491 | (1) |
|
|
|
492 | (3) |
|
|
|
495 | (1) |
|
|
|
496 | (1) |
|
|
|
496 | (2) |
|
Populating Control Values |
|
|
498 | (2) |
|
|
|
500 | (11) |
|
|
|
502 | (5) |
|
Metric—Control Fact Views |
|
|
507 | (4) |
|
|
|
511 | (54) |
|
|
|
511 | (7) |
|
|
|
512 | (1) |
|
|
|
512 | (1) |
|
|
|
513 | (2) |
|
|
|
514 | (1) |
|
Institutional Review Board |
|
|
515 | (1) |
|
|
|
515 | (2) |
|
|
|
516 | (1) |
|
|
|
516 | (1) |
|
|
|
517 | (24) |
|
|
|
517 | (1) |
|
Privacy-Oriented Usage Profiles |
|
|
518 | (2) |
|
|
|
520 | (4) |
|
|
|
524 | (4) |
|
|
|
528 | (1) |
|
|
|
529 | (10) |
|
|
|
539 | (2) |
|
|
|
541 | (7) |
|
|
|
541 | (6) |
|
|
|
547 | (1) |
|
|
|
548 | (17) |
|
|
|
548 | (2) |
|
|
|
550 | (3) |
|
|
|
553 | (1) |
|
|
|
553 | (4) |
|
|
|
557 | (3) |
|
|
|
560 | (5) |
|
|
|
565 | (12) |
|
|
|
565 | (1) |
|
|
|
566 | (4) |
|
|
|
566 | (2) |
|
|
|
568 | (1) |
|
|
|
569 | (1) |
|
|
|
570 | (4) |
|
|
|
570 | (2) |
|
|
|
572 | (1) |
|
|
|
573 | (1) |
|
|
|
574 | (3) |
| Section IV Release 1.0* |
|
|
|
|
577 | (28) |
|
|
|
577 | (8) |
|
|
|
578 | (1) |
|
|
|
579 | (5) |
|
|
|
584 | (1) |
|
|
|
585 | (3) |
|
|
|
586 | (1) |
|
|
|
587 | (1) |
|
|
|
588 | (5) |
|
|
|
590 | (3) |
|
|
|
593 | (1) |
|
|
|
593 | (1) |
|
Statistical Process Control |
|
|
594 | (1) |
|
|
|
595 | (10) |
|
|
|
605 | (12) |
|
Organizing for Governance |
|
|
605 | (5) |
|
|
|
606 | (1) |
|
|
|
607 | (1) |
|
|
|
608 | (1) |
|
|
|
609 | (1) |
|
|
|
610 | (7) |
|
|
|
610 | (1) |
|
|
|
610 | (1) |
|
|
|
611 | (3) |
|
|
|
612 | (1) |
|
Source Application Feedback |
|
|
612 | (1) |
|
|
|
613 | (1) |
|
|
|
614 | (3) |
|
|
|
615 | (1) |
|
|
|
615 | (2) |
| Index |
|
617 | |