Introduction |
|
xix | |
|
Chapter 1 Hadoop Introduction |
|
|
1 | (14) |
|
Business Analytics and Big Data |
|
|
2 | (1) |
|
|
2 | (1) |
|
The Distributed File System (HDFS) |
|
|
2 | (1) |
|
|
3 | (1) |
|
|
4 | (1) |
|
|
4 | (1) |
|
|
5 | (1) |
|
Integration with Other Systems |
|
|
6 | (7) |
|
|
7 | (2) |
|
Data Integration and Hadoop |
|
|
9 | (4) |
|
|
13 | (2) |
|
|
15 | (32) |
|
|
16 | (10) |
|
|
16 | (3) |
|
|
19 | (3) |
|
|
22 | (4) |
|
Setting Up the HDFS Cluster in Distributed Mode |
|
|
26 | (4) |
|
|
26 | (4) |
|
Advanced Features of HDFS |
|
|
30 | (11) |
|
|
30 | (2) |
|
|
32 | (5) |
|
|
37 | (2) |
|
|
39 | (2) |
|
|
41 | (3) |
|
|
44 | (1) |
|
|
45 | (2) |
|
|
47 | (20) |
|
Basics of Hadoop MapReduce |
|
|
47 | (7) |
|
|
48 | (2) |
|
|
50 | (4) |
|
How to Launch a MapReduce Job |
|
|
54 | (6) |
|
|
55 | (1) |
|
|
56 | (1) |
|
|
57 | (2) |
|
|
59 | (1) |
|
Advanced Features of MapReduce |
|
|
60 | (4) |
|
|
60 | (2) |
|
|
62 | (1) |
|
|
63 | (1) |
|
The Difference from a Spark Job |
|
|
64 | (1) |
|
|
65 | (2) |
|
Chapter 4 User Experience |
|
|
67 | (22) |
|
|
68 | (8) |
|
|
69 | (1) |
|
|
70 | (3) |
|
|
73 | (2) |
|
|
75 | (1) |
|
|
76 | (3) |
|
|
76 | (1) |
|
|
77 | (2) |
|
|
79 | (1) |
|
|
79 | (2) |
|
|
80 | (1) |
|
|
81 | (7) |
|
|
82 | (2) |
|
|
84 | (1) |
|
|
85 | (3) |
|
|
88 | (1) |
|
|
88 | (1) |
|
Chapter 5 Integration with Other Systems |
|
|
89 | (20) |
|
|
90 | (3) |
|
|
90 | (3) |
|
|
93 | (4) |
|
|
93 | (4) |
|
|
97 | (5) |
|
|
98 | (2) |
|
|
100 | (1) |
|
|
101 | (1) |
|
|
102 | (5) |
|
|
103 | (2) |
|
|
105 | (1) |
|
|
105 | (2) |
|
|
107 | (2) |
|
Chapter 6 Hadoop Security |
|
|
109 | (32) |
|
Securing the Hadoop Cluster |
|
|
110 | (14) |
|
|
110 | (2) |
|
Authentication Using Kerberos |
|
|
112 | (4) |
|
Service Level Authorization in Hadoop |
|
|
116 | (3) |
|
|
119 | (2) |
|
Securing the HTTP Channel |
|
|
121 | (3) |
|
|
124 | (10) |
|
|
125 | (1) |
|
Bringing Data to the Cluster |
|
|
125 | (4) |
|
Protecting Data in the Cluster |
|
|
129 | (5) |
|
|
134 | (4) |
|
|
134 | (1) |
|
Application Submission in YARN |
|
|
134 | (4) |
|
|
138 | (3) |
|
Chapter 7 Ecosystem at Large: Hadoop with Apache Bigtop |
|
|
141 | (20) |
|
|
142 | (2) |
|
|
142 | (1) |
|
|
143 | (1) |
|
|
143 | (1) |
|
Developing a Custom-Tailored Stack |
|
|
144 | (5) |
|
Apache Bigtop: The History |
|
|
144 | (1) |
|
Apache Bigtop: The Concept and Philosophy |
|
|
145 | (1) |
|
The Structure of the Project |
|
|
146 | (1) |
|
|
147 | (1) |
|
Toolchain and Development Environment |
|
|
148 | (1) |
|
|
148 | (1) |
|
|
149 | (5) |
|
|
149 | (1) |
|
Master-less Puppet Deployment of a Cluster |
|
|
150 | (2) |
|
Configuration Management with Puppet |
|
|
152 | (2) |
|
|
154 | (5) |
|
iTests and Validation Applications |
|
|
154 | (1) |
|
Stack Integration Test Development |
|
|
155 | (2) |
|
|
157 | (1) |
|
|
158 | (1) |
|
|
158 | (1) |
|
|
159 | (1) |
|
|
159 | (2) |
|
Chapter 8 In-Memory Computing in Hadoop Stack |
|
|
161 | (22) |
|
Introduction to In-Memory Computing |
|
|
162 | (2) |
|
Apache Ignite: Memory First |
|
|
164 | (6) |
|
System Architecture of Apache Ignite |
|
|
165 | (1) |
|
|
165 | (2) |
|
A Discourse on High Availability |
|
|
167 | (1) |
|
|
168 | (1) |
|
|
169 | (1) |
|
|
169 | (1) |
|
|
170 | (1) |
|
Legacy Hadoop Acceleration with Ignite |
|
|
170 | (5) |
|
Benefits of In-Memory Storage |
|
|
171 | (1) |
|
Memory Filesystem: HDFS Caching |
|
|
171 | (1) |
|
|
172 | (3) |
|
Advanced Use of Apache Ignite |
|
|
175 | (6) |
|
|
175 | (1) |
|
|
176 | (1) |
|
|
177 | (1) |
|
|
178 | (2) |
|
Streaming with Apache Ignite |
|
|
180 | (1) |
|
|
181 | (2) |
Glossary |
|
183 | (4) |
Index |
|
187 | |