| Preface |
|
xiii | |
| Author Bios |
|
xv | |
| Acknowledgments |
|
xvii | |
| List of Examples |
|
xix | |
| List of Figures |
|
xxiii | |
| List of Tables |
|
xxvii | |
| Section I Introduction |
|
|
Chapter 1 Introduction To Big Data Systems |
|
|
3 | (8) |
|
1.1 Introduction: Review Of Big Data Systems |
|
|
3 | (1) |
|
1.2 Understanding Big Data |
|
|
4 | (1) |
|
1.3 Type Of Data: Transactional Or Analytical |
|
|
5 | (3) |
|
1.4 Requirements And Challenges Of Big Data |
|
|
8 | (1) |
|
|
|
9 | (1) |
|
|
|
9 | (1) |
|
|
|
9 | (2) |
|
Chapter 2 Architecture And Organization Of Big Data Systems |
|
|
11 | (18) |
|
2.1 Architecture For Big Data Systems |
|
|
11 | (2) |
|
2.2 Organization Of Big Data Systems: Clusters |
|
|
13 | (5) |
|
2.3 Classification Of Clusters: Distributed Memory vs. Shared Memory |
|
|
18 | (7) |
|
|
|
25 | (1) |
|
|
|
25 | (1) |
|
|
|
26 | (3) |
|
Chapter 3 Cloud Computing For Big Data |
|
|
29 | (36) |
|
|
|
30 | (9) |
|
|
|
39 | (2) |
|
3.3 Processor Virtualization |
|
|
41 | (4) |
|
|
|
45 | (2) |
|
3.5 Virtualization Or Containerization |
|
|
47 | (1) |
|
|
|
48 | (4) |
|
|
|
52 | (1) |
|
|
|
53 | (5) |
|
|
|
58 | (1) |
|
|
|
59 | (1) |
|
|
|
59 | (6) |
| Section II Storage And Processing For Big Data |
|
|
Chapter 4 Hadoop: An Efficient Platform For Storing And Processing Big Data |
|
|
65 | (28) |
|
4.1 Requirements For Processing And Storing Big Data |
|
|
66 | (1) |
|
4.2 Hadoop - The Big Picture |
|
|
66 | (1) |
|
4.3 Hadoop Distributed File System |
|
|
67 | (5) |
|
|
|
72 | (15) |
|
|
|
87 | (3) |
|
|
|
90 | (1) |
|
|
|
90 | (1) |
|
|
|
90 | (3) |
|
Chapter 5 Enhancements In Hadoop |
|
|
93 | (24) |
|
|
|
93 | (1) |
|
|
|
94 | (4) |
|
|
|
98 | (2) |
|
|
|
100 | (3) |
|
|
|
103 | (1) |
|
|
|
104 | (1) |
|
|
|
105 | (1) |
|
|
|
106 | (5) |
|
|
|
111 | (2) |
|
|
|
113 | (1) |
|
|
|
114 | (1) |
|
|
|
114 | (3) |
|
|
|
117 | (26) |
|
6.1 Limitations Of Mapreduce |
|
|
118 | (1) |
|
6.2 Introduction To Spark |
|
|
119 | (1) |
|
|
|
120 | (6) |
|
|
|
126 | (1) |
|
|
|
127 | (5) |
|
|
|
132 | (1) |
|
|
|
133 | (5) |
|
|
|
138 | (2) |
|
|
|
140 | (1) |
|
|
|
140 | (1) |
|
|
|
140 | (3) |
|
|
|
143 | (28) |
|
|
|
144 | (1) |
|
7.2 Handling Big Data Systems - Parallel RDBMS |
|
|
144 | (4) |
|
7.3 Emergence Of NoSQL Systems |
|
|
148 | (2) |
|
|
|
150 | (5) |
|
7.5 Document-Oriented Database |
|
|
155 | (5) |
|
7.6 Column-Oriented Database |
|
|
160 | (4) |
|
|
|
164 | (4) |
|
|
|
168 | (1) |
|
|
|
168 | (1) |
|
|
|
169 | (2) |
|
|
|
171 | (12) |
|
|
|
171 | (1) |
|
8.2 Types Of Newsql Systems |
|
|
171 | (1) |
|
|
|
172 | (2) |
|
8.4 NewSQL Systems: Case Studies |
|
|
174 | (5) |
|
|
|
179 | (1) |
|
|
|
179 | (1) |
|
|
|
179 | (4) |
| Section III Networking, Security, And Privacy For Big Data |
|
|
Chapter 9 Networking For Big Data |
|
|
183 | (20) |
|
9.1 Network Architecture For Big Data Systems |
|
|
183 | (3) |
|
9.2 Challenges And Requirements |
|
|
186 | (1) |
|
9.3 Network Programmability And Software-Defined Net-Working |
|
|
187 | (5) |
|
9.4 Low-Latency And High-Speed Data Transfer |
|
|
192 | (5) |
|
9.5 Avoiding TCP Incast - Achieving Low-Latency And High-Throughput |
|
|
197 | (1) |
|
|
|
198 | (1) |
|
|
|
199 | (1) |
|
|
|
200 | (1) |
|
|
|
200 | (3) |
|
Chapter 10 Security For Big Data |
|
|
203 | (16) |
|
|
|
203 | (1) |
|
10.2 Security Requirements |
|
|
204 | (1) |
|
10.3 Security: Attack Types And Mechanisms |
|
|
205 | (3) |
|
10.4 Attack Detection And Prevention |
|
|
208 | (8) |
|
|
|
216 | (1) |
|
|
|
216 | (1) |
|
|
|
216 | (3) |
|
Chapter 11 Privacy For Big Data |
|
|
219 | (14) |
|
|
|
219 | (1) |
|
11.2 Understanding Big Data And Privacy |
|
|
220 | (1) |
|
11.3 Privacy Violations And Their Impact |
|
|
220 | (1) |
|
11.4 Types Of Privacy Violations |
|
|
221 | (3) |
|
11.5 Privacy Protection Solutions And Their Limitations |
|
|
224 | (5) |
|
|
|
229 | (1) |
|
|
|
229 | (1) |
|
|
|
229 | (4) |
| Section IV Computation For Big Data |
|
|
Chapter 12 High-Performance Computing For Big Data |
|
|
233 | (20) |
|
|
|
233 | (1) |
|
12.2 Scalability: Need For HPC |
|
|
234 | (1) |
|
12.3 Graphic Processing Unit |
|
|
235 | (4) |
|
12.4 Tensor Processing Unit |
|
|
239 | (2) |
|
12.5 High Speed Interconnects |
|
|
241 | (2) |
|
12.6 Message Passing Interface |
|
|
243 | (4) |
|
|
|
247 | (2) |
|
|
|
249 | (1) |
|
|
|
249 | (1) |
|
|
|
249 | (1) |
|
|
|
250 | (3) |
|
Chapter 13 Deep Learning With Big Data |
|
|
253 | (20) |
|
|
|
253 | (1) |
|
|
|
254 | (3) |
|
|
|
257 | (1) |
|
13.4 Types Of Deep Neural Network |
|
|
258 | (6) |
|
13.5 Big Data Applications Using Deep Learning |
|
|
264 | (4) |
|
|
|
268 | (1) |
|
|
|
268 | (1) |
|
|
|
268 | (5) |
| Section V Case Studies And Future Trends |
|
|
Chapter 14 Big Data: Case Studies And Future Trends |
|
|
273 | (10) |
|
|
|
273 | (1) |
|
14.2 Facebook Messages Application |
|
|
274 | (2) |
|
14.3 Hadoop For Real-Time Analytics |
|
|
276 | (1) |
|
14.4 Big Data Processing At Uber |
|
|
277 | (1) |
|
14.5 Big Data Processing At Linkedin |
|
|
278 | (2) |
|
14.6 Distributed Graph Processing At Google |
|
|
280 | (1) |
|
|
|
280 | (1) |
|
|
|
281 | (1) |
|
|
|
281 | (1) |
|
|
|
281 | (2) |
| Bibliography |
|
283 | (26) |
| Index |
|
309 | |