Muutke küpsiste eelistusi

Big Data with Hadoop MapReduce: A Classroom Approach [Kõva köide]

, , (National Institute of Technology, India)
  • Formaat: Hardback, 406 pages, kõrgus x laius: 234x156 mm, kaal: 950 g, 33 Tables, black and white; 9 Illustrations, color; 101 Illustrations, black and white
  • Ilmumisaeg: 09-Sep-2020
  • Kirjastus: Apple Academic Press Inc.
  • ISBN-10: 1771888342
  • ISBN-13: 9781771888349
Teised raamatud teemal:
  • Formaat: Hardback, 406 pages, kõrgus x laius: 234x156 mm, kaal: 950 g, 33 Tables, black and white; 9 Illustrations, color; 101 Illustrations, black and white
  • Ilmumisaeg: 09-Sep-2020
  • Kirjastus: Apple Academic Press Inc.
  • ISBN-10: 1771888342
  • ISBN-13: 9781771888349
Teised raamatud teemal:
"The authors of Big Data with Hadoop MapReduce: A Classroom Approach have framed the book to facilitate understanding big data and MapReduce by visualizing the basic terminologies and concepts. They employed over 100 illustrations and many worked-out examples to convey the concepts and methods used in big data, the inner workings of MapReduce, and single node/multi-node installation on physical/virtual machines. This book covers almost all necessary information on Hadoop MapReduce for most online certification exams. Upon completing this book, readers will find it easy to understand other big data processing tools such as Spark, Storm, etc. Ultimately, readers will be able to: Understand what big data is and the factors that are involved Understand the inner workings of MapReduce, which is essential for certification exams Learn the MapReduce program's features along its weaknesses Set up Hadoop clusters with 100s of physical/virtual machines Create a virtual machine in AWS and set up Hadoop MapReduce Write MapReduce with Eclipse in a simple way Understand other big data processing tools and their applications Understand various job positions in data science Regardless of the user's domain and expertise level in Hadoop MapReduce, this volume will broadentheir knowledge and understanding of writing MapReduce programs to process big data. The authors advise that while it is not necessary to be an expert, readers should have some minimal knowledge of working in Ubuntu, Java, and Eclipse to set up clusters and write MapReduce jobs. The authors have emphasized more on Hadoop v2 when compared to Hadoop v1, in order to meet today's trend"--

The authors provide an understanding of big data and MapReduce by clearly presenting the basic terminologies and concepts. They have employed over 100 illustrations and many worked-out examples to convey the concepts and methods used in big data, the inner workings of MapReduce, and single node/multi-node installation on physical/virtual machines. This book covers almost all the necessary information on Hadoop MapReduce for most online certification exams. Upon completing this book, readers will find it easy to understand other big data processing tools such as Spark, Storm, etc.

Ultimately, readers will be able to:

• understand what big data is and the factors that are involved

• understand the inner workings of MapReduce, which is essential for certification exams

• learn the features and weaknesses of MapReduce

• set up Hadoop clusters with 100s of physical/virtual machines

• create a virtual machine in AWS

• write MapReduce with Eclipse in a simple way

• understand other big data processing tools and their applications

Abbreviations xi
Preface xv
Dedication and Acknowledgment xvii
Introduction xix
1 Big Data
1(46)
2 Hadoop Framework
47(66)
3 Hadoop 1.2.1 Installation
113(40)
4 Hadoop Ecosystem
153(14)
5 Hadoop 2.7.0
167(30)
6 Hadoop 2.7.0 Installation
197(160)
7 Data Science
357(14)
Appendix A Public Datasets 371(4)
Appendix B MapReduce Exercise 375(8)
Appendix C Case Study: Application Development for NYSE Dataset 383(8)
Web References 391(2)
Index 393
Rathinaraja Jeyaraj is a Research Scholar in the Department of Information Technology at the National Institute of Technology Karnataka, India. He recently worked as a visiting researcher at Connected Computing and Media Processing Lab, Kyungpook National University, South Korea. His research interests include big data processing tools, cloud computing, IoT, and machine learning.

Ganeshkumar Pugalendhi, PhD, is an Assistant Professor in the Department of Information Technology, Anna University Regional Campus, Coimbatore, India. He is the resource person for delivering technical talks and seminars sponsored by various organizations, including the University Grants Commission of India, All India Council for Technical Education, Technical Education Quality Improvement Programme of Government of India, Indian Council of Medical Research, and many others. He has written two research-oriented textbooks: Data Classification Using Soft Computing and Soft Computing for Microarray Data Analysis.

Anand Paul, PhD, is an Associate Professor at the School of Computer Science and Engineering at Kyungpook National University, South Korea. He was a delegate representing South Korea for the M2M focus group in 20102012 and is serving as associate editor for the journals IEEE Access, IET Wireless Sensor Systems, ACM Applied Computing Reviews, Cyber Physical Systems, Human Behaviour and Emerging Technology, and the Journal of Platform Technology. He is the track chair for smart human computer interaction with the Association for Computing Machinery Symposium on Applied Computing 20142019, and general chair for the 8th International Conference on Orange Technology (ICOT 2020).