Muutke küpsiste eelistusi

Training Students to Extract Value from Big Data: Summary of a Workshop [Pehme köide]

  • Formaat: Paperback / softback, 66 pages, kõrgus x laius: 254x178 mm
  • Ilmumisaeg: 16-Feb-2015
  • Kirjastus: National Academies Press
  • ISBN-10: 0309314372
  • ISBN-13: 9780309314374
Teised raamatud teemal:
  • Formaat: Paperback / softback, 66 pages, kõrgus x laius: 254x178 mm
  • Ilmumisaeg: 16-Feb-2015
  • Kirjastus: National Academies Press
  • ISBN-10: 0309314372
  • ISBN-13: 9780309314374
Teised raamatud teemal:
As the availability of high-throughput data-collection technologies, such as information-sensing mobile devices, remote sensing, internet log records, and wireless sensor networks has grown, science, engineering, and business have rapidly transitioned from striving to develop information from scant data to a situation in which the challenge is now that the amount of information exceeds a human's ability to examine, let alone absorb, it. Data sets are increasingly complex, and this potentially increases the problems associated with such concerns as missing information and other quality concerns, data heterogeneity, and differing data formats.



The nation's ability to make use of data depends heavily on the availability of a workforce that is properly trained and ready to tackle high-need areas. Training students to be capable in exploiting big data requires experience with statistical analysis, machine learning, and computational infrastructure that permits the real problems associated with massive data to be revealed and, ultimately, addressed. Analysis of big data requires cross-disciplinary skills, including the ability to make modeling decisions while balancing trade-offs between optimization and approximation, all while being attentive to useful metrics and system robustness. To develop those skills in students, it is important to identify whom to teach, that is, the educational background, experience, and characteristics of a prospective data-science student; what to teach, that is, the technical and practical content that should be taught to the student; and how to teach, that is, the structure and organization of a data-science program.



Training Students to Extract Value from Big Data summarizes a workshop convened in April 2014 by the National Research Council's Committee on Applied and Theoretical Statistics to explore how best to train students to use big data. The workshop explored the need for training and curricula and coursework that should be included. One impetus for the workshop was the current fragmented view of what is meant by analysis of big data, data analytics, or data science. New graduate programs are introduced regularly, and they have their own notions of what is meant by those terms and, most important, of what students need to know to be proficient in data-intensive work. This report provides a variety of perspectives about those elements and about their integration into courses and curricula.

Table of Contents



Front Matter 1 Introduction 2 The Need for Training: Experiences and Case Studies 3 Principles for Working with Big Data 4 Courses, Curricula, and Interdisciplinary Programs 5 Shared Resources 6 Workshop Lessons References Appendixes Appendix A: Registered Workshop Participants Appendix B: Workshop Agenda Appendix C: Acronyms
1 Introduction
1(7)
Workshop Overview
2(2)
National Efforts in Big Data
4(3)
Organization of This Report
7(1)
2 The Need For Training: Experiences And Case Studies
8(5)
Training Students to Do Good with Big Data
9(1)
The Need for Training in Big Data: Experiences and Case Studies
10(3)
3 Principles For Working With Big Data
13(9)
Teaching about MapReduce
14(1)
Big Data Machine Learning---Principles for Industry
15(1)
Principles for the Data Science Process
16(3)
Principles for Working with Big Data
19(3)
4 Courses, Curricula, And Interdisciplinary Programs
22(9)
Computational Training and Data Literacy for Domain Scientists
23(2)
Data Science and Analytics Curriculum Development at Rensselaer (and the Tetherless World Constellation)
25(4)
Experience with a First Massive Online Open Course on Data Science
29(2)
5 Shared Resources
31(9)
Can Knowledge Bases Help Accelerate Science?
32(1)
Divide and Recombine for Large, Complex Data
33(3)
Yahoo's Webscope Data Sharing Program
36(1)
Resource Sharing
37(3)
6 Workshop Lessons
40(9)
Whom to Teach: Types of Students to Target in Teaching Big Data
40(2)
How to Teach: The Structure of Teaching Big Data
42(1)
What to Teach: Content in Teaching Big Data
42(2)
Parallels in Other Disciplines
44(1)
References
45(4)
APPENDIXES
A Registered Workshop Participants
49(3)
B Workshop Agenda
52(2)
C Acronyms
54