Muutke küpsiste eelistusi

E-raamat: Secondary Analysis of Electronic Health Records

  • Formaat: PDF+DRM
  • Ilmumisaeg: 09-Sep-2016
  • Kirjastus: Springer International Publishing AG
  • Keel: eng
  • ISBN-13: 9783319437422
  • Formaat - PDF+DRM
  • Hind: 4,08 €*
  • * hind on lõplik, st. muud allahindlused enam ei rakendu
  • Lisa ostukorvi
  • Lisa soovinimekirja
  • See e-raamat on mõeldud ainult isiklikuks kasutamiseks. E-raamatuid ei saa tagastada.
  • Formaat: PDF+DRM
  • Ilmumisaeg: 09-Sep-2016
  • Kirjastus: Springer International Publishing AG
  • Keel: eng
  • ISBN-13: 9783319437422

DRM piirangud

  • Kopeerimine (copy/paste):

    ei ole lubatud

  • Printimine:

    ei ole lubatud

  • Kasutamine:

    Digitaalõiguste kaitse (DRM)
    Kirjastus on väljastanud selle e-raamatu krüpteeritud kujul, mis tähendab, et selle lugemiseks peate installeerima spetsiaalse tarkvara. Samuti peate looma endale  Adobe ID Rohkem infot siin. E-raamatut saab lugeda 1 kasutaja ning alla laadida kuni 6'de seadmesse (kõik autoriseeritud sama Adobe ID-ga).

    Vajalik tarkvara
    Mobiilsetes seadmetes (telefon või tahvelarvuti) lugemiseks peate installeerima selle tasuta rakenduse: PocketBook Reader (iOS / Android)

    PC või Mac seadmes lugemiseks peate installima Adobe Digital Editionsi (Seeon tasuta rakendus spetsiaalselt e-raamatute lugemiseks. Seda ei tohi segamini ajada Adober Reader'iga, mis tõenäoliselt on juba teie arvutisse installeeritud )

    Seda e-raamatut ei saa lugeda Amazon Kindle's. 

This book trains the next generation of scientists representing different disciplines to leverage the data generated during routine patient care. It formulates a more complete lexicon of evidence-based recommendations and support shared, ethical decision making by doctors with their patients. 

Diagnostic and therapeutic technologies continue to evolve rapidly, and both individual practitioners and clinical teams face increasingly complex ethical decisions. Unfortunately, the current state of medical knowledge does not provide the guidance to make the majority of clinical decisions on the basis of evidence.

The present research infrastructure is inefficient and frequently produces unreliable results that cannot be replicated. Even randomized controlled trials (RCTs), the traditional gold standards of the research reliability hierarchy, are not without limitations. They can be costly, labor intensive, and slow, and can return results that are seldom generalizable to every patient population. Furthermore, many pertinent but unresolved clinical and medical systems issues do not seem to have attracted the interest of the research enterprise, which has come to focus instead on cellular and molecular investigations and single-agent (e.g., a drug or device) effects. For clinicians, the end result is a bit of a “data desert” when it comes to making decisions. The new research infrastructure proposed in this book will help the medical profession to make ethically sound and well informed decisions for their patients.

 

Muu info

This is an open access book, the electronic versions are freely accessible online.
Part I Setting the Stage: Rationale Behind and Challenges to Health Data Analysis
1 Objectives of the Secondary Analysis of Electronic Health Record Data
3(6)
1.1 Introduction
3(1)
1.2 Current Research Climate
3(1)
1.3 Power of the Electronic Health Record
4(1)
1.4 Pitfalls and Challenges
5(1)
1.5 Conclusion
6(3)
References
7(2)
2 Review of Clinical Databases
9(8)
2.1 Introduction
9(1)
2.2 Background
9(1)
2.3 The Medical Information Mart for Intensive Care (MIMIC) Database
10(2)
2.3.1 Included Variables
11(1)
2.3.2 Access and Interface
12(1)
2.4 PCORnet
12(1)
2.4.1 Included Variables
12(1)
2.4.2 Access and Interface
13(1)
2.5 Open NHS
13(1)
2.5.1 Included Variables
13(1)
2.5.2 Access and Interface
13(1)
2.6 Other Ongoing Research
14(3)
2.6.1 elCU---Philips
14(1)
2.6.2 VistA
14(1)
2.6.3 NSQUIP
15(1)
References
16(1)
3 Challenges and Opportunities in Secondary Analyses of Electronic Health Record Data
17(10)
3.1 Introduction
17(1)
3.2 Challenges in Secondary Analysis of Electronic Health Records Data
17(3)
3.3 Opportunities in Secondary Analysis of Electronic Health Records Data
20(1)
3.4 Secondary EHR Analyses as Alternatives to Randomized Controlled Clinical Trials
21(1)
3.5 Demonstrating the Power of Secondary EHR Analysis: Examples in Pharmacovigilance and Clinical Care
22(1)
3.6 A New Paradigm for Supporting Evidence-Based Practice and Ethical Considerations
23(4)
References
25(2)
4 Pulling It All Together: Envisioning a Data-Driven, Ideal Care System
27(16)
4.1 Use Case Examples Based on Unavoidable Medical Heterogeneity
28(1)
4.2 Clinical Workflow, Documentation, and Decisions
29(3)
4.3 Levels of Precision and Personalization
32(3)
4.4 Coordination, Communication, and Guidance Through the Clinical Labyrinth
35(1)
4.5 Safety and Quality in an ICS
36(3)
4.6 Conclusion
39(4)
References
41(2)
5 The Story of MIMIC
43(8)
5.1 The Vision
43(1)
5.2 Data Acquisition
44(2)
5.2.1 Clinical Data
44(1)
5.2.2 Physiological Data
45(1)
5.2.3 Death Data
46(1)
5.3 Data Merger and Organization
46(1)
5.4 Data Sharing
47(1)
5.5 Updating
47(1)
5.6 Support
48(1)
5.7 Lessons Learned
48(1)
5.8 Future Directions
49(2)
References
49(2)
6 Integrating Non-clinical Data with EHRs
51(10)
6.1 Introduction
51(1)
6.2 Non-clinical Factors and Determinants of Health
51(2)
6.3 Increasing Data Availability
53(1)
6.4 Integration, Application and Calibration
54(3)
6.5 A Well-Connected Empowerment
57(1)
6.6 Conclusion
58(3)
References
59(2)
7 Using EHR to Conduct Outcome and Health Services Research
61(10)
7.1 Introduction
61(1)
7.2 The Rise of EHRs in Health Services Research
62(2)
7.2.1 The EHR in Outcomes and Observational Studies
62(1)
7.2.2 The EHR as Tool to Facilitate Patient Enrollment in Prospective Trials
63(1)
7.2.3 The EHR as Tool to Study and Improve Patient Outcomes
64(1)
7.3 How to Avoid Common Pitfalls When Using EHR to Do Health Services Research
64(3)
7.3.1 Step 1: Recognize the Fallibility of the EHR
65(1)
7.3.2 Step 2: Understand Confounding, Bias, and Missing Data When Using the HER for Research
65(2)
7.4 Future Directions for the EHR and Health Services Research
67(1)
7.4.1 Ensuring Adequate Patient Privacy Protection
67(1)
7.5 Multidimensional Collaborations
67(1)
7.6 Conclusion
68(3)
References
68(3)
8 Residual Confounding Lurking in Big Data: A Source of Error
71(10)
8.1 Introduction
71(1)
8.2 Confounding Variables in Big Data
72(5)
8.2.1 The Obesity Paradox
72(1)
8.2.2 Selection Bias
73(1)
8.2.3 Uncertain Pathophysiology
74(3)
8.3 Conclusion
77(4)
References
77(4)
Part II A Cookbook: From Research Question Formulation to Validation of Findings
9 Formulating the Research Question
81(12)
9.1 Introduction
81(1)
9.2 The Clinical Scenario: Impact of Indwelling Arterial Catheters
82(1)
9.3 Turning Clinical Questions into Research Questions
82(3)
9.3.1 Study Sample
82(1)
9.3.2 Exposure
83(1)
9.3.3 Outcome
84(1)
9.4 Matching Study Design to the Research Question
85(2)
9.5 Types of Observational Research
87(2)
9.6 Choosing the Right Database
89(1)
9.7 Putting It Together
90(3)
References
91(2)
10 Defining the Patient Cohort
93(8)
10.1 Introduction
93(1)
10.2 Part 1---Theoretical Concepts
94(4)
10.2.1 Exposure and Outcome of Interest
94(1)
10.2.2 Comparison Group
95(1)
10.2.3 Building the Study Cohort
95(2)
10.2.4 Hidden Exposures
97(1)
10.2.5 Data Visualization
97(1)
10.2.6 Study Cohort Fidelity
98(1)
10.3 Part 2---Case Study: Cohort Selection
98(3)
References
100(1)
11 Data Preparation
101(14)
11.1 Introduction
101(1)
11.2 Part 1---Theoretical Concepts
102(7)
11.2.1 Categories of Hospital Data
102(1)
11.2.2 Context and Collaboration
103(1)
11.2.3 Quantitative and Qualitative Data
104(1)
11.2.4 Data Files and Databases
104(3)
11.2.5 Reproducibility
107(2)
11.3 Part 2---Practical Examples of Data Preparation
109(6)
11.3.1 MIMIC Tables
109(1)
11.3.2 SQL Basics
109(3)
11.3.3 Loins
112(1)
11.3.4 Ranking Across Rows Using a Window Function
113(1)
11.3.5 Making Queries More Manageable Using WITH
113(1)
References
114(1)
12 Data Pre-processing
115(28)
12.1 Introduction
115(1)
12.2 Part 1---Theoretical Concepts
116(5)
12.2.1 Data Cleaning
116(2)
12.2.2 Data Integration
118(1)
12.2.3 Data Transformation
119(1)
12.2.4 Data Reduction
120(1)
12.3 Part 2---Examples of Data Pre-processing in R
121(19)
12.3.1 R---The Basics
121(8)
12.3.2 Data Integration
129(3)
12.3.3 Data Transformation
132(4)
12.3.4 Data Reduction
136(4)
12.4 Conclusion
140(3)
References
141(2)
13 Missing Data
143(20)
13.1 Introduction
143(1)
13.2 Part 1---Theoretical Concepts
144(9)
13.2.1 Types of Missingness
144(2)
13.2.2 Proportion of Missing Data
146(1)
13.2.3 Dealing with Missing Data
146(6)
13.2.4 Choice of the Best Imputation Method
152(1)
13.3 Part 2---Case Study
153(8)
13.3.1 Proportion of Missing Data and Possible Reasons for Missingness
153(1)
13.3.2 Univariate Missingness Analysis
154(5)
13.3.3 Evaluating the Performance of Imputation Methods on Mortality Prediction
159(2)
13.4 Conclusion
161(2)
References
161(2)
14 Noise Versus Outliers
163(22)
14.1 Introduction
163(1)
14.2 Part 1---Theoretical Concepts
164(1)
14.3 Statistical Methods
165(3)
14.3.1 Tukey's Method
166(1)
14.3.2 Z-Score
166(1)
14.3.3 Modified Z-Score
166(1)
14.3.4 Interquartile Range with Log-Normal Distribution
167(1)
14.3.5 Ordinary and Studentized Residuals
167(1)
14.3.6 Cook's Distance
167(1)
14.3.7 Mahalanobis Distance
168(1)
14.4 Proximity Based Models
168(3)
14.4.1 k-Means
169(1)
14.4.2 k-Medoids
169(1)
14.4.3 Criteria for Outlier Detection
169(2)
14.5 Supervised Outlier Detection
171(1)
14.6 Outlier Analysis Using Expert Knowledge
171(1)
14.7 Case Study: Identification of Outliers in the Indwelling Arterial Catheter (IAC) Study
171(1)
14.8 Expert Knowledge Analysis
172(1)
14.9 Univariate Analysis
172(5)
14.10 Multivariable Analysis
177(2)
14.11 Classification of Mortality in IAC and Non-IAC Patients
179(2)
14.12 Conclusions and Summary
181(4)
Code Appendix
182(1)
References
183(2)
15 Exploratory Data Analysis
185(20)
15.1 Introduction
185(1)
15.2 Part 1---Theoretical Concepts
186(13)
15.2.1 Suggested EDA Techniques
186(1)
15.2.2 Non-graphical EDA
187(4)
15.2.3 Graphical EDA
191(8)
15.3 Part 2---Case Study
199(3)
15.3.1 Non-graphical EDA
199(1)
15.3.2 Graphical EDA
200(2)
15.4 Conclusion
202(3)
Code Appendix
202(1)
References
203(2)
16 Data Analysis
205(58)
16.1 Introduction to Data Analysis
205(5)
16.1.1 Introduction
205(1)
16.1.2 Identifying Data Types and Study Objectives
206(3)
16.1.3 Case Study Data
209(1)
16.2 Linear Regression
210(14)
16.2.1 Section Goals
210(1)
16.2.2 Introduction
210(3)
16.2.3 Model Selection
213(7)
16.2.4 Reporting and Interpreting Linear Regression
220(3)
16.2.5 Caveats and Conclusions
223(1)
16.3 Logistic Regression
224(13)
16.3.1 Section Goals
224(1)
16.3.2 Introduction
225(1)
16.3.3 2 x 2 Tables
225(2)
16.3.4 Introducing Logistic Regression
227(5)
16.3.5 Hypothesis Testing and Model Selection
232(1)
16.3.6 Confidence Intervals
233(1)
16.3.7 Prediction
234(1)
16.3.8 Presenting and Interpreting Logistic Regression Analysis
235(1)
16.3.9 Caveats and Conclusions
236(1)
16.4 Survival Analysis
237(7)
16.4.1 Section Goals
237(1)
16.4.2 Introduction
237(1)
16.4.3 Kaplan-Meier Survival Curves
238(2)
16.4.4 Cox Proportional Hazards Models
240(3)
16.4.5 Caveats and Conclusions
243(1)
16.5 Case Study and Summary
244(19)
16.5.1 Section Goals
244(1)
16.5.2 Introduction
244(6)
16.5.3 Logistic Regression Analysis
250(9)
16.5.4 Conclusion and Summary
259(2)
References
261(2)
17 Sensitivity Analysis and Model Validation
263(12)
17.1 Introduction
263(1)
17.2 Part 1---Theoretical Concepts
264(3)
17.2.1 Bias and Variance
264(1)
17.2.2 Common Evaluation Tools
265(1)
17.2.3 Sensitivity Analysis
265(1)
17.2.4 Validation
266(1)
17.3 Case Study: Examples of Validation and Sensitivity Analysis
267(3)
17.3.1 Analysis 1: Varying the Inclusion Criteria of Time to Mechanical Ventilation
267(1)
17.3.2 Analysis 2: Changing the Caliper Level for Propensity Matching
268(1)
17.3.3 Analysis 3: Hosmer-Lemeshow Test
269(1)
17.3.4 Implications for a `Failing' Model
269(1)
17.4 Conclusion
270(5)
Code Appendix
270(1)
References
271(4)
Part III Case Studies Using MIMIC
18 Trend Analysis: Evolution of Tidal Volume Over Time for Patients Receiving Invasive Mechanical Ventilation
275(10)
18.1 Introduction
275(2)
18.2 Study Dataset
277(1)
18.3 Study Pre-processing
277(1)
18.4 Study Methods
277(1)
18.5 Study Analysis
278(2)
18.6 Study Conclusions
280(1)
18.7 Next Steps
280(1)
18.8 Connections
281(4)
Code Appendix
282(1)
References
282(3)
19 Instrumental Variable Analysis of Electronic Health Records
285(10)
19.1 Introduction
285(2)
19.2 Methods
287(4)
19.2.1 Dataset
287(1)
19.2.2 Methodology
287(3)
19.2.3 Pre-processing
290(1)
19.3 Results
291(1)
19.4 Next Steps
292(1)
19.5 Conclusions
293(2)
Code Appendix
293(1)
References
293(2)
20 Mortality Prediction in the ICU Based on MIMIC-II Results from the Super ICU Learner Algorithm (SICULA) Project
295(20)
20.1 Introduction
295(2)
20.2 Dataset and Pre-preprocessing
297(2)
20.2.1 Data Collection and Patients Characteristics
297(1)
20.2.2 Patient Inclusion and Measures
297(2)
20.3 Methods
299(3)
20.3.1 Prediction Algorithms
299(2)
20.3.2 Performance Metrics
301(1)
20.4 Analysis
302(6)
20.4.1 Discrimination
302(1)
20.4.2 Calibration
303(2)
20.4.3 Super Learner Library
305(1)
20.4.4 Reclassification Tables
305(3)
20.5 Discussion
308(1)
20.6 What Are the Next Steps?
309(1)
20.7 Conclusions
309(6)
Code Appendix
310(1)
References
311(4)
21 Mortality Prediction in the ICU
315(10)
21.1 Introduction
315(1)
21.2 Study Dataset
316(1)
21.3 Pre-processing
317(1)
21.4 Methods
318(1)
21.5 Analysis
319(1)
21.6 Visualization
319(2)
21.7 Conclusions
321(1)
21.8 Next Steps
321(1)
21.9 Connections
322(3)
Code Appendix
323(1)
References
323(2)
22 Data Fusion Techniques for Early Warning of Clinical Deterioration
325(14)
22.1 Introduction
325(1)
22.2 Study Dataset
326(1)
22.3 Pre-processing
327(1)
22.4 Methods
328(2)
22.5 Analysis
330(3)
22.6 Discussion
333(2)
22.7 Conclusions
335(1)
22.8 Further Work
335(1)
22.9 Personalised Prediction of Deteriorations
336(3)
Code Appendix
337(1)
References
337(2)
23 Comparative Effectiveness: Propensity Score Analysis
339(12)
23.1 Incentives for Using Propensity Score Analysis
339(1)
23.2 Concerns for Using Propensity Score
340(1)
23.3 Different Approaches for Estimating Propensity Scores
340(1)
23.4 Using Propensity Score to Adjust for Pre-treatment Conditions
341(2)
23.5 Study Pre-processing
343(3)
23.6 Study Analysis
346(1)
23.7 Study Results
346(1)
23.8 Conclusions
347(1)
23.9 Next Steps
347(4)
Code Appendix
348(1)
References
348(3)
24 Markov Models and Cost Effectiveness Analysis: Applications in Medical Research
351(18)
24.1 Introduction
351(1)
24.2 Formalization of Common Markov Models
352(4)
24.2.1 The Markov Chain
352(1)
24.2.2 Exploring Markov Chains with Monte Carlo Simulations
353(2)
24.2.3 Markov Decision Process and Hidden Markov Models
355(1)
24.2.4 Medical Applications of Markov Models
356(1)
24.3 Basics of Health Economics
356(3)
24.3.1 The Goal of Health Economics: Maximizing Cost-Effectiveness
356(1)
24.3.2 Definitions
357(2)
24.4 Case Study: Monte Carlo Simulations of a Markov Chain for Daily Sedation Holds in Intensive Care, with Cost-Effectiveness Analysis
359(5)
24.5 Model Validation and Sensitivity Analysis for Cost-Effectiveness Analysis
364(1)
24.6 Conclusion
365(1)
24.7 Next Steps
366(3)
Code Appendix
366(1)
References
366(3)
25 Blood Pressure and the Risk of Acute Kidney Injury in the ICU: Case-Control Versus Case-Crossover Designs
369(8)
25.1 Introduction
369(1)
25.2 Methods
370(4)
25.2.1 Data Pre-processing
370(1)
25.2.2 A Case-Control Study
370(2)
25.2.3 A Case-Crossover Design
372(2)
25.3 Discussion
374(1)
25.4 Conclusions
374(3)
Code Appendix
375(1)
References
375(2)
26 Waveform Analysis to Estimate Respiratory Rate
377(14)
26.1 Introduction
377(1)
26.2 Study Dataset
378(2)
26.3 Pre-processing
380(1)
26.4 Methods
381(3)
26.5 Results
384(1)
26.6 Discussion
385(1)
26.7 Conclusions
386(1)
26.8 Further Work
386(1)
26.9 Non-contact Vital Sign Estimation
387(4)
Code Appendix
388(1)
References
389(2)
27 Signal Processing: False Alarm Reduction
391(14)
27.1 Introduction
391(2)
27.2 Study Dataset
393(1)
27.3 Study Pre-processing
394(1)
27.4 Study Methods
395(2)
27.5 Study Analysis
397(1)
27.6 Study Visualizations
398(1)
27.7 Study Conclusions
399(1)
27.8 Next Steps/Potential Follow-Up Studies
400(5)
References
401(4)
28 Improving Patient Cohort Identification Using Natural Language Processing
405(14)
28.1 Introduction
405(2)
28.2 Methods
407(3)
28.2.1 Study Dataset and Pre-processing
407(1)
28.2.2 Structured Data Extraction from MIMIC-III Tables
408(1)
28.2.3 Unstructured Data Extraction from Clinical Notes
409(1)
28.2.4 Analysis
410(1)
28.3 Results
410(3)
28.4 Discussion
413(1)
28.5 Conclusions
414(5)
Code Appendix
414(1)
References
415(4)
29 Hyperparameter Selection
419
29.1 Introduction
419(1)
29.2 Study Dataset
420(1)
29.3 Study Methods
420(3)
29.4 Study Analysis
423(1)
29.5 Study Visualizations
424(1)
29.6 Study Conclusions
425(1)
29.7 Discussion
425(1)
29.8 Conclusions
426
References
427
MIT Critical Data

MIT Critical Data consists of data scientists and clinicians from around the globe brought together by a vision to engender a data-driven healthcare system supported by clinical informatics without walls. In this ecosystem, the creation of evidence and clinical decision support tools is initiated, updated, honed and enhanced by scaling the access to and meaningful use of clinical data.

Leo Anthony Celi

Leo has practiced medicine in three continents, giving him broad perspectives in healthcare delivery. His research is on secondary analysis of electronic health records and global health informatics. He founded and co-directs Sana at the Institute for Medical Engineering and Science at the Massachusetts Institute of Technology. He also holds a faculty position at Harvard Medical School as an intensivist at the Beth Israel Deaconess Medical Center and is the clinical research director for the Laboratory of Computational Physiology at MIT.Finally, he is one of the course directors for HST.936 at MIT innovations in global health informatics and HST.953 secondary analysis of electronic health records.

 

Peter Charlton

Peter gained the degree of MEng in Engineering Science in 2010 from the University of Oxford. Since then he held a research position, working jointly with Guy's and St Thomas' NHS Foundation Trust, and King's College London. Peters research focuses on physiological monitoring of hospital patients, divided into three areas. The first area concerns the development of signal processing techniques to estimate clinical parameters from physiological signals. He has focused on unobtrusive estimation of respiratory rate for use in ambulatory settings, invasive estimation of cardiac output for use in critical care, and novel techniques for analysis of the pulse oximetry (photoplethysmogram) signal. Secondly, he is investigating the effectiveness of technologies for the acquisition of continuous and intermittent physiological measurements in ambulatory and intensive care settings. Thirdly, he is developing techniques to transform continuous monitoring data into measurements that are appropriate for real-time alerting of patient deteriorations.

Mohammad Ghassemi

Mohammad is a doctoral candidate at the Massachusetts Institute of Technology. As an undergraduate, he studied Electrical Engineering and graduated as both a Goldwater scholar and the University's Outstanding Engineer. In 2011, Mohammad received an MPhil in Information Engineering from the University of Cambridge where he was also a recipient of the Gates-Cambridge Scholarship. Since arriving at MIT, he has perused research at the interface of machine learning and medical informatics. Mohammad's doctoral focus is on signal processing and machine learning techniques in the context of multi-modal, multi-scale datasets. He has helped put together the largest collection of post-anoxic coma EEGs inthe world. In addition to his thesis work, Mohammad has worked with the Samsung corporation, and several entities across campus building smart devices including: a multi-sensor wearable that passively monitors the physiological, audio and video activity of a user to estimate a latent emotional state.

Alistair Johnson

Alistair joined the Laboratory for Computational Physiology as a postdoctoral associate in 2015. He received his B.Eng in Biomedical and Electrical Engineering at McMaster University, Canada, and subsequently read for a D.Phil in Healthcare Innovation at the University of Oxford. His thesis was titled Mortality and acuity assessment in critical care, and its focus included using machine learning techniques to predict mortality and develop new severity of illness scores for patients admitted to intensive care units. Before joining the LCP, Alistair spent a year as a research assistant at the John Radcliffe hospital in Oxford, where he worked on building early alerting models for patients post-ICU discharge. Alistairs research interests revolve around the use of data collected during routine clinical practice to improve patient care.

Matthieu Komorowski

Matthieu holds board certification in anesthesiology and critical care in both France and the UK. A former medical research fellow at the European Space Agency, he completed a Master of Research in Biomedical Engineering at Imperial College London focusing on machine learning. Dr Komorowski now pursues a PhD at Imperial College and a research fellowship in intensive care at Charing Cross Hospital in London. In his research, he combines his expertise in machine learning and critical care to generate new clinical evidence and build the next generation of clinical tools such as decision support systems, with a particular interest in septic shock, the number one killer in intensive care and the single most expensive condition treated in hospitals. 

DominicMarshall

Dominic is an Academic Foundation doctor in Oxford, United Kingdom. Dominic read Molecular and Cellular biology at the University of Bath and worked at Eli Lilly in their Alzheimers disease drug hunting research program. He pursued his medical training at Imperial College London where he was awarded the Santander Undergraduate scholarship for academic performance and ranked first overall in his graduating class. His research interests range from molecular biology to analysis of large clinical data sets and he has received non-industry grant funding to pursue the development of novel antibiotics and chemotherapeutic agents. Alongside clinical training, he is involved in a number of research projects focusing on analysis of electronic health care records.

Tristan Neumann

Tristan Naumann is a PhD candidate in Electrical Engineering and Computer Science at MIT working with Dr. Peter Szolovits in CSAILs Clinical Decision Making group. His research includes exploring relationships in complex, unstructured data using data-informed unsupervised learning techniques, and the application of natural language processing techniques in healthcare data. He has been an organizer for workshops and datathon events, which bring together participants with diverse backgrounds in order to address biomedical and clinical questions in a manner that is reliable and reproducible.





Kenneth Paik

Kenneth is a clinical informatician driving quality improvement and democratizing access through technology innovation, combining a multidisciplinary background in medicine, artificial intelligence, business management, and technology strategy.  He is a research scientist at the MIT Laboratory for Computational Physiology investigating the secondary analysis of health data and building intelligent decision support system. As the co-director of Sana, he leads programs and project driving qualityimprovement and building capacity in global health. He received his MD and MBA degrees from Georgetown University and completed fellowship training in biomedical informatics at Harvard Medical School and the Massachusetts General Hospital Laboratory for Computer Science.

Tom Joseph Pollard

Tom is a Postdoctoral Associate at the MIT Laboratory for Computational Physiology. Most recently he has been working with colleagues to release MIMIC-III, an openly-accessible critical care database. Prior to joining MIT in 2015, Tom completed his PhD at University College London, UK, where he explored models of health in critical care patients in an interdisciplinary project between the Mullard Space Science Laboratory and University College Hospital. Tom has a broad interest in how we can improve the way that critical care data is managed, shared, and analyzed for the benefit of patients. He is a Fellow of the Software Sustainability Institute.

Jesse Raffa

Jesse is a research scientist in the Lab for Computational Physiology at the Massachusetts Institute of Technology in Cambridge, USA. He received his PhD in biostatistics from the University of Waterloo (Canada) in 2013.  His primary methodological interests are related to the modeling of complex longitudinal data, latent variable models and reproducible research. In addition to his methodological contributions, he has collaborated and published over 20 academic articles with colleagues in a diverse set of areas including: infectious diseases, addiction and critical care, among others.  Jesse was the recipient of the distinguished student paper award at the Eastern North American Region International Biometric Society conference in 2013, and the new investigator of the year for the Canadian Association of HIV/AIDS Research in 2004.

Justin Salciccioli

Justin is an Academic Foundation doctor in London, United Kingdom. Originally from Toronto, Canada, Justin completed his undergraduate and graduate studies in the United States before pursuing his medical studies at Imperial College London. His research pursuits started as an undergraduate student while completing a biochemistry degree. Subsequently, he worked on clinical trials in emergency medicine and intensive care medicine at Beth Israel Deaconess Medical Center in Boston and completed a Masters degree with his thesis on Vitamin D deficiency in critically ill patients with sepsis. During this time he developed a keen interest in statistical methods and programming particularly in SAS and R. He has co-authored more than 30 peer-reviewed manuscripts and, in addition to his current clinical training, continues with his research interests on analytical methods for observational and clinical trial data as well as education in analytics for medical students and clinicians.