This slim volume on fault tolerance engineering for clustered computer systems examines the use of checkpointing to provide reliable backups in the case of foreseeable and statistically ever more likely node failures. The work covers application level checkpointing including compile-time and run-time schema, migrations safety, heterogeneity support, user-level checkpointing and special problems in high performance computing and virtualization. Numerous illustrations, charts and graphs are provided. Chaudhary is a professor of computer science at the University of Buffalo, Walters teaches at the University of Southern California information Sciences Institute and Jaing is a professor of computer science at Arkansas State University. Annotation ©2011 Book News, Inc., Portland, OR (booknews.com)