Data Cleaning Pocket Primer: Pocket Primer [Paperback]

4.00/5 (4 ratings from Goodreads)

Oswald Campesato

Format: Paperback / softback, 188 pages, weight: 290 g
Series: Pocket Primer
Publication date: 07-Feb-2018
Publisher: Mercury Learning & Information
ISBN-10: 1683922174
ISBN-13: 9781683922179

Other books on this topic:

Operating systems - (Currently in store: 1 title)
Computer programming / software development - (Currently in store: 4 titles)
Databases - (Currently in store: 1 title)

Paperback
Price: 50,04 €
Delivery of the book from the publisher takes approximately 2-4 weeks
Free delivery

Permalink: https://www.kriso.ee/db/9781683922179.html

Campesato introduces a powerful, flexible, and free set of data manipulation and cleansing commands that were developed over decades in the Unix/Linux environment and are now available on any operating system with minimal setup. He writes for data scientists, data analysts, and other people who perform data cleaning tasks and have a modest knowledge of shell programming, but may be relatively new to a "bash" environment. His examples and scripts use the bash command set, but many of the concepts translate into other forms of shell scripting (ksh, sh, csh), including the concept of piping data between commands, regular expression substitution, and the sed and awk commands. Distributed in North America by Stylus Publishing and Distribution. (Annotation ©2018 Ringgold, Inc., Portland, OR, protoview.com)

As part of the best-selling Pocket Primer series, this book is an effort to give programmers sufficient knowledge of data cleaning to be able to work on their own projects. It is designed as a practical introduction to using flexible, powerful (and free) Unix / Linux shell commands to perform common data cleaning tasks. The book is packed with realistic examples and numerous commands that illustrate both the syntax and how the commands work together. Companion files with source code are available for downloading from the publisher.

Features:
- A practical introduction to using flexible, powerful (and free) Unix / Linux shell commands to perform common data cleaning tasks

- Includes the concept of piping data between commands, regular expression substitution, and the sed and awk commands

- Packed with realistic examples and numerous commands that illustrate both the syntax and how the commands work together

- Assumes the reader has no prior experience, but the topic is covered comprehensively enough to teach a pro some new tricks

- Includes companion files with all of the source code examples (download from the publisher).
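
As a rough illustration of the kind of pipeline the features above describe (piping data between commands, regular expression substitution, sed, and awk), here is a minimal sketch. It is not taken from the book or its companion files; the function name clean_records and the sample records are hypothetical and chosen only for this example.

```shell
#!/usr/bin/env bash
# Hypothetical example: tidy "name,city" records read from standard input.
clean_records() {
  # sed: trim leading/trailing whitespace and remove spaces around commas.
  # awk: keep only lines with exactly two comma-separated fields and
  #      uppercase the second field so differently cased values compare equal.
  # sort -u: sort the result and drop duplicate lines.
  sed -E 's/^[[:space:]]+//; s/[[:space:]]+$//; s/[[:space:]]*,[[:space:]]*/,/g' |
    awk -F',' 'NF == 2 { print $1 "," toupper($2) }' |
    sort -u
}

# Sample input (assumed data): stray spaces, an empty line, and a near-duplicate row.
printf '%s\n' '  alice , paris' 'bob,london  ' '' 'alice,Paris' | clean_records
```

Each stage does one small job and passes its output down the pipe, so individual steps can be swapped or removed without touching the others.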

Table of Contents:

Preface: Data Cleaning Pocket Primer
- What Is the Goal?
- Is This Book for Me and What Will I Learn?
- How Were the Code Samples Created?
- What You Need to Know for This Book
- Which bash Commands Are Excluded?
- How Do I Set Up a Command Shell?
- What Are the "Next Steps" after Finishing This Book?

About the Technical Reviewer

Chapter 1 Introduction
- What Is Unix?
- Available Shell Types
- What Is bash?
- Getting Help for bash Commands
- Navigating Around Directories
- The history Command
- Listing Filenames with the ls Command
- Displaying Contents of Files
- The cat Command
- The head and tail Commands
- The Pipe Symbol
- The fold Command
- File Ownership: Owner, Group, and World
- Hidden Files
- Handling Problematic Filenames
- Working with Environment Variables
- The env Command
- Useful Environment Variables
- Setting the PATH Environment Variable
- Specifying Aliases and Environment Variables
- Finding Executable Files
- What Are Shell Scripts?
- A Simple Shell Script
- Using a Semicolon to Separate Commands
- The printf Command and the echo Command
- The echo Command and Whitespaces
- Command Substitution ("back tick")
- Setting Environment Variables via Shell Scripts
- Sourcing or "Dotting" a Shell Script
- Working with Arrays
- Working with Nested Loops
- The paste Command
- Inserting Blank Lines with the paste Command
- The cut Command
- Working with Metacharacters
- Working with Character Classes
- The "pipe" Symbol and Multiple Commands
- A Simple Use Case
- Another Simple Use Case
- Summary

Chapter 2 Useful Commands
- The join Command
- The fold Command
- The split Command
- The sort Command
- The uniq Command
- How to Compare Files
- The od Command
- The tr Command
- A Simple Use Case
- The find Command
- The tee Command
- File Compression Commands
- The tar Command
- The cpio Command
- The gzip and gunzip Commands
- The bunzip2 Command
- The zip Command
- Commands for zip Files and bz Files
- Internal Field Separator (IFS)
- Data from a Range of Columns in a Dataset
- Working with Uneven Rows in Datasets
- Working with Functions in Shell Scripts
- Recursion and Shell Scripts
- Iterative Solutions for Factorial Values
- Summary

Chapter 3 Filtering Data with grep
- What Is the grep Command?
- Metacharacters and the grep Command
- Escaping Metacharacters with the grep Command
- Useful Options for the grep Command
- Character Classes and the grep Command
- Working with the -c Option in grep
- Matching a Range of Lines
- Using Back References in the grep Command
- Finding Empty Lines in Datasets
- Using Keys to Search Datasets
- The Backslash Character and the grep Command
- Multiple Matches in the grep Command
- The grep Command and the xargs Command
- Searching zip Files for a String
- Checking for a Unique Key Value
- Redirecting Error Messages
- The egrep Command and the fgrep Command
- Displaying "Pure" Words in a Dataset with egrep
- The fgrep Command
- A Simple Use Case
- Summary

Chapter 4 Transforming Data with sed
- What Is the sed Command?
- The sed Execution Cycle
- Matching String Patterns Using sed
- Substituting String Patterns Using sed
- Replacing Vowels from a String or a File
- Deleting Multiple Digits and Letters from a String
- Search and Replace with sed
- Datasets with Multiple Delimiters
- Useful Switches in sed
- Working with Datasets
- Printing Lines
- Character Classes and sed
- Removing Control Characters
- Counting Words in a Dataset
- Back References in sed
- Displaying Only "Pure" Words in a Dataset
- One-Line sed Commands
- Summary

Chapter 5 Doing Everything Else with awk
- The awk Command
- Built-in Variables That Control awk
- How Does the awk Command Work?
- Aligning Text with the printf Command
- Conditional Logic and Control Statements
- The while Statement
- A for loop in awk
- A for loop with a break Statement
- The next and continue Statements
- Deleting Alternate Lines in Datasets
- Merging Lines in Datasets
- Printing File Contents as a Single Line
- Joining Groups of Lines in a Text File
- Joining Alternate Lines in a Text File
- Matching with Metacharacters and Character Sets
- Printing Lines Using Conditional Logic
- Splitting Filenames with awk
- Working with Postfix Arithmetic Operators
- Numeric Functions in awk
- One-Line awk Commands
- Useful Short awk Scripts
- Printing the Words in a Text String in awk
- Count Occurrences of a String in Specific Rows
- Printing a String in a Fixed Number of Columns
- Printing a Dataset in a Fixed Number of Columns
- Aligning Columns in Datasets
- Aligning Columns and Multiple Rows in Datasets
- Removing a Column from a Text File
- Subsets of Columns of Even Rows in Datasets
- Counting Word Frequency in Datasets
- Displaying Only "Pure" Words in a Dataset
- Working with Multiline Records in awk
- A Simple Use Case
- Another Use Case
- Summary

Appendix: Other Code Samples
- Examples for Chapter 1
- Examples for Chapter 2
- Calculating Fibonacci Numbers
- Calculating the GCD of Two Positive Integers
- Calculating the LCM of Two Positive Integers
- Calculating Prime Divisors
- Examples for Chapter 3
- Simulating Relational Data with the grep Command
- Checking Updates in a Logfile
- Examples for Chapter 4
- Examples for Chapter 5
- Processing Multiline Records
- Adding the Contents of Records
- Using the split Function in awk
- Scanning Diagonal Elements in Datasets
- Adding Values from Multiple Datasets (1)
- Adding Values from Multiple Datasets (2)
- Adding Values from Multiple Datasets (3)
- Calculating Combinations of Field Values
- Summary

Index

Oswald Campesato (San Francisco, CA) is an adjunct instructor at UC-Santa Clara and specializes in Deep Learning, Java, Android, TensorFlow, and NLP. He is the author/co-author of over twenty-five books including TensorFlow 2 Pocket Primer, Python 3 for Machine Learning, and the NLP Using R Pocket Primer (all Mercury Learning and Information).
