Series Foreword   xiii
Preface   xv
1 A Tutorial Introduction   1
1.1 Data Representation and Similarity   1
1.2 A Simple Pattern Recognition Algorithm   4
1.3 Some Insights From Statistical Learning Theory   6
1.4 Hyperplane Classifiers   11
1.5 Support Vector Classification   15
1.6 Support Vector Regression   17
1.7 Kernel Principal Component Analysis   19
1.8 Empirical Results and Implementations   21
I CONCEPTS AND TOOLS   23

2 Kernels   25
2.1 Product Features   26
2.2 The Representation of Similarities in Linear Spaces   29
2.3 Examples and Properties of Kernels   45
2.4 The Representation of Dissimilarities in Linear Spaces   48
2.5 Summary   55
2.6 Problems   55
3 Risk and Loss Functions   61
3.1 Loss Functions   62
3.2 Test Error and Expected Risk   65
3.3 A Statistical Perspective   68
3.4 Robust Estimators   75
3.5 Summary   83
3.6 Problems   84
4 Regularization   87
4.1 The Regularized Risk Functional   88
4.2 The Representer Theorem   89
4.3 Regularization Operators   92
4.4 Translation Invariant Kernels   96
4.5 Translation Invariant Kernels in Higher Dimensions   105
4.6 Dot Product Kernels   110
4.7 Multi-Output Regularization   113
4.8 Semiparametric Regularization   115
4.9 Coefficient Based Regularization   118
4.10 Summary   121
4.11 Problems   122
5 Elements of Statistical Learning Theory   125
5.1 Introduction   125
5.2 The Law of Large Numbers   128
5.3 When Does Learning Work: the Question of Consistency   131
5.4 Uniform Convergence and Consistency   131
5.5 How to Derive a VC Bound   134
5.6 A Model Selection Example   144
5.7 Summary   146
5.8 Problems   146
6 Optimization   149
6.1 Convex Optimization   150
6.2 Unconstrained Problems   154
6.3 Constrained Problems   165
6.4 Interior Point Methods   175
6.5 Maximum Search Problems   179
6.6 Summary   183
6.7 Problems   184
II SUPPORT VECTOR MACHINES   187

7 Pattern Recognition   189
7.1 Separating Hyperplanes   189
7.2 The Role of the Margin   192
7.3 Optimal Margin Hyperplanes   196
7.4 Nonlinear Support Vector Classifiers   200
7.5 Soft Margin Hyperplanes   204
7.6 Multi-Class Classification   211
7.7 Variations on a Theme   214
7.8 Experiments   215
7.9 Summary   222
7.10 Problems   222
8 Single-Class Problems: Quantile Estimation and Novelty Detection   227
8.1 Introduction   228
8.2 A Distribution's Support and Quantiles   229
8.3 Algorithms   230
8.4 Optimization   234
8.5 Theory   236
8.6 Discussion   241
8.7 Experiments   243
8.8 Summary   247
8.9 Problems   248
9 Regression Estimation   251
9.1 Linear Regression with Insensitive Loss Function   251
9.2 Dual Problems   254
9.3 ν-SV Regression   260
9.4 Convex Combinations and ℓ1-Norms   266
9.5 Parametric Insensitivity Models   269
9.6 Applications   272
9.7 Summary   273
9.8 Problems   274
10 Implementation   279
10.1 Tricks of the Trade   281
10.2 Sparse Greedy Matrix Approximation   288
10.3 Interior Point Algorithms   295
10.4 Subset Selection Methods   300
10.5 Sequential Minimal Optimization   305
10.6 Iterative Methods   312
10.7 Summary   327
10.8 Problems   329
11 Incorporating Invariances   333
11.1 Prior Knowledge   333
11.2 Transformation Invariance   335
11.3 The Virtual SV Method   337
11.4 Constructing Invariance Kernels   343
11.5 The Jittered SV Method   354
11.6 Summary   356
11.7 Problems   357
12 Learning Theory Revisited   359
12.1 Concentration of Measure Inequalities   360
12.2 Leave-One-Out Estimates   366
12.3 PAC-Bayesian Bounds   381
12.4 Operator-Theoretic Methods in Learning Theory   391
12.5 Summary   403
12.6 Problems   404
III KERNEL METHODS   405

13 Designing Kernels   407
13.1 Tricks for Constructing Kernels   408
13.2 String Kernels   412
13.3 Locality-Improved Kernels   414
13.4 Natural Kernels   418
13.5 Summary   423
13.6 Problems   423
14 Kernel Feature Extraction   427
14.1 Introduction   427
14.2 Kernel PCA   429
14.3 Kernel PCA Experiments   437
14.4 A Framework for Feature Extraction   442
14.5 Algorithms for Sparse KFA   447
14.6 KFA Experiments   450
14.7 Summary   451
14.8 Problems   452
15 Kernel Fisher Discriminant   457
15.1 Introduction   457
15.2 Fisher's Discriminant in Feature Space   458
15.3 Efficient Training of Kernel Fisher Discriminants   460
15.4 Probabilistic Outputs   464
15.5 Experiments   466
15.6 Summary   467
15.7 Problems   468
16 Bayesian Kernel Methods   469
16.1 Bayesics   470
16.2 Inference Methods   475
16.3 Gaussian Processes   480
16.4 Implementation of Gaussian Processes   488
16.5 Laplacian Processes   499
16.6 Relevance Vector Machines   506
16.7 Summary   511
16.8 Problems   513
17 Regularized Principal Manifolds   517
17.1 A Coding Framework   518
17.2 A Regularized Quantization Functional   522
17.3 An Algorithm for Minimizing Rreg[f]   526
17.4 Connections to Other Algorithms   529
17.5 Uniform Convergence Bounds   533
17.6 Experiments   537
17.7 Summary   539
17.8 Problems   540
18 Pre-Images and Reduced Set Methods   543
18.1 The Pre-Image Problem   544
18.2 Finding Approximate Pre-Images   547
18.3 Reduced Set Methods   552
18.4 Reduced Set Selection Methods   554
18.5 Reduced Set Construction Methods   561
18.6 Sequential Evaluation of Reduced Set Expansions   564
18.7 Summary   566
18.8 Problems   567
A Addenda   569
A.1 Data Sets   569
A.2 Proofs   572

B Mathematical Prerequisites   575
B.1 Probability   575
B.2 Linear Algebra   580
B.3 Functional Analysis   586

References   591
Index   617
Notation and Symbols   625