Muutke küpsiste eelistusi

Data Classification: Algorithms and Applications [Kõva köide]

Edited by (IBM Research, Yorktown Heights, New York, USA)
Teised raamatud teemal:
Teised raamatud teemal:
Comprehensive Coverage of the Entire Area of ClassificationResearch on the problem of classification tends to be fragmented across such areas as pattern recognition, database, data mining, and machine learning. Addressing the work of these different communities in a unified way, Data Classification: Algorithms and Applications explores the underlying algorithms of classification as well as applications of classification in a variety of problem domains, including text, multimedia, social network, and biological data.This comprehensive book focuses on three primary aspects of data classification:MethodsThe book first describes common techniques used for classification, including probabilistic methods, decision trees, rule-based methods, instance-based methods, support vector machine methods, and neural networks. DomainsThe book then examines specific methods used for data domains such as multimedia, text, time-series, network, discrete sequence, and uncertain data. It also covers large data sets and data streams due to the recent importance of the big data paradigm. VariationsThe book concludes with insight on variations of the classification process. It discusses ensembles, rare-class learning, distance function learning, active learning, visual learning, transfer learning, and semi-supervised learning as well as evaluation aspects of classifiers-- This book homes in on three primary aspects of data classification: the core methods for data classification including probabilistic classification, decision trees, rule-based methods, and SVM methods; different problem domains and scenarios such as multimedia data, text data, biological data, categorical data, network data, data streams and uncertain data: and different variations of the classification problem such as ensemble methods, visual methods, transfer learning, semi-supervised methods and active learning. These advanced methods can be used to enhance the quality of the underlying classification results-- Comprehensive Coverage of the Entire Area of ClassificationResearch on the problem of classification tends to be fragmented across such areas as pattern recognition, database, data mining, and machine learning. Addressing the work of these different communities in a unified way, Data Classification: Algorithms and Applications explores the underlying algorithms of classification as well as applications of classification in a variety of problem domains, including text, multimedia, social network, and biological data.This comprehensive book focuses on three primary aspects of data classification:Methods-The book first describes common techniques used for classification, including probabilistic methods, decision trees, rule-based methods, instance-based methods, support vector machine methods, and neural networks. Domains-The book then examines specific methods used for data domains such as multimedia, text, time-series, network, discrete sequence, and uncertain data. It also covers large data sets and data streams due to the recent importance of the big data paradigm. Variations-The book concludes with insight on variations of the classification process. It discusses ensembles, rare-class learning, distance function learning, active learning, visual learning, transfer learning, and semi-supervised learning as well as evaluation aspects of classifiers.
An Introduction to Data Classification. Feature Selection for
Classification: A Review. Probabilistic Models for Classification. Decision
Trees: Theory and Algorithms. Rule-Based Classification. Instance-Based
Learning: A Survey. Support Vector Machines. Neural Networks: A Review. A
Survey of Stream Classification Algorithms. Big Data Classification. Text
Classification. Multimedia Classification. Time Series Data Classification.
Discrete Sequence Classification. Collective Classification of Network Data.
Uncertain Data Classification. Rare Class Learning. Distance Metric Learning
for Data Classification. Ensemble Learning. Semi-Supervised Learning.
Transfer Learning. Active Learning: A Survey. Visual Classification.
Evaluation of Classification Methods. Educational and Software Resources for
Data Classification. Index.
Charu C. Aggarwal is a research scientist at the IBM T.J. Watson Research Center. A fellow of the IEEE and the ACM, he is the author/editor of ten books, an associate editor of several journals, and the vice-president of the SIAM Activity Group on Data Mining. Dr. Aggarwal has published over 200 papers, has applied for or been granted over 80 patents, and has received numerous honors, including the IBM Outstanding Technical Achievement Award and EDBT 2014 Test of Time Award. His research interests include performance analysis, databases, and data mining. He earned a Ph.D. from the Massachusetts Institute of Technology.