Muutke küpsiste eelistusi

Learning to Quantify 1st ed. 2023 [Pehme köide]

  • Formaat: Paperback / softback, 137 pages, kõrgus x laius: 235x155 mm, kaal: 250 g, 1 Illustrations, black and white; XVI, 137 p. 1 illus., 1 Paperback / softback
  • Sari: The Information Retrieval Series 47
  • Ilmumisaeg: 17-Mar-2023
  • Kirjastus: Springer International Publishing AG
  • ISBN-10: 3031204662
  • ISBN-13: 9783031204661
  • Pehme köide
  • Hind: 39,43 €*
  • * hind on lõplik, st. muud allahindlused enam ei rakendu
  • Tavahind: 46,39 €
  • Säästad 15%
  • Raamatu kohalejõudmiseks kirjastusest kulub orienteeruvalt 2-4 nädalat
  • Kogus:
  • Lisa ostukorvi
  • Tasuta tarne
  • Tellimisaeg 2-4 nädalat
  • Lisa soovinimekirja
  • Formaat: Paperback / softback, 137 pages, kõrgus x laius: 235x155 mm, kaal: 250 g, 1 Illustrations, black and white; XVI, 137 p. 1 illus., 1 Paperback / softback
  • Sari: The Information Retrieval Series 47
  • Ilmumisaeg: 17-Mar-2023
  • Kirjastus: Springer International Publishing AG
  • ISBN-10: 3031204662
  • ISBN-13: 9783031204661
This open access book provides an introduction and an overview of learning to quantify (a.k.a. quantification), i.e. the task of training estimators of class proportions in unlabeled data by means of supervised learning. In data science, learning to quantify is a task of its own related to classification yet different from it, since estimating class proportions by simply classifying all data and counting the labels assigned by the classifier is known to often return inaccurate (biased) class proportion estimates.





The book introduces learning to quantify by looking at the supervised learning methods that can be used to perform it, at the evaluation measures and evaluation protocols that should be used for evaluating the quality of the returned predictions, at the numerous fields of human activity in which the use of quantification techniques may provide improved results with respect to the naive use of classification techniques, and at advanced topics in quantification research.





The book is suitable to researchers, data scientists, or PhD students, who want to come up to speed with the state of the art in learning to quantify, but also to researchers wishing to apply data science technologies to fields of human activity (e.g., the social sciences, political science, epidemiology, market research) which focus on aggregate (macro) data rather than on individual (micro) data.
1 The Case for Quantification
1(18)
1.1 Class Distributions and Their Estimation
2(1)
1.2 The Suboptimality of Classify and Count
3(2)
1.3 Notational Conventions
5(1)
1.4 Quantification Problems
6(2)
1.5 Dataset Shift and Quantification
8(6)
1.5.1 Types of Dataset Shift and Their Relation to Quantification
11(3)
1.6 Quantification and Bias Mitigation
14(2)
1.7 Structure of This Book
16(3)
2 Applications of Quantification
19(14)
2.1 Improving Classification Accuracy
19(3)
2.1.1 Word Sense Disambiguation
21(1)
2.2 Fairness
22(2)
2.2.1 Improving Fairness
22(1)
2.2.2 Measuring Fairness
23(1)
2.3 Sentiment Analysis
24(1)
2.4 Social and Political Sciences
25(2)
2.5 Market Research
27(1)
2.6 Epidemiology
28(1)
2.7 Ecological Modelling
29(2)
2.8 Resource Allocation
31(2)
3 Evaluation of Quantification Algorithms
33(22)
3.1 Measures for Evaluating SLQ, BQ, and MLQ
34(11)
3.1.1 Properties of Evaluation Measures for SLQ, BQ, and MLQ
35(2)
3.1.2 Bias
37(1)
3.1.3 Absolute Error and its Variants
37(1)
3.1.4 Relative Absolute Error and its Variants
38(3)
3.1.5 Kullback-Leibler Divergence and its Variants
41(1)
3.1.6 Which Measure is the Best for SLQ?
42(3)
3.2 Measures for Evaluating OQ
45(2)
3.2.1 Earth Mover's Distance
45(1)
3.2.2 Root Normalised Order-Aware Divergence
46(1)
3.3 Measures for Evaluating Regression Quantification
47(1)
3.4 Experimental Protocols for Evaluating Quantification
48(5)
3.4.1 Natural Prevalence Protocol (NPP)
49(1)
3.4.2 Artificial Prevalence Protocol (APP)
49(1)
3.4.3 A Variant of the APP Based on the Kraemer Algorithm
50(1)
3.4.4 Should we Use the NPP or the APP?
51(2)
3.5 Model Selection in Quantification
53(2)
4 Methods for Learning to Quantify
55(32)
4.1 Maximum Likelihood Prevalence Estimation
56(1)
4.2 Aggregative Methods Based on General-Purpose Learners
57(17)
4.2.1 Classify and Count
57(1)
4.2.2 Probabilistic Classify and Count
58(1)
4.2.3 Adjusted Classify and Count
59(2)
4.2.4 Probabilistic Adjusted Classify and Count
61(2)
4.2.5 X, MAX, and Threshold@0.50
63(1)
4.2.6 Median Sweep
64(1)
4.2.7 The Ratio Estimator
64(2)
4.2.8 Mixture Models
66(3)
4.2.9 Expectation Maximisation for Quantification
69(2)
4.2.10 Class Distribution Estimation
71(1)
4.2.11 Ensemble Methods for Quantification
72(1)
4.2.12 QuaNet
73(1)
4.3 Aggregative Methods Based on Special-Purpose Learners
74(4)
4.3.1 Methods Based on Explicit Loss Minimisation
75(1)
4.3.2 Quantification Trees and Quantification Forests
76(2)
4.4 Non-Aggregative Methods
78(9)
4.4.1 The README Method
78(1)
4.4.2 The iSA Method
79(1)
4.4.3 The ReadMe2 Method
80(1)
4.4.4 The HDx Method
81(1)
4.4.5 The MMD-RKHS Method
82(1)
4.4.6 The Uncertainty-Aware Generative Model
82(1)
4.4.7 Deep Quantification Network
83(4)
5 Advanced Topics
87(16)
5.1 Ordinal Quantification
87(1)
5.2 Regression Quantification
88(2)
5.3 Cross-Lingual Quantification
90(1)
5.4 Quantification for Networked Data
91(1)
5.5 Cost Quantification
92(2)
5.6 Quantification in Data Streams
94(3)
5.7 One-Class Quantification
97(2)
5.8 Confidence Intervals for Class Prevalence Estimates
99(4)
6 The Quantification Landscape
103(18)
6.1 Historical Development
103(2)
6.1.1 The Trajectory of Quantification
103(1)
6.1.2 Shared Tasks
104(1)
6.2 Software
105(4)
6.2.1 Publicly Available Implementations
105(1)
6.2.2 QuaPy: A Comprehensive Framework for Quantification
106(3)
6.3 How Do Different Quantification Methods Fare?
109(8)
6.3.1 A Tour of Experimental Results
109(6)
6.3.2 Visualisation Tools for the Analysis of Results
115(2)
6.4 Related Tasks
117(4)
6.4.1 Links to Existing Tasks
117(2)
6.4.2 A Possible Variant of the Quantification Task
119(2)
7 The Road Ahead
121(4)
Bibliography 125(10)
Index 135
About the Author ix
About the Technical Reviewer xi
Introduction xiii
Chapter 1 Getting Started
1(1)
Linking Pages Together
1(2)
Where Are Web Pages Stored?
3(1)
What Is a URL?
3(2)
Index Pages
5(2)
HTML5
7(1)
What Is CSS?
7(2)
Hosting
9(4)
Installing Our Web Server
9(4)
Starting the Web Server
13(2)
Saving Your Web Pages
15(4)
Local Machine
15(1)
Using a Web Host
16(3)
Development Tools and Code Editors
19(4)
Lab Demo
23(5)
Lab Exercises
28(1)
Summary
29(2)
Chapter 2 Introduction to HTML
31(10)
Structure of an HTML Page
31(3)
HTML Element Structure
34(3)
Metadata
37(1)
Lab Exercises
38(1)
Summary
39(2)
Chapter 3 Getting Started with HTML
41(34)
Setting Up
42(2)
Elements for Formatting Text
44(7)
Headings
46(1)
Paragraphs
47(1)
Bold Text
47(1)
Italic Text
48(3)
Page Background Color
51(2)
Text Color
53(1)
Fonts
54(1)
HTML Entities
55(1)
Adding Images
56(2)
Understanding Image Dimensions
58(1)
Image Alignment
59(1)
Background Image
60(3)
Adding Tables
63(1)
Adding Links
64(3)
Using Images As Links
67(2)
Preserve Formatting
69(1)
Adding Lists
69(1)
Unordered List
69(1)
Ordered List
70(1)
Structuring Your Web Page
70(2)
Lab Exercises
72(1)
Summary
73(2)
Chapter 4 Cascading Style Sheets
75(52)
External CSS Files
76(2)
CSS Syntax
78(5)
Element Type Selector
80(1)
Class Selector
80(1)
ID Selector
81(1)
Universal Selector
82(1)
Grouping Selectors
82(1)
Styling Text
83(2)
Specifying Colors
85(2)
Keyword
85(1)
Hex Value
86(1)
RGB Value
86(1)
Understanding Measurement Units
87(1)
Padding, Margins, and Borders
88(2)
Layouts
90(34)
Flexbox
91(33)
Lab Exercises
124(1)
Summary
125(2)
Chapter 5 Special Effects
127(8)
Text Effects
127(2)
Rounded Image Corners
129(1)
Buttons
130(1)
Gradients
131(2)
Lab Exercises
133(1)
Summary
134(1)
Chapter 6 Multimedia
135(12)
Adding Video
135(4)
Adding Audio
139(1)
Adding Image Maps
140(5)
Lab Exercises
145(1)
Summary
146(1)
Chapter 7 HTML Forms
147(22)
Adding Forms
148(4)
Input Types
148(3)
Labels
151(1)
Submit Button
152(1)
Building a Form
152(1)
Styling a Form
153(3)
Processing the Form Data
156(6)
Configure the Web Server to Execute Scripts
156(6)
Executing the Script
162(2)
Submission Method
164(2)
Get
164(1)
Post
165(1)
Lab Exercises
166(1)
Summary
167(2)
Chapter 8 Introduction to JavaScript
169(12)
JavaScript Syntax
172(2)
Statements
172(1)
Blocks
172(1)
Identifiers
172(1)
Keywords
173(1)
Comments
173(1)
First Program
174(1)
Lab Exercise
175(4)
Summary
179(2)
Chapter 9 Content Management Systems
181(22)
Set Up WordPress on Our Server
183(16)
Web Development Frameworks
199(1)
Summary
200(3)
Appendix A HTML Element Reference 203(12)
Appendix B CSS Selector Reference 215(12)
Appendix C CSS Color Codes 227(8)
Index 235
Andrea Esuli is a tenured Senior Researcher at the Italian National Council of Research. His research interests include learning to quantify, deep learning for text analysis, cross-modal classification, technology-assisted review, and representation learning.





Alessandro Fabris is a PhD student at the University of Padova. His research interests include learning to quantify, and the fairness and bias of retrieval and classification systems.





Alejandro Moreo is a tenured Researcher at the Italian National Council of Research. His research interests include learning to quantify, deep learning for text analysis, cross-lingual text classification, authorship analysis, and representation learning.





Fabrizio Sebastiani is a tenured Director of Research at the Italian National Council of Research. His research interests include learning to quantify, cross-lingual text classification, technology-assisted review, authorship analysis, and representation learning.