About the Author | xv
About the Technical Reviewer | xvii
Acknowledgments | xix
Introduction | xxi
Chapter 1 Creating an Azure Data Factory Instance | 1 | (22)
| 2 | (2)
Create a Free Azure Account | 2 | (1)
| 2 | (2)
| 4 | (3)
Create an Azure Data Factory | 7 | (2)
Explore the Azure Data Factory User Experience | 9 | (3)
| 10 | (1)
| 11 | (1)
| 12 | (5)
Create a Git Repository in Azure Repos | 13 | (2)
Link the Data Factory to the Git Repository | 15 | (2)
The ADF UX as a Web-Based IDE | 17 | (2)
| 19 | (1)
| 20 | (2)
| 22 | (1)
Chapter 2 Your First Pipeline | 23 | (22)
| 23 | (5)
Create an Azure Storage Account | 23 | (3)
| 26 | (1)
| 27 | (1)
| 28 | (4)
| 32 | (8)
| 33 | (1)
| 34 | (1)
| 35 | (1)
| 36 | (1)
| 37 | (2)
| 39 | (1)
| 40 | (2)
Run the Pipeline in Debug Mode | 41 | (1)
Inspect Execution Results | 42 | (1)
| 42 | (1)
| 42 | (2)
| 44 | (1)
Chapter 3 The Copy Data Activity | 45 | (38)
Prepare an Azure SQL Database | 45 | (6)
| 46 | (3)
| 49 | (2)
Import Structured Data into Azure SQL DB | 51 | (11)
Create the Basic Pipeline | 51 | (8)
| 59 | (2)
| 61 | (1)
Map Source and Sink Schemas | 62 | (5)
Create a New Source Dataset | 63 | (1)
| 64 | (1)
| 65 | (2)
Import Semi-structured Data into Azure SQL DB | 67 | (6)
Create a JSON File Dataset | 67 | (1)
| 68 | (1)
| 68 | (1)
Set the Collection Reference | 69 | (1)
The Effect of Schema Drift | 70 | (2)
Understanding Type Conversion | 72 | (1)
Transform JSON Files into Parquet | 73 | (3)
Create a New JSON Dataset | 74 | (1)
| 74 | (1)
Create and Run the Transformation Pipeline | 75 | (1)
| 76 | (1)
| 76 | (1)
Degree of Copy Parallelism | 77 | (1)
| 77 | (1)
| 78 | (1)
Azure Data Factory User Experience (ADF UX) | 79 | (2)
| 81 | (2)
Chapter 4 Expressions | 83 | (30)
Explore the Expression Builder | 83 | (3)
| 86 | (2)
Enable Storage of Audit Information | 86 | (1)
| 86 | (1)
| 86 | (1)
| 87 | (1)
Access Activity Run Properties | 88 | (6)
| 89 | (1)
Add Stored Procedure Activity | 90 | (3)
| 93 | (1)
| 94 | (8)
| 94 | (2)
Configure the Lookup Activity | 96 | (2)
| 98 | (2)
| 100 | (1)
Update the Stored Procedure Activity | 100 | (1)
| 101 | (1)
| 102 | (4)
| 102 | (1)
| 103 | (1)
| 104 | (1)
| 105 | (1)
| 106 | (2)
| 107 | (1)
| 107 | (1)
| 108 | (1)
| 108 | (1)
| 108 | (2)
| 110 | (3)
Chapter 5 Parameters | 113 | (32)
Set Up an Azure Key Vault | 113 | (8)
| 114 | (1)
Create a Key Vault Secret | 115 | (1)
Grant Access to the Key Vault | 116 | (2)
Create a Key Vault ADF Linked Service | 118 | (1)
Create a New Storage Account Linked Service | 119 | (2)
| 121 | (6)
Create a Parameterized Dataset | 123 | (1)
Use the Parameterized Dataset | 124 | (2)
Reuse the Parameterized Dataset | 126 | (1)
Use Linked Service Parameters | 127 | (6)
Create a Parameterized Linked Service | 127 | (4)
Increase Dataset Reusability | 131 | (1)
| 132 | (1)
Why Parameterize Linked Services? | 133 | (1)
| 133 | (6)
Create a Parameterized Pipeline | 133 | (2)
Run the Parameterized Pipeline | 135 | (2)
Use the Execute Pipeline Activity | 137 | (2)
| 139 | (1)
| 139 | (1)
| 140 | (1)
| 141 | (1)
| 142 | (3)
Chapter 6 Controlling Flow | 145 | (36)
Create a Per-File Pipeline | 145 | (2)
Use Activity Dependency Conditions | 147 | (9)
Explore Dependency Condition Interactions | 149 | (3)
Understand Pipeline Outcome | 152 | (4)
| 156 | (1)
Use Conditional Activities | 157 | (10)
| 157 | (4)
| 161 | (4)
Understand the Switch Activity | 165 | (2)
| 167 | (9)
Use the Get Metadata Activity | 167 | (2)
| 169 | (3)
| 172 | (3)
Understand the Until Activity | 175 | (1)
| 176 | (1)
| 177 | (2)
| 179 | (2)
Chapter 7 Data Flows | 181 | (36)
| 181 | (21)
Enable Data Flow Debugging | 182 | (2)
Add a Data Flow Transformation | 184 | (4)
Use the Filter Transformation | 188 | (3)
Use the Lookup Transformation | 191 | (3)
Use the Derived Column Transformation | 194 | (2)
Use the Select Transformation | 196 | (1)
Use the Sink Transformation | 197 | (1)
| 198 | (4)
Maintain a Product Dimension | 202 | (10)
| 203 | (1)
Create Supporting Datasets | 203 | (1)
Build the Product Maintenance Data Flow | 204 | (6)
Execute the Dimension Data Flow | 210 | (2)
| 212 | (1)
| 212 | (2)
| 214 | (3)
Chapter 8 Integration Runtimes | 217 | (24)
Azure Integration Runtime | 217 | (7)
Inspect the AutoResolveIntegrationRuntime | 218 | (1)
Create a New Azure Integration Runtime | 219 | (2)
Use the New Azure Integration Runtime | 221 | (3)
Self-Hosted Integration Runtime | 224 | (7)
Create a Shared Data Factory | 225 | (1)
Create a Self-Hosted Integration Runtime | 225 | (1)
Link to a Self-Hosted Integration Runtime | 226 | (1)
Use the Self-Hosted Integration Runtime | 227 | (4)
Azure-SSIS Integration Runtime | 231 | (7)
Create an Azure-SSIS Integration Runtime | 231 | (3)
Deploy SSIS Packages to the Azure-SSIS IR | 234 | (2)
Run an SSIS Package in ADF | 236 | (1)
| 237 | (1)
| 238 | (1)
| 239 | (1)
| 240 | (1)
Chapter 9 Power Query in ADF | 241 | (12)
Create a Power Query Mashup | 241 | (2)
Explore the Power Query Editor | 243 | (2)
| 245 | (3)
Run the Power Query Activity | 248 | (2)
| 250 | (3)
Chapter 10 Publishing to ADF | 253 | (28)
Publish to Your Factory Instance | 254 | (3)
Trigger a Pipeline from the ADF UX | 254 | (1)
Publish Factory Resources | 255 | (1)
Inspect Published Pipeline Run Outcome | 256 | (1)
Publish to Another Data Factory | 257 | (6)
Prepare a Production Environment | 257 | (2)
Export ARM Template from Your Development Factory | 259 | (1)
Import ARM Template into Your Production Factory | 260 | (2)
Understand Deployment Parameters | 262 | (1)
Automate Publishing to Another Factory | 263 | (9)
Create a DevOps Service Connection | 264 | (1)
Create an Azure DevOps Pipeline | 265 | (5)
Trigger an Automatic Deployment | 270 | (2)
| 272 | (6)
Azure Data Factory Utilities | 274 | (1)
Publish Resources as JSON | 275 | (3)
| 278 | (3)
Chapter 11 Triggers | 281 | (26)
| 281 | (8)
Create a Schedule Trigger | 281 | (2)
| 283 | (1)
Inspect Trigger Definitions | 284 | (1)
| 285 | (1)
| 286 | (1)
| 287 | (1)
Advanced Recurrence Options | 288 | (1)
Use an Event-Based Trigger | 289 | (7)
Register the Event Grid Resource Provider | 290 | (1)
Create an Event-Based Trigger | 291 | (2)
| 293 | (2)
Trigger-Scoped System Variables | 295 | (1)
Use a Tumbling Window Trigger | 296 | (6)
| 296 | (1)
Create a Windowed Copy Pipeline | 297 | (2)
Create a Tumbling Window Trigger | 299 | (1)
| 299 | (2)
| 301 | (1)
Publishing Triggers Automatically | 302 | (1)
Triggering Pipelines Programmatically | 303 | (1)
| 303 | (1)
| 304 | (1)
| 305 | (2)
Chapter 12 Monitoring | 307 | (24)
Generate Factory Activity | 307 | (1)
| 308 | (6)
| 308 | (1)
| 309 | (2)
| 311 | (3)
| 314 | (2)
| 316 | (3)
Create a Log Analytics Workspace | 316 | (1)
Configure Diagnostic Settings | 316 | (2)
Inspect Logs in Blob Storage | 318 | (1)
Use the Log Analytics Workspace | 319 | (4)
| 319 | (2)
Use a Log Analytics Workbook | 321 | (2)
| 323 | (4)
Configure Metric-Based Alerts | 323 | (2)
Configure Log-Based Alerts | 325 | (2)
| 327 | (1)
| 327 | (1)
| 328 | (1)
| 329 | (2)
Index | 331