Muutke küpsiste eelistusi

E-raamat: Simplifying Data Engineering and Analytics with Delta: Create analytics-ready data that fuels artificial intelligence and business intelligence

  • Formaat: 334 pages
  • Ilmumisaeg: 29-Jul-2022
  • Kirjastus: Packt Publishing Limited
  • Keel: eng
  • ISBN-13: 9781801810715
Teised raamatud teemal:
  • Formaat - EPUB+DRM
  • Hind: 32,74 €*
  • * hind on lõplik, st. muud allahindlused enam ei rakendu
  • Lisa ostukorvi
  • Lisa soovinimekirja
  • See e-raamat on mõeldud ainult isiklikuks kasutamiseks. E-raamatuid ei saa tagastada.
  • Formaat: 334 pages
  • Ilmumisaeg: 29-Jul-2022
  • Kirjastus: Packt Publishing Limited
  • Keel: eng
  • ISBN-13: 9781801810715
Teised raamatud teemal:

DRM piirangud

  • Kopeerimine (copy/paste):

    ei ole lubatud

  • Printimine:

    ei ole lubatud

  • Kasutamine:

    Digitaalõiguste kaitse (DRM)
    Kirjastus on väljastanud selle e-raamatu krüpteeritud kujul, mis tähendab, et selle lugemiseks peate installeerima spetsiaalse tarkvara. Samuti peate looma endale  Adobe ID Rohkem infot siin. E-raamatut saab lugeda 1 kasutaja ning alla laadida kuni 6'de seadmesse (kõik autoriseeritud sama Adobe ID-ga).

    Vajalik tarkvara
    Mobiilsetes seadmetes (telefon või tahvelarvuti) lugemiseks peate installeerima selle tasuta rakenduse: PocketBook Reader (iOS / Android)

    PC või Mac seadmes lugemiseks peate installima Adobe Digital Editionsi (Seeon tasuta rakendus spetsiaalselt e-raamatute lugemiseks. Seda ei tohi segamini ajada Adober Reader'iga, mis tõenäoliselt on juba teie arvutisse installeeritud )

    Seda e-raamatut ei saa lugeda Amazon Kindle's. 

Explore how Delta brings reliability, performance, and governance to your data lake and all the AI and BI use cases built on top of it

Key Features

Learn Deltas core concepts and features as well as what makes it a perfect match for data engineering and analysis Solve business challenges of different industry verticals using a scenario-based approach Make optimal choices by understanding the various tradeoffs provided by Delta

Book DescriptionDelta helps you generate reliable insights at scale and simplifies architecture around data pipelines, allowing you to focus primarily on refining the use cases being worked on. This is especially important when you consider that existing architecture is frequently reused for new use cases. In this book, youll learn about the principles of distributed computing, data modeling techniques, and big data design patterns and templates that help solve end-to-end data flow problems for common scenarios and are reusable across use cases and industry verticals. Youll also learn how to recover from errors and the best practices around handling structured, semi-structured, and unstructured data using Delta. After that, youll get to grips with features such as ACID transactions on big data, disciplined schema evolution, time travel to help rewind a dataset to a different time or version, and unified batch and streaming capabilities that will help you build agile and robust data products. By the end of this Delta book, youll be able to use Delta as the foundational block for creating analytics-ready data that fuels all AI/BI use cases.What you will learn

Explore the key challenges of traditional data lakes Appreciate the unique features of Delta that come out of the box Address reliability, performance, and governance concerns using Delta Analyze the open data format for an extensible and pluggable architecture Handle multiple use cases to support BI, AI, streaming, and data discovery Discover how common data and machine learning design patterns are executed on Delta Build and deploy data and machine learning pipelines at scale using Delta

Who this book is forData engineers, data scientists, ML practitioners, BI analysts, or anyone in the data domain working with big data will be able to put their knowledge to work with this practical guide to executing pipelines and supporting diverse use cases using the Delta protocol. Basic knowledge of SQL, Python programming, and Spark is required to get the most out of this book.
Table of Contents

An Introduction to Data Engineering
Data Modeling and ETL
Delta
Unifying Batch and Streaming with Delta
Data Consolidation in Delta Lake
Solving Common Data Pattern Scenarios with Delta
Delta for Data Warehouse Use Cases
Handling Atypical Data Scenarios with Delta
Delta for Reproducible Machine Learning Pipelines
Delta for Data Products and Services
Operationalizing Data and ML Pipelines
Optimizing Cost and Performance with Delta
Managing Your Data Journey
Anindita Mahapatra is a lead solutions architect at Databricks in the data and AI space helping clients across all industry verticals reap value from their data infrastructure investments. She teaches a data engineering and analytics course at Harvard University as part of their extension school program. She has extensive big data and Hadoop consulting experience from Think Big/Teradata, prior to which she was managing the development of algorithmic app discovery and promotion for both Nokia and Microsoft stores. She holds a master's degree in liberal arts and management from Harvard Extension School, a master's in computer science from Boston University, and a bachelor's in computer science from BITS Pilani, India.