E-raamat: PySpark SQL Recipes: With HiveQL, Dataframe and Graphframes

5.00/5 (2 hinnangut Goodreads-ist)

Sundar Rajan Raman, Raju Kumar Mishra

Formaat: PDF+DRM
Sari: Professional and Applied Computing
Ilmumisaeg: 18-Mar-2019
Kirjastus: APress
Keel: eng
ISBN-13: 9781484243350

Teised raamatud teemal:

Formaat - PDF+DRM
Hind: 55,56 €*
* hind on lõplik, st. muud allahindlused enam ei rakendu
Lisa ostukorvi
Lisa soovinimekirja
See e-raamat on mõeldud ainult isiklikuks kasutamiseks. E-raamatuid ei saa tagastada.

Formaat: PDF+DRM
Sari: Professional and Applied Computing
Ilmumisaeg: 18-Mar-2019
Kirjastus: APress
Keel: eng
ISBN-13: 9781484243350

Teised raamatud teemal:

DRM piirangud

Kopeerimine (copy/paste):

ei ole lubatud
Printimine:

ei ole lubatud
Kasutamine:

Digitaalõiguste kaitse (DRM)
Kirjastus on väljastanud selle e-raamatu krüpteeritud kujul, mis tähendab, et selle lugemiseks peate installeerima spetsiaalse tarkvara. Samuti peate looma endale Adobe ID Rohkem infot siin. E-raamatut saab lugeda 1 kasutaja ning alla laadida kuni 6'de seadmesse (kõik autoriseeritud sama Adobe ID-ga).

Vajalik tarkvara
Mobiilsetes seadmetes (telefon või tahvelarvuti) lugemiseks peate installeerima selle tasuta rakenduse: PocketBook Reader (iOS / Android)

PC või Mac seadmes lugemiseks peate installima Adobe Digital Editionsi (Seeon tasuta rakendus spetsiaalselt e-raamatute lugemiseks. Seda ei tohi segamini ajada Adober Reader'iga, mis tõenäoliselt on juba teie arvutisse installeeritud )

Seda e-raamatut ei saa lugeda Amazon Kindle's.

Carry out data analysis with PySpark SQL, graphframes, and graph data processing using a problem-solution approach. This book provides solutions to problems related to dataframes, data manipulation summarization, and exploratory analysis. You will improve your skills in graph data analysis using graphframes and see how to optimize your PySpark SQL code.

PySpark SQL Recipes starts with recipes on creating dataframes from different types of data source, data aggregation and summarization, and exploratory data analysis using PySpark SQL. You’ll also discover how to solve problems in graph analysis using graphframes.

On completing this book, you’ll have ready-made code for all your PySpark SQL tasks, including creating dataframes using data from different file formats as well as from SQL or NoSQL databases.

What You Will Learn

Understand PySpark SQL and its advanced features
Use SQL and HiveQL with PySpark SQL
Work with structured streaming
Optimize PySpark SQL
Master graphframes and graph processing

Who This Book Is For

Data scientists, Python programmers, and SQL programmers.

Chapter 1: Introduction to PySparkSQL.
Chapter 2: Some time with
Installation.
Chapter 3: IO in PySparkSQL.
Chapter 4 : Operations on
PySparkSQL DataFrames.
Chapter 5 : Data Merging and Data Aggregation using
PySparkSQL.
Chapter 6: SQL, NoSQL and PySparkSQL.
Chapter 7: Structured
Streaming.
Chapter 8 : Optimizing PySparkSQL.
Chapter 9 : GraphFrames.

Raju Kumar Mishra has strong interests in data science and systems that have the capability of handling large amounts of data and operating complex mathematical models through computational programming. He was inspired to pursue an M. Tech in computational sciences from Indian Institute of Science in Bangalore, India. Raju primarily works in the areas of data science and its different applications. Working as a corporate trainer he has developed unique insights that help him in teaching and explaining complex ideas with ease. Raju is also a data science consultant solving complex industrial problems. He works on programming tools such as R, Python, scikit-learn, Statsmodels, Hadoop, Hive, Pig, Spark, and many others. His venture Walsoul Private Ltd provides training in data science, programming, and big data. Sundar Rajan Raman is an artificial intelligence practitioner currently working at Bank of America. He holds a Bachelor of Technology degree from the National Institute of Technology, India. Being a seasoned Java and J2EE programmer he has worked on critical applications for companies such as AT&T, Singtel, and Deutsche Bank. He is also a seasoned big data architect. His current focus is on artificial intelligence space including machine learning and deep learning.

Lisainfo e-raamatute kohta

Püsilink: https://www.kriso.ee/db/97814842433502e.html

Märksõnad:

E-raamat: PySpark SQL Recipes: With HiveQL, Dataframe and Graphframes

DRM piirangud

Kopeerimine (copy/paste):

Printimine:

Kasutamine:

Konto & seaded

Otsing

Otsingu andmebaas

Filtreeri tulemusi

Teemad E-raamatute teemad

Vali ostukorv