10% Discount with Use Code SAVEON10
  • Cart
  • Contact us
  • FAQ
logo8_16_194741 bk pdf
Login / Register
Wishlist
0 Compare
7 items $99.70
Menu
logo8_16_194741 bk pdf
7 items $99.70
  • Home
  • Shop
  • My account
  • Blog
  • About us
  • Contact us
  • Request an eBook
Learning Spark: Lightning-Fast Data Analytics 2nd Edition Jules S. Damji, ISBN-13: 978-1492050049
Home Computing Learning Spark: Lightning-Fast Data Analytics 2nd Edition Jules S. Damji, ISBN-13: 978-1492050049
LTE Optimization Engineering Handbook Xincheng Zhang, ISBN-13: 978-1119158974
LTE Optimization Engineering Handbook Xincheng Zhang, ISBN-13: 978-1119158974 $50.00 Original price was: $50.00.$12.08Current price is: $12.08.
Back to products
Java Foundations: Introduction to Program Design and Data Structures 5th Edition John Lewis, ISBN-13: 978-0135205976
Java Foundations: Introduction to Program Design and Data Structures 5th Edition John Lewis, ISBN-13: 978-0135205976 $50.00 Original price was: $50.00.$11.34Current price is: $11.34.

Learning Spark: Lightning-Fast Data Analytics 2nd Edition Jules S. Damji, ISBN-13: 978-1492050049

Rated 4.67 out of 5 based on 3 customer ratings
(3 customer reviews)

$50.00 Original price was: $50.00.$17.48Current price is: $17.48.

Compare
Add to wishlist
Category: Computing Tags: Brooke Wenig, Denny Lee, ISBN-13: 978-1492050049, Jules S. Damji, Learning Spark Lightning-Fast Data Analytics 2nd Edition by Jules S. Damji, Tathagata Das
Share:
  • Description
  • Reviews (3)
  • Shipping & Delivery
Description

Learning Spark: Lightning-Fast Data Analytics 2nd Edition by Jules S. Damji, ISBN-13: 978-1492050049

[PDF eBook eTextbook]

  • Publisher: ‎ O’Reilly Media; 2nd edition (August 11, 2020)
  • Language: ‎ English
  • 400 pages
  • ISBN-10: ‎ 1492050040
  • ISBN-13: ‎ 978-1492050049

Data is bigger, arrives faster, and comes in a variety of formats—and it all needs to be processed at scale for analytics or machine learning. But how can you process such varied workloads efficiently? Enter Apache Spark.

Updated to include Spark 3.0, this second edition shows data engineers and data scientists why structure and unification in Spark matters. Specifically, this book explains how to perform simple and complex data analytics and employ machine learning algorithms. Through step-by-step walk-throughs, code snippets, and notebooks, you’ll be able to:

  • Learn Python, SQL, Scala, or Java high-level Structured APIs
  • Understand Spark operations and SQL Engine
  • Inspect, tune, and debug Spark operations with Spark configurations and Spark UI
  • Connect to data sources: JSON, Parquet, CSV, Avro, ORC, Hive, S3, or Kafka
  • Perform analytics on batch and streaming data using Structured Streaming
  • Build reliable data pipelines with open source Delta Lake and Spark
  • Develop machine learning pipelines with MLlib and productionize models using MLflow

Who This Book Is For

Most developers who grapple with big data are data engineers, data scientists, or machine learning engineers. This book is aimed at those professionals who are looking to use Spark to scale their applications to handle massive amounts of data.

In particular, data engineers will learn how to use Spark’s Structured APIs to perform complex data exploration and analysis on both batch and streaming data; use Spark SQL for interactive queries; use Spark’s built-in and external data sources to read, refine, and write data in different file formats as part of their extract, transform, and load (ETL) tasks; and build reliable data lakes with Spark and the open source Delta Lake table format.

For data scientists and machine learning engineers, Spark’s MLlib library offers many common algorithms to build distributed machine learning models. We will cover how to build pipelines with MLlib, best practices for distributed machine learning, how to use Spark to scale single-node models, and how to manage and deploy these models using the open source library MLflow.

While the book is focused on learning Spark as an analytical engine for diverse workloads, we will not cover all of the languages that Spark supports. Most of the examples in the chapters are written in Scala, Python, and SQL. Where necessary, we have infused a bit of Java. For those interested in learning Spark with R, we recommend Javier Luraschi, Kevin Kuo, and Edgar Ruiz’s Mastering Spark with R (O’Reilly).

Finally, because Spark is a distributed engine, building an understanding of Spark application concepts is critical. We will guide you through how your Spark application interacts with Spark’s distributed components and how execution is decomposed into parallel tasks on a cluster. We will also cover which deployment modes are supported and in what environments.

While there are many topics we have chosen to cover, there are a few that we have opted to not focus on. These include the older low-level Resilient Distributed Dataset (RDD) APIs and GraphX, Spark’s API for graphs and graph-parallel computation. Nor have we covered advanced topics such as how to extend Spark’s Catalyst optimizer to implement your own operations, how to implement your own catalog, or how to write your own DataSource V2 data sinks and sources. Though part of Spark, these are beyond the scope of your first book on learning Spark.

Jules S. Damji is a senior developer advocate at Databricks and an MLflow contributor. He is a hands-on developer with over 20 years of experience and has worked as a software engineer at leading companies such as Sun Microsystems, Netscape, @Home, Loudcloud/Opsware, Verisign, ProQuest, and Hortonworks, building large scale distributed systems. He holds a B.Sc. and an M.Sc. in computer science and an MA in political advocacy and communication from Oregon State University, Cal State, and Johns Hopkins University, respectively.

Brooke Wenig is a machine learning practice lead at Databricks. She leads a team of data scientists who develop large-scale machine learning pipelines for customers, as well as teaching courses on distributed machine learning best practices. Previously, she was a principal data science consultant at Databricks. She holds an M.S. in computer science from UCLA with a focus on distributed machine learning.

Tathagata Das is a staff software engineer at Databricks, an Apache Spark committer, and a member of the Apache Spark Project Management Committee (PMC). He is one of the original developers of Apache Spark, the lead developer of Spark Streaming (DStreams), and is currently one of the core developers of Structured Streaming and Delta Lake. Tathagata holds an M.S. in computer science from UC Berkeley.

Denny Lee is a staff developer advocate at Databricks who has been working with Apache Spark since 0.6. He is a hands-on distributed systems and data sciences engineer with extensive experience developing internet-scale infrastructure, data platforms, and predictive analytics systems for both on-premises and cloud environments. He also has an M.S. in biomedical informatics from Oregon Health and Sciences University and has architected and implemented powerful data solutions for enterprise healthcare customers.

What makes us different?

• Instant Download

• Always Competitive Pricing

• 100% Privacy

• FREE Sample Available

• 24-7 LIVE Customer Support

Reviews (3)

3 reviews for Learning Spark: Lightning-Fast Data Analytics 2nd Edition Jules S. Damji, ISBN-13: 978-1492050049

  1. David Foster (verified owner) – September 9, 2023

    Rated 5 out of 5

    Downloaded my eBook instantly—superb!

  2. David Foster (verified owner) – March 1, 2024

    Rated 5 out of 5

    Support team was great, eBook delivered fast.

  3. Isabella Hayes (verified owner) – March 16, 2024

    Rated 4 out of 5

    Super fast, received my eBook immediately!

Add a review Cancel reply

You must be logged in to post a review.

Shipping & Delivery

You will receive the link of your eBook 30 seconds after purchase on your email (check you email or junk mail), and you can login to your account at anytime using your username to read or download your eBook.

If you have any problem or any other questions, you can email us or try the chat widget.

Visit contact us.

Related products

-64%
Security in Fixed and Wireless Networks 2nd Edition, ISBN-13: 978-1119040743
Compare

Security in Fixed and Wireless Networks 2nd Edition, ISBN-13: 978-1119040743

Computing
$50.00 Original price was: $50.00.$17.88Current price is: $17.88.
Rated 4.33 out of 5
Security in Fixed and Wireless Networks 2nd Edition, ISBN-13: 978-1119040743 [PDF eBook eTextbook]    624 pages ISBN-10: 1119040744 ISBN-13: 978-1119040743
Add to wishlist
Add to cart
Quick view
-61%
Programming Multicore and Many-core Computing Systems, ISBN-13: 978-0470936900
Compare

Programming Multicore and Many-core Computing Systems, ISBN-13: 978-0470936900

Computing
$50.00 Original price was: $50.00.$19.50Current price is: $19.50.
Rated 5.00 out of 5
Programming Multicore and Many-core Computing Systems, ISBN-13: 978-0470936900 [PDF eBook eTextbook] Series: Wiley Series on Parallel and Distributed Computing (Book
Add to wishlist
Add to cart
Quick view
-83%
The Singularity Is Near: When Humans Transcend Biology Ray Kurzweil, ISBN-13: 978-0670033843
Compare

The Singularity Is Near: When Humans Transcend Biology Ray Kurzweil, ISBN-13: 978-0670033843

Computing
$50.00 Original price was: $50.00.$8.74Current price is: $8.74.
Rated 5.00 out of 5
The Singularity Is Near: When Humans Transcend Biology by Ray Kurzweil, ISBN-13: 978-0670033843 [PDF eBook eTextbook] Publisher: ‎ The Viking
Add to wishlist
Add to cart
Quick view
-75%
Starting Out with Python 4th Edition, ISBN-13: 978-0134444321
Compare

Starting Out with Python 4th Edition, ISBN-13: 978-0134444321

Computing
$50.00 Original price was: $50.00.$12.43Current price is: $12.43.
Starting Out with Python 4th Edition, ISBN-13: 978-0134444321 [PDF eBook eTextbook]   Publisher: Pearson; 4th edition (March 6, 2017) Language:
Add to wishlist
Add to cart
Quick view
-64%
Python Crash Course 2nd Edition by Eric Matthes, ISBN-13: 978-1593279288
Compare

Python Crash Course 2nd Edition by Eric Matthes, ISBN-13: 978-1593279288

Computing
$50.00 Original price was: $50.00.$17.99Current price is: $17.99.
Rated 4.33 out of 5
Python Crash Course: A Hands-On, Project-Based Introduction to Programming by Eric Matthes, ISBN-13: 978-1593279288 [PDF eBook eTextbook] Publisher: ‎ NO
Add to wishlist
Add to cart
Quick view
-74%
Structure and Interpretation of Computer Programs 2nd Edition Harold Abelson, ISBN-13: 978-0262510875
Compare

Structure and Interpretation of Computer Programs 2nd Edition Harold Abelson, ISBN-13: 978-0262510875

Computing
$50.00 Original price was: $50.00.$12.84Current price is: $12.84.
Rated 4.33 out of 5
Structure and Interpretation of Computer Programs 2nd Edition by Harold Abelson, ISBN-13: 978-0262510875 [PDF eBook eTextbook] Publisher: ‎ The MIT
Add to wishlist
Add to cart
Quick view
-70%
Programming Logic and Design, Comprehensive by Joyce Farrell, ISBN-13: 978-1337102070
Compare

Programming Logic and Design, Comprehensive by Joyce Farrell, ISBN-13: 978-1337102070

Computing
$50.00 Original price was: $50.00.$14.99Current price is: $14.99.
Rated 4.33 out of 5
Programming Logic and Design, Comprehensive by Joyce Farrell, ISBN-13: 978-1337102070 [PDF eBook eTextbook] Publisher: ‎ Cengage Learning; 9th edition (January
Add to wishlist
Add to cart
Quick view
-80%
Python 3 for Machine Learning by Oswald Campesato, ISBN-13: 978-1683924951
Compare

Python 3 for Machine Learning by Oswald Campesato, ISBN-13: 978-1683924951

Computing
$50.00 Original price was: $50.00.$9.99Current price is: $9.99.
Rated 4.33 out of 5
Python 3 for Machine Learning by Oswald Campesato, ISBN-13: 978-1683924951  [PDF eBook eTextbook] Publisher: ‎ Mercury Learning and Information (March
Add to wishlist
Add to cart
Quick view

Free Shipping.

Via Email.

24/7 Support.

Contact Or Chat With Us.

Online Payment.

One Time Payement.

Fast Delivery.

30 Seconds After Purchase.

  • OUR COMPANY
    • BKPDF LLC
    • Email: [email protected]
    • Website: bkpdf.com
  • USEFUL LINKS
    • Home
    • Shop
    • Wishlist
    • Blog
  • OUR POLICY
    • Privacy Policy
    • Refund Policy
    • Terms & Conditions
    • DMCA
  • INFORMATIONS
    • About Us
    • FAQ
    • Contact Us
    • Request an eBook

Payment System:

BKPDF 2023 CREATED BY BKPDF LLC. PREMIUM E-COMMERCE SOLUTIONS.
  • Home
  • Shop
  • Blog
  • About us
  • Contact us
  • Request an eBook
  • Wishlist
  • Compare
  • Login / Register
Shopping cart
Close
Sign in
Close

Lost your password?

No account yet?

Create an Account
Shop
Wishlist
7 items Cart
My account