Practical Apache Spark: Using the Scala API

Authors: Subhashini Chellappan, Dharanitharan Ganesan
Publisher: Apress
Published: 2018-12-13
ISBN-13: 9781484236512
ISBN-10: 1484236513
Binding: Paperback
Pages: 280




Description


Work with Apache Spark using Scala to deploy and set up single-node, multi-node, and high-availability clusters. This book covers the main components of Spark, including Spark Core, DataFrames, Datasets and SQL, Spark Streaming, Spark MLlib, and R on Spark, with practical code snippets for each topic. Practical Apache Spark also covers the integration of Apache Spark with Kafka, with examples. You’ll follow a learn-by-doing approach: learn the concepts, practice the code snippets in Scala, and complete the assignments to get broad exposure.

On completion, you’ll understand the functional programming aspects of Scala and have hands-on experience with the various Spark components. You’ll also become familiar with machine learning algorithms and their real-time usage.
 
What You Will Learn

Discover the functional programming features of Scala
Understand the complete architecture of Spark and its components
Integrate Apache Spark with Hive and Kafka 
Use Spark SQL, DataFrames, and Datasets to process data with traditional SQL queries
Work with different machine learning concepts and libraries using Spark's MLlib packages

 
Who This Book Is For
 
Developers and professionals who deal with batch and stream data processing.




Related Books

邊緣計算與算力網絡 — 5G + AI 時代的新型算力平臺與網絡連接

Author: 雷波 et al.

2018-12-13

Test-Driven Development with React: Apply Test-Driven Development in Your Applications

Author: Qiu Juntao

2018-12-13

Spark 大數據商業實戰三部曲:內核解密|商業案例|性能調優, 2/e

Authors: 王家林, 段智華

2018-12-13