Data Analysis with Python and Pyspark

Data Analysis with Python and Pyspark

作者: Rioux Jonathan
出版社: Manning
出版在: 2022-03-22
ISBN-13: 9781617297205
ISBN-10: 1617297208
裝訂格式: Quality Paper - also called trade paper
總頁數: 456 頁





內容描述


Data Analysis with Python and PySpark is a carefully engineered tutorial that helps you use PySpark to deliver your data-driven applications at any scale. When it comes to data analytics, it pays to think big. PySpark blends the powerful Spark big data processing engine with the Python programming language to provide a data analysis platform that can scale up for nearly any task. Data Analysis with Python and PySpark is your guide to delivering successful Python-driven data projects. Data Analysis with Python and PySpark is a carefully engineered tutorial that helps you use PySpark to deliver your data-driven applications at any scale. This clear and hands-on guide shows you how to enlarge your processing capabilities across multiple machines with data from any source, ranging from Hadoop-based clusters to Excel worksheets. You'll learn how to break down big analysis tasks into manageable chunks and how to choose and use the best PySpark data abstraction for your unique needs. Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.


作者介紹


As a data scientist for an engineering consultancy Jonathan Rioux uses PySpark daily. He teaches the software to data scientists, engineers, and data-savvy business analysts.




相關書籍

SPSS統計分析從入門到精通(第二版)

作者 杜琳琳 時立文 薛曉光

2022-03-22

大師帶你立即上手:機器學習+人工智慧一點也不難

作者 唐宇迪

2022-03-22

Python Flask Web 開發入門與項目實戰

作者 錢游

2022-03-22