Machine Learning with PySpark: With Natural Language Processing and Recommender Systems

Machine Learning with PySpark: With Natural Language Processing and Recommender Systems

作者: Pramod Singh
出版社: Apress
出版在: 2018-12-15
ISBN-13: 9781484241301
ISBN-10: 1484241304
裝訂格式: Paperback
總頁數: 223 頁




內容描述


Build machine learning models, natural language processing applications, and recommender systems with PySpark to solve various business challenges. This book starts with the fundamentals of Spark and its evolution and then covers the entire spectrum of traditional machine learning algorithms along with natural language processing and recommender systems using PySpark. 
 
Machine Learning with PySpark shows you how to build supervised machine learning models such as linear regression, logistic regression, decision trees, and random forest. You’ll also see unsupervised machine learning models such as K-means and hierarchical clustering. A major portion of the book focuses on feature engineering to create useful features with PySpark to train the machine learning models. The natural language processing section covers text processing, text mining, and embedding for classification. 
 
After reading this book, you will understand how to use PySpark’s machine learning library to build and train various machine learning models. Additionally you’ll become comfortable with related PySpark components, such as data ingestion, data processing, and data analysis, that you can use to develop data-driven intelligent applications.

What You Will Learn

Build a spectrum of supervised and unsupervised machine learning algorithms
Implement machine learning algorithms with Spark MLlib libraries
Develop a recommender system with Spark MLlib libraries
Handle issues related to feature engineering, class balance, bias and variance, and cross validation for building an optimal fit model

 
Who This Book Is For 
 
Data science and machine learning professionals.




相關書籍

深度學習基礎教程

作者 阿努拉格·巴德瓦杰 楊偉 李征等

2018-12-15

Big Data Analysis with Python

作者 Ivan Marin Ankit Shukla Sarang VK

2018-12-15

機器學習概論:機器學習發展+演算法原理實務

作者 鄭捷

2018-12-15