Data Science on the Google Cloud Platform: Implementing End-to-End Real-Time Data Pipelines: From Ingest to Machine Learning

Data Science on the Google Cloud Platform: Implementing End-to-End Real-Time Data Pipelines: From Ingest to Machine Learning

作者: Valliappa Lakshmanan
出版社: O'Reilly
出版在: 2018-01-23
ISBN-13: 9781491974568
ISBN-10: 1491974567
裝訂格式: Paperback
總頁數: 410 頁




內容描述


Learn how easy it is to apply sophisticated statistical and machine learning methods to real-world problems when you build on top of the Google Cloud Platform (GCP). This hands-on guide shows developers entering the data science field how to implement an end-to-end data pipeline, using statistical and machine learning methods and tools on GCP. Through the course of the book, you’ll work through a sample business decision by employing a variety of data science approaches.
Follow along by implementing these statistical and machine learning solutions in your own project on GCP, and discover how this platform provides a transformative and more collaborative way of doing data science.
You’ll learn how to:

Automate and schedule data ingest, using an App Engine application
Create and populate a dashboard in Google Data Studio
Build a real-time analysis pipeline to carry out streaming analytics
Conduct interactive data exploration with Google BigQuery
Create a Bayesian model on a Cloud Dataproc cluster
Build a logistic regression machine-learning model with Spark
Compute time-aggregate features with a Cloud Dataflow pipeline
Create a high-performing prediction model with TensorFlow
Use your deployed model as a microservice you can access from both batch and real-time pipelines




相關書籍

深度學習實踐 : 基於 Caffe 的解析

作者 薛雲峰

2018-01-23

TensorFlow 與自然語言處理應用

作者 李孟全

2018-01-23

超極制霸 -- Data Analysis Using Excel 輕鬆速成祕典

作者 Data Analyst輯委員會

2018-01-23