Data Algorithms: Recipes for Scaling Up with Hadoop and Spark (Paperback)

Author: Mahmoud Parsian
Publisher: O'Reilly
Published: 2015-08-01
ISBN-13: 9781491906187
ISBN-10: 1491906189
Binding: Paperback
Pages: 778





Description


Learn the algorithms and tools you need to build MapReduce applications with Hadoop and Spark for processing gigabyte-, terabyte-, or petabyte-sized datasets on clusters of commodity hardware. With this practical book, author Mahmoud Parsian, head of the big data team at Illumina, takes you step by step through the design of machine-learning algorithms, such as Naive Bayes and Markov Chain, and shows you how to apply them to clinical and biological datasets, using MapReduce design patterns.

Apply MapReduce algorithms to clinical and biological data, such as DNA-Seq and RNA-Seq

Use the most relevant regression and analytical algorithms for different biological data types

Apply t-test, joins, top-10, and correlation algorithms using MapReduce/Hadoop and Spark




Related books

JavaScript Web Design and TensorFlow.js Artificial Intelligence Applications Textbook (JavaScript 網頁設計與 TensorFlow.js 人工智慧應用教本)

Author: 陳會安

2015-08-01

Big Data Statistical Theory (大數據統計理論; previous edition: Taught by Experts: In-Depth Statistical Theory for Big Data)

Author: 楊旭

2015-08-01

Developing Web Components with TypeScript: Native Web Development Using Thin Libraries

Author: Krause Jörg

2015-08-01