Mastering Large Datasets: Parallelize and Distribute Your Python Code

Mastering Large Datasets: Parallelize and Distribute Your Python Code

作者: Wolohan J. T.
出版社: Manning
出版在: 2020-01-21
ISBN-13: 9781617296239
ISBN-10: 1617296236
裝訂格式: Quality Paper - also called trade paper
總頁數: 296 頁





內容描述


With an emphasis on clarity, style, and performance, author J.T. Wolohan expertly guides you through implementing a functionally-influenced approach to Python coding. You'll get familiar with Python's functional built-ins like the functools operator and itertools modules, as well as the toolz library. Mastering Large Datasets teaches you to write easily readable, easily scalable Python code that can efficiently process large volumes of structured and unstructured data. By the end of this comprehensive guide, you'll have a solid grasp on the tools and methods that will take your code beyond the laptop and your data science career to the next level Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.


作者介紹


J.T. Wolohan is a lead data scientist at Booz Allen Hamilton and a PhD researcher at Indiana University, Bloomington, affiliated with the Department of Information and Library Science and the School of Informatics and Computing. His professional work focuses on rapid prototyping and scalable AI. His research focuses on computational analysis of social uses of language online.




相關書籍

自然語言處理 Python 進階

作者 [印度]克裡希納·巴夫薩(Krishna Bhavsar)  納雷什·庫馬爾(Naresh Kumar)普拉塔普·丹蒂(Pratap Dangeti)

2020-01-21

Google Bigquery: The Definitive Guide: Data Warehousing, Analytics, and Machine Learning at Scale

作者 Lakshmanan Valliappa Tigani Jordan

2020-01-21

不懂程式也能學會的大數據分析術 - 使用 RapidMiner

作者 黃柏崴 李童宇

2020-01-21







2
2
2