Thoughtful Data Science: Working with data by creating visually intuitive insights with Jupyter and Pixiedust

Thoughtful Data Science: Working with data by creating visually intuitive insights with Jupyter and Pixiedust

作者: David Taieb
出版社: Packt Publishing
出版在: 2018-07-30
ISBN-13: 9781788839969
ISBN-10: 178883996X
裝訂格式: Paperback
總頁數: 490 頁





內容描述


Approaching the practice of data science by scripting your own data pipeline and dashboardsKey FeaturesDavid teaches how to build a new data pipeline using PixiedustHow to get the most out of Jupyter notebooksThink about the data and their visualisations, before worrying about the algorithmsBook DescriptionData science has become the one scientific endeavor every business has to contend with today. We also need to learn why data algorithms work, but even more importantly, we need to be able to create new insights from our data that we can actually work with. The why is addressed in many publications today, but it is not easy to create insights such that the data scientist does not look like a mountebank creating opaque notebook code before getting to the visually compelling bits of data science: the data science process itself has to be transparent, easy to understand, and it has to be straightforward to optimise.David Taieb created Pixiedust in Python to be able to teach non-data scientists to use Jupyter notebooks, without having to slog through the considerable amount of Jupyter code required to be able to create simple and sometimes not-so-simple insights into data. It is possible to use Pixiedust by just writing a few lines in HTML and CSS, while retaining the ability to drop or remove algorithms and visualisation options, adjust the data pipeline to the requirements posed by the data or just get some very quick results. The case studies represent a carefully graded ladder of progress, ranging all the way from data mined from social media to geo-analytical data helpful in business decision making.It is, however, possible to use both Python and Scala to add features to the Pixiedust data pipeline, and ultimately, to bring the power of the Spark big data framework to the data scientist.What you will learnHow to write basic Pixiedust dashboardsBuilding your own data pipelines without writing connecting pipeline codeLearn how to use Jupyter notebooks without the painCreate compelling data visualisations in PixiedustWrite applications running on Spark, without writing Spark codeWho This Book Is ForTo produce a functioning Pixiedust dashboard, only a modicum of HMTL and CSS is required. Fluency in data interpretation and visualization is also a necessary, since this book is addressed to data professionals, e.g. business and general data analysts. The later chapters also much to offer to the budding data scientist, and to developers on a path to becoming data scientists, since they get to play with Python code running in Jupyter notebooks.




相關書籍

OpenCV 4 with Python Blueprints, Second Edition

作者 Gevorgyan Menua Mamikonyan Arsen Beyeler Michael

2018-07-30

人腦電腦黃金交叉:人工智慧終將一統世界

作者 集智俱樂部

2018-07-30

R 語言數據分析

作者 哥格利·達羅克茲(Gergely Daroczi)

2018-07-30