Learning Apache Drill: Query and Analyze Structured Data

Learning Apache Drill: Query and Analyze Structured Data

作者: Charles Givre Paul Rogers
出版社: O'Reilly
出版在: 2018-11-19
ISBN-13: 9781492032793
ISBN-10: 1492032794
裝訂格式: Paperback
總頁數: 332 頁





內容描述


Apache Drill enables interactive analysis of massively large datasets, allowing you to execute SQL queries against data in many different data sources—including Hadoop and MongoDB clusters, HBase, or even your local file system—and get results quickly. With this practical guide, analysts and data scientists focused on business or research applications will learn how to incorporate Drill capabilities into complex programs, including how to use Drill queries to replace some MapReduce operations in a large-scale program.
Drill committers Charles Givre and Paul Rogers provide an introduction to Drill and its ability to handle large files containing data in flexible formats with nested data structures and tables. You’ll discover how this capability fills a gap in the Hadoop ecosystem.
Additional topics show you how to:

Prepare and organize data to maximize Drill performance
Set expectations for Drill performance on different data types and volumes
Reconcile Drill’s schema-free features with schema-full JDBC and ODBC clients




相關書籍

分佈式服務架構:原理、設計與實戰

作者 李艷鵬 楊彪

2018-11-19

從零開始學 Flutter 開發

作者 譚東

2018-11-19

邊緣計算與算力網絡 — 5G + AI 時代的新型算力平臺與網絡連接

作者 雷波 等

2018-11-19