HBase Administration Cookbook

HBase Administration Cookbook

作者: Yifeng Jiang
出版社: Packt Publishing
出版在: 2012-08-17
ISBN-13: 9781849517140
ISBN-10: 1849517142
裝訂格式: Paperback
總頁數: 332 頁





內容描述


Master HBase configuration and administration for optimum database performance Move large amounts of data into HBase and learn how to manage it efficiently Set up HBase on the cloud, get it ready for production, and run it smoothly with high performance Maximize the ability of HBase with the Hadoop eco-system including HDFS, MapReduce, Zookeeper, and Hive In Detail As an Open Source distributed big data store, HBase scales to billions of rows, with millions of columns and sits on top of the clusters of commodity machines. If you are looking for a way to store and access a huge amount of data in real-time, then look no further than HBase. HBase Administration Cookbook provides practical examples and simple step-by-step instructions for you to administrate HBase with ease. The recipes cover a wide range of processes for managing a fully distributed, highly available HBase cluster on the cloud. Working with such a huge amount of data means that an organized and manageable process is key and this book will help you to achieve that. The recipes in this practical cookbook start from setting up a fully distributed HBase cluster and moving data into it. You will learn how to use all of the tools for day-to-day administration tasks as well as for efficiently managing and monitoring the cluster to achieve the best performance possible. Understanding the relationship between Hadoop and HBase will allow you to get the best out of HBase so the book will show you how to set up Hadoop clusters, configure Hadoop to cooperate with HBase, and tune its performance. What you will learn from this book Set up a fully distributed, highly available HBase cluster and load data into it using the normal client API or your own MapReduce job Access data in HBase via HBase Shell or Hive using its SQL-like query language Backup and restore HBase table, along with its data distribution, and move or replicate data between different HBase clusters Gather metrics then show them in graphs, monitor the cluster's status, and get notified if thresholds are exceeded Tune your kernel settings with JVM GC, Hadoop, and HBase configuration to maximize the performance Discover troubleshooting tools and tips in order to avoid the most commonly-found problems with HBase Gain optimum performance with data compression, region splits, and by manually managing compaction Learn advanced configuration and tuning for read and write-heavy clusters Approach As part of Packt's cookbook series, each recipe offers a practical, step-by-step solution to common problems found in HBase administration. Who this book is written for This book is for HBase administrators, developers, and will even help Hadoop administrators. You are not required to have HBase experience, but are expected to have a basic understanding of Hadoop and MapReduce.




相關書籍

Pro SQL Server Relational Database Design and Implementation: Best Practices for Scalability and Performance

作者 Davidson Louis

2012-08-17

Joe Celko's Trees and Hierarchies in SQL for Smarties, 2/e (Paperback)

作者 Joe Celko

2012-08-17

Getting Started with SQL: A Hands-On Approach for Beginners (Paperback)

作者 Thomas Nield

2012-08-17