Saturday, November 23, 2024

Using HBase for Real-time Access to your Big Data

Course Name :- Using HBase for Real-time Access to your Big Data

Module 1 : Introduction to HBase

Question 1 : What are some of the key properties of HBase? Select all that apply.

  • All HBase data is stored as bytes
  • HBase can run up to 1000 queries per second at the most
  • HBase is ACID compliant across all rows and tables
  • HBase is a NoSQL technology
  • HBase is an open source Apache project

Question 2 : Which HBase component is responsible for storing the rows of a table?

  • ZooKeeper
  • Master
  • Region
  • HDFS
  • API

Question 3 : What is NOT a characteristic of an HBase table?

  • Columns are grouped into column families
  • NULL columns aren’t supported
  • Columns can have multiple timestamps
  • Columns can be added on the fly
  • Each row must have a unique row key
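
The table properties listed above can be pictured with a small in-memory model. This is only a sketch in plain Python, not the HBase API: `ToyTable`, its methods, and the sample rows are hypothetical names made up for illustration. It assumes a table keyed by row key, with cells grouped under column families, multiple timestamped versions per cell, and all values held as bytes.

```python
from collections import defaultdict


class ToyTable:
    """In-memory stand-in for an HBase table (hypothetical, not the HBase API)."""

    def __init__(self, families):
        # Column families are declared up front; qualifiers are not.
        self.families = set(families)
        # row key -> (family, qualifier) -> {timestamp: value}
        self.rows = defaultdict(lambda: defaultdict(dict))

    def put(self, row, family, qualifier, value, ts):
        if family not in self.families:
            raise KeyError(f"unknown column family: {family!r}")
        # Row keys, column names, and values are all plain bytes here.
        self.rows[row][(family, qualifier)][ts] = value

    def get(self, row, family, qualifier):
        # Return the newest version, or None if the cell was never written.
        versions = self.rows.get(row, {}).get((family, qualifier), {})
        if not versions:
            return None
        return versions[max(versions)]


table = ToyTable(families=[b"info"])
# Two versions of the same cell, distinguished by timestamp.
table.put(b"user1", b"info", b"email", b"a@example.com", ts=1)
table.put(b"user1", b"info", b"email", b"b@example.com", ts=2)
# A new qualifier added on the fly for a different row.
table.put(b"user2", b"info", b"phone", b"555-0100", ts=1)

latest = table.get(b"user1", b"info", b"email")
missing = table.get(b"user2", b"info", b"email")
```

In this model a column that was never written for a row simply has no entry, so nothing is stored in its place.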

Module 2:- HBase client API – The basics

Question 1 : Which HBase command is used to update existing data in a table?

  • Scan
  • Batch
  • Delete
  • Put
  • Get

Question 2 : The batch command allows the user to determine the order of execution. True or false?

  • True
  • False

Question 3 : Which of the following statements are true of the scan operation? Select all that apply.

  • Scanner caching is enabled by default
  • The startRow and endRow parameters are both inclusive
  • The addColumn() method can be used to restrict a scan
  • Scanning is a resource-intensive operation
  • Scan operations are used to iterate over HBase tables
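
To make the scan range semantics concrete, here is a minimal sketch in plain Python rather than the HBase client. `toy_scan` and the sample rows are hypothetical. It assumes rows are kept sorted by row key, and that the start row is inclusive while the stop row is exclusive, which is the default behaviour of an HBase scan.

```python
from bisect import bisect_left


def toy_scan(sorted_rows, start_row=None, stop_row=None):
    """Yield (row_key, value) pairs with start_row <= row_key < stop_row.

    sorted_rows must already be sorted by row key, as rows are in a region.
    """
    keys = [key for key, _ in sorted_rows]
    # First index whose key is >= start_row (start is inclusive).
    lo = 0 if start_row is None else bisect_left(keys, start_row)
    # First index whose key is >= stop_row (stop is exclusive).
    hi = len(keys) if stop_row is None else bisect_left(keys, stop_row)
    for i in range(lo, hi):
        yield sorted_rows[i]


rows = [
    (b"row1", b"a"),
    (b"row2", b"b"),
    (b"row3", b"c"),
    (b"row4", b"d"),
]

scanned_keys = [key for key, _ in toy_scan(rows, b"row2", b"row4")]
all_keys = [key for key, _ in toy_scan(rows)]
```

Here `scanned_keys` holds `row2` and `row3` only; `row4` is left out because it equals the stop row.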

Module 3 :- Client API : Administrative and Advanced Features

Question 1 : Which statement about HBase tables is incorrect?

  • Performance may suffer if a table has more than three column families
  • A table requires two descriptor classes
  • Each table must contain at least one column family
  • Everything in HBase is stored within tables
  • HColumnDescriptor is used to describe columns, not column families

Question 2 :- When using a CompareFilter, you must specify what to include as part of the scan, rather than what to exclude. True or false?

  • True
  • False

Question 3 :- What is an example of a Dedicated Filter? Select all that apply.

  • SingleColumnValueFilter
  • QualifierFilter
  • ColumnPrefixFilter
  • TimestampsFilter
  • FamilyFilter

Module 4 :- Available HBase Clients

Question 1 : Which statements accurately describe the HBase interactive clients? Select all that apply.

  • Thrift is included with HBase
  • Thrift and Avro both support C++
  • With REST, data transport is always performed in binary
  • Avro has a dynamic schema
  • REST needs to be compiled before it can run

Question 2 :- Unlike an interactive client, a batch client is used to run a large set of operations in the background. True or false?

  • True
  • False

Question 3 :- Which of the following is an example of a batch client?

  • AsyncHBase
  • Pig
  • HBql
  • JRuby
  • PyHBase

Module 5 : HBase and MapReduce Integration

Question 1 :- HBase can act both as a source and a sink of a MapReduce job.

  • False
  • True

Question 2 :- Which HBase class is responsible for splitting the source data?

  • TableOutputFormat
  • TableInputFormat
  • TableMapReduceUtil
  • TableReducer
  • TableMapper

Question 3 :- Which of the following is NOT a component of the MapReduce framework?

  • Reducer
  • Mapper
  • OutputFormat
  • InputFormat
  • All of the above are part of the MapReduce framework

Module 6 : HBase Configuration and Administration

Question 1 :- Which of the following statements accurately describe the HBase run modes? Select all that apply.

  • The standalone mode is suited for a production environment
  • The pseudo-distributed mode is used for performance evaluation
  • The standalone mode uses local file systems
  • The distributed mode is suited for a production environment
  • The distributed mode requires the HDFS

Question 2 :- Which is NOT a component of a region server?

  • StoreFile
  • MemStore
  • HFile
  • ZooKeeper
  • HLog

Question 3 :- What is an example of an operational task? Select all that apply.

  • BulkImport
  • CopyTable
  • Adding Servers
  • Node decommissioning
  • Import and export

Using HBase for Real-time Access to your Big Data Final Exam answers :-

Question 1 :- Which statements accurately describe column families in HBase? Select all that apply.

  • You aren’t required to specify any column families when declaring a table
  • Each region contains multiple column families
  • You typically want no more than two or three column families per table
  • Column families have their own compression methods
  • Column families can be defined dynamically after table creation

Question 2 :- Which of the following is NOT a component of HBase?

  • Master
  • ZooKeeper
  • Region Server
  • Region
  • Pig

Question 3 :- Which programming language is supported by Thrift?

  • PHP
  • Perl
  • Python
  • C#
  • All of the above

Question 4 :- Which HBase command is used to retrieve data from a table?

  • Scan
  • Get
  • Put
  • Batch
  • Delete

Question 5 :- The HBase Shell and the native Java API are the only available tools for interacting with HBase. True or false?

  • True
  • False

Question 6 :- Without this filter, a scan will need to check every file to see if a piece of data exists.

  • SkipFilter
  • TimestampsFilter
  • BloomFilter
  • PageFilter
  • WhileMatchFilter

Question 7 :- What are the characteristics of the Avro client? Select all that apply.

  • Avro is included with HBase
  • Data transport is performed in binary
  • Avro needs to be compiled before running
  • Avro is a batch client
  • Avro supports Python and PHP, among others

Question 8 :- Deleting an internal table in Hive automatically deletes the corresponding HBase table. True or false?

  • True
  • False

Question 9 :- What is the main purpose of an HBase Counter?

  • To count the number of column families
  • To increment column values for statistical data collection
  • To count the number of region servers
  • To count the number of regions
  • All of the above
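
As a rough illustration of what a counter does, the sketch below increments a column value atomically in plain Python. It is not the HBase API: `ToyCounterStore` and the `stats:hits` column are hypothetical. It assumes the counter is kept as an 8-byte big-endian signed integer and that a lock stands in for the server-side atomicity.

```python
import struct
import threading


class ToyCounterStore:
    """Toy model of atomic column increments (hypothetical, not the HBase API)."""

    def __init__(self):
        self._cells = {}  # (row, column) -> 8-byte big-endian value
        self._lock = threading.Lock()

    def increment(self, row, column, amount=1):
        # Read, add, and write back under one lock so that concurrent
        # callers never lose an update.
        with self._lock:
            raw = self._cells.get((row, column), struct.pack(">q", 0))
            new_value = struct.unpack(">q", raw)[0] + amount
            self._cells[(row, column)] = struct.pack(">q", new_value)
            return new_value


store = ToyCounterStore()


def record_hits(n):
    for _ in range(n):
        store.increment(b"page1", b"stats:hits")


threads = [threading.Thread(target=record_hits, args=(100,)) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

# Incrementing by zero reads the current value without changing it.
total_hits = store.increment(b"page1", b"stats:hits", 0)
```

The point of the design is that the caller never does a separate get followed by a put, so statistics such as page hits can be collected from many clients at once.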

Question 10 :- Which file is used to specify configurations for HBase, HDFS, and ZooKeeper?

  • hbase-env.sh
  • RegionServer
  • log4j.properties
  • hbase-default.xml
  • hbase-site.xml

Question 11 :- Which HBase component manages the race to add a backup master?

  • Region Server
  • HDFS
  • ZooKeeper
  • Region
  • Primary master

Question 12 :- Which component of a region server is the actual storage file of the data?

  • HRegion
  • Store
  • StoreFile
  • HLog
  • HFile

Question 13 :- When the master node is updated, which file can be used to automatically update the other nodes in the cluster?

  • hbase-env.sh
  • hbase-default.xml
  • synchbase.sh
  • hbase-site.xml
  • syncconf.sh

Question 14 :- There is a single HLog for each region server. True or false?

  • True
  • False

Question 15 :- What is the main purpose of the Write-Ahead log?

  • To flush data when the system reaches its capacity
  • To store HBase configuration details
  • To prevent data loss in the event of a system crash
  • To store performance details
  • To store HDFS configuration details
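
The idea behind a write-ahead log can be sketched in a few lines of plain Python. This is a toy model, not HBase code: `ToyRegionServer`, `recover`, and the in-memory list standing in for a durable log are all assumptions made for illustration. Each edit is appended to the log before it is applied to the in-memory store, so the store can be rebuilt by replaying the log after a crash.

```python
class ToyRegionServer:
    """Toy write-ahead logging model (hypothetical, not the HBase API)."""

    def __init__(self, wal=None):
        # The log is a plain list here; a real log would live on durable storage.
        self.wal = wal if wal is not None else []
        # Volatile in-memory buffer, lost when the process dies.
        self.memstore = {}

    def put(self, row, value):
        # Record the edit in the log first, then apply it in memory.
        self.wal.append((row, value))
        self.memstore[row] = value

    @classmethod
    def recover(cls, wal):
        # Rebuild the in-memory state by replaying the surviving log in order.
        server = cls(wal)
        for row, value in wal:
            server.memstore[row] = value
        return server


server = ToyRegionServer()
server.put(b"row1", b"v1")
server.put(b"row2", b"x")
server.put(b"row1", b"v2")

# Simulate a crash: the memstore is gone, only the log survives.
surviving_log = server.wal
del server

recovered = ToyRegionServer.recover(surviving_log)
```

After recovery, `recovered.memstore` holds the same data the crashed server had in memory, with the later write to `row1` winning.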
