Course Name: Using HBase for Real-time Access to your Big Data
Module 1: Introduction to HBase
Question 1: What are some of the key properties of HBase? Select all that apply.
- All HBase data is stored as bytes
- HBase can run up to 1000 queries per second at the most
- HBase is ACID compliant across all rows and tables
- HBase is a NoSQL technology
- HBase is an open source Apache project
Question 2: Which HBase component is responsible for storing the rows of a table?
- ZooKeeper
- Master
- Region
- HDFS
- API
Question 3: What is NOT a characteristic of an HBase table?
- Columns are grouped into column families
- NULL columns aren’t supported
- Columns can have multiple timestamps
- Columns can be added on the fly
- Each row must have a unique row key
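The table layout this question refers to can be pictured with a small in-memory model. The following is a minimal Python sketch, not HBase code: the `ToyTable` class and its `put`/`get` methods are hypothetical names chosen for illustration. It only shows the shape row key -> column family -> qualifier -> timestamped versions, with values held as bytes.

```python
from collections import defaultdict


class ToyTable:
    """Hypothetical in-memory stand-in for an HBase-style table (not the real API)."""

    def __init__(self, families):
        # Column families are declared together with the table.
        self.families = set(families)
        # row key -> (family, qualifier) -> {timestamp: value}
        self.rows = defaultdict(lambda: defaultdict(dict))

    def put(self, row, family, qualifier, value, ts):
        if family not in self.families:
            raise KeyError(f"unknown column family: {family}")
        if not isinstance(value, bytes):
            raise TypeError("values are stored as raw bytes")
        # Qualifiers need no prior declaration; each timestamp is a version.
        self.rows[row][(family, qualifier)][ts] = value

    def get(self, row, family, qualifier):
        versions = self.rows[row].get((family, qualifier))
        if not versions:
            return None  # an absent cell is simply not stored
        return versions[max(versions)]  # newest timestamp wins


table = ToyTable(families=["info"])
table.put(b"row1", "info", "name", b"alice", ts=1)
table.put(b"row1", "info", "name", b"alicia", ts=2)          # second version of one cell
table.put(b"row1", "info", "email", b"a@example.com", ts=2)  # qualifier added on the fly
```

Reading `table.get(b"row1", "info", "name")` returns the newest version, while a qualifier that was never written returns `None` without occupying any storage in the model.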
Module 2: HBase Client API – The Basics
Question 1: Which HBase command is used to update existing data in a table?
- Scan
- Batch
- Delete
- Put
- Get
Question 2: The batch command allows the user to determine the order of execution. True or false?
- True
- False
Question 3: Which of the following statements are true of the scan operation? Select all that apply.
- Scanner caching is enabled by default
- The startRow and endRow parameters are both inclusive
- The addColumn() method can be used to restrict a scan
- Scanning is a resource-intensive operation
- Scan operations are used to iterate over HBase tables
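The range and column-restriction behaviour of a scan can be sketched over a sorted list of rows. This is a toy Python model under stated assumptions, not the HBase client: `toy_scan` and its parameters are hypothetical, and it assumes the default HBase convention that the start row is inclusive and the stop row is exclusive.

```python
from bisect import bisect_left


def toy_scan(sorted_rows, start_row=None, stop_row=None, columns=None):
    """Yield (row_key, cells) for row keys in the half-open range [start_row, stop_row)."""
    keys = [key for key, _ in sorted_rows]
    lo = 0 if start_row is None else bisect_left(keys, start_row)
    hi = len(keys) if stop_row is None else bisect_left(keys, stop_row)
    for key, cells in sorted_rows[lo:hi]:
        if columns is not None:
            # Restrict the result to the requested columns only.
            cells = {name: value for name, value in cells.items() if name in columns}
        yield key, cells


# Rows must already be sorted by row key, as they are in an HBase table.
rows = [
    (b"row1", {"info:name": b"a", "info:age": b"1"}),
    (b"row2", {"info:name": b"b"}),
    (b"row3", {"info:name": b"c"}),
]
result = list(toy_scan(rows, start_row=b"row1", stop_row=b"row3", columns={"info:name"}))
```

In this model `result` contains `row1` and `row2` but not `row3`, and only the `info:name` column of each row.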
Module 3: Client API: Administrative and Advanced Features
Question 1: Which statement about HBase tables is incorrect?
- Performance may suffer if a table has more than three column families
- A table requires two descriptor classes
- Each table must contain at least one column family
- Everything in HBase is stored within tables
- HColumnDescriptor is used to describe columns, not column families
Question 2: When using a CompareFilter, you must specify what to include as part of the scan, rather than what to exclude. True or false?
- True
- False
Question 3: What is an example of a Dedicated Filter? Select all that apply.
- SingleColumnValueFilter
- QualifierFilter
- ColumnPrefixFilter
- TimestampsFilter
- FamilyFilter
Module 4: Available HBase Clients
Question 1: Which statements accurately describe the HBase interactive clients? Select all that apply.
- Thrift is included with HBase
- Thrift and Avro both support C++
- With REST, data transport is always performed in binary
- Avro has a dynamic schema
- REST needs to be compiled before it can run
Question 2: Unlike an interactive client, a batch client is used to run a large set of operations in the background. True or false?
- True
- False
Question 3: Which of the following is an example of a batch client?
- AsyncHBase
- Pig
- HBql
- JRuby
- PyHBase
Module 5: HBase and MapReduce Integration
Question 1: HBase can act both as a source and a sink of a MapReduce job. True or false?
- False
- True
Question 2: Which HBase class is responsible for splitting the source data?
- TableOutputFormat
- TableInputFormat
- TableMapReduceUtil
- TableReducer
- TableMapper
Question 3: Which of the following is NOT a component of the MapReduce framework?
- Reducer
- Mapper
- OutputFormat
- InputFormat
- All of the above are part of the MapReduce framework
Module 6: HBase Configuration and Administration
Question 1: Which of the following statements accurately describe the HBase run modes? Select all that apply.
- The standalone mode is suited for a production environment
- The pseudo-distributed mode is used for performance evaluation
- The standalone mode uses local file systems
- The distributed mode is suited for a production environment
- The distributed mode requires HDFS
Question 2: Which is NOT a component of a region server?
- StoreFile
- MemStore
- HFile
- ZooKeeper
- HLog
Question 3: What is an example of an operational task? Select all that apply.
- BulkImport
- CopyTable
- Adding Servers
- Node decommissioning
- Import and export
Using HBase for Real-time Access to your Big Data Final Exam Answers:
Question 1: Which statements accurately describe column families in HBase? Select all that apply.
- You aren’t required to specify any column families when declaring a table
- Each region contains multiple column families
- You typically want no more than two or three column families per table
- Column families have their own compression methods
- Column families can be defined dynamically after table creation
Question 2: Which of the following is NOT a component of HBase?
- Master
- ZooKeeper
- Region Server
- Region
- Pig
Question 3: Which programming language is supported by Thrift?
- PHP
- Perl
- Python
- C#
- All of the above
Question 4: Which HBase command is used to retrieve data from a table?
- Scan
- Get
- Put
- Batch
- Delete
Question 5: The HBase Shell and the native Java API are the only available tools for interacting with HBase. True or false?
- True
- False
Question 6: Without this filter, a scan will need to check every file to see if a piece of data exists.
- SkipFilter
- TimestampsFilter
- BloomFilter
- PageFilter
- WhileMatchFilter
Question 7: What are the characteristics of the Avro client? Select all that apply.
- Avro is included with HBase
- Data transport is performed in binary
- Avro needs to be compiled before running
- Avro is a batch client
- Avro supports Python and PHP, among others
Question 8: Deleting an internal table in Hive automatically deletes the corresponding HBase table. True or false?
- True
- False
Question 9: What is the main purpose of an HBase Counter?
- To count the number of column families
- To increment column values for statistical data collection
- To count the number of region servers
- To count the number of regions
- All of the above
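The idea of a counter that several clients increment concurrently can be sketched in a few lines. This is a toy Python illustration, not the HBase API: `ToyCounter`, its `increment` method, and the row and column names are hypothetical, and a lock stands in for whatever atomicity the real server provides.

```python
import threading


class ToyCounter:
    """Hypothetical counter store keyed by (row, column); not the HBase client."""

    def __init__(self):
        self._lock = threading.Lock()
        self._values = {}

    def increment(self, row, column, amount=1):
        # Read-modify-write happens under one lock, so no update is lost.
        with self._lock:
            key = (row, column)
            self._values[key] = self._values.get(key, 0) + amount
            return self._values[key]


def record_hits(counter, n):
    for _ in range(n):
        counter.increment("page1", "daily:hits")


hits = ToyCounter()
workers = [threading.Thread(target=record_hits, args=(hits, 1000)) for _ in range(4)]
for worker in workers:
    worker.start()
for worker in workers:
    worker.join()

# Incrementing by 0 reads the current value without changing it.
total = hits.increment("page1", "daily:hits", 0)
```

Because each increment is applied as one atomic step in this model, four workers of 1000 increments each leave the counter at 4000.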
Question 10: Which file is used to specify configurations for HBase, HDFS, and ZooKeeper?
- hbase-env.sh
- RegionServer
- log4j.properties
- hbase-default.xml
- hbase-site.xml
Question 11: Which HBase component manages the race to add a backup master?
- Region Server
- HDFS
- ZooKeeper
- Region
- Primary master
Question 12: Which component of a region server is the actual storage file of the data?
- HRegion
- Store
- StoreFile
- HLog
- HFile
Question 13: When the master node is updated, which file can be used to automatically update the other nodes in the cluster?
- hbase-env.sh
- hbase-default.xml
- synchbase.sh
- hbase-site.xml
- syncconf.sh
Question 14: There is a single HLog for each region server. True or false?
- True
- False
Question 15: What is the main purpose of the Write-Ahead log?
- To flush data when the system reaches its capacity
- To store HBase configuration details
- To prevent data loss in the event of a system crash
- To store performance details
- To store HDFS configuration details
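The general write-ahead logging technique can be sketched as follows. This is a toy Python model, not HBase internals: `ToyRegionServer` is a hypothetical class, a plain list stands in for a durable log file, and a dict stands in for the in-memory store.

```python
class ToyRegionServer:
    """Hypothetical server that logs every edit before applying it in memory."""

    def __init__(self, wal=None):
        # The list stands in for a durable log that survives a crash.
        self.wal = wal if wal is not None else []
        # The dict stands in for volatile in-memory state.
        self.memstore = {}
        # Recovery: replay every logged edit in order.
        for row, value in self.wal:
            self.memstore[row] = value

    def put(self, row, value):
        self.wal.append((row, value))  # 1. record the edit in the log first
        self.memstore[row] = value     # 2. then apply it in memory


server = ToyRegionServer()
server.put("row1", b"v1")
server.put("row2", b"v2")

durable_log = server.wal
del server  # simulated crash: the in-memory state is gone, the log remains

recovered = ToyRegionServer(wal=durable_log)
```

After the simulated crash, `recovered.memstore` holds both edits again because they were written to the log before being applied in memory.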