Hadoop Quiz / Hadoop MCQ Set 11

 ultimate Hadoop Quiz!

1. What is HBASE?
Hbase is a part of the Apache Hadoop project that provides interface for scanning large amount of data using Hadoop infrastructure.
Hbase is separate set of the Java API for Hadoop cluster.
Hbase is a "database" like interface to Hadoop cluster data.
HBase is a part of the Apache Hadoop project that provides a SQL like interface for data processing.

2. You have loads of data that can be processed by you MRjobs. However you need the data to be available to Analysts and Scients in you organisation. What is the best format to represent the input?
Sequence File.
Avro.
XML.
HTML.

3. HDFS block size is larger as compared to the size of the disk blocks so that
A single file larger than the disk size can be stored across many disks in the cluster.
Transfer of a large files made of multiple disk blocks is not possible.
The seek time is maximum
Only HDFS files can be stored in the disk used.

4. You have user profile records in your OLTP database,that you want to join with weblogs you have already ingested into HDFS.How will you obtain these user records?
HDFS commands
Pig load
Sqoop import
Hive

5. What is map – side join?
Map-side join is done in the map phase and done in memory
Map-side join is a technique in which data is eliminated at the map step
Map-side join is a form of map-reduce API which joins data from different locations
None of these answers are correct

6. What license is Hadoop distributed under ?
Apache License 2.0
Commertial
Private
Mozilla Public License

7. Which of the following genres does Hadoop produce ?
Distributed file system
JAX-RS
Relational Database Management System
Java Message Service

8. Hadoop achieves reliability by replicating the data across multiple hosts, and hence does not require ________ storage on hosts.
RAID
Standard RAID levels
ZFS
Operating system

Score =
Correct answers: