Big-Data

Question 1
The DataNode and NameNode in Hadoop are
A
Worker Node and Master Node respectively
B
Master Node and Worker Node respectively
C
Both Worker Nodes
D
Both Master Nodes
Question 1 Explanation: 
The Hadoop Distributed File System (HDFS) has a master/slave architecture. An HDFS cluster consists of:
1. A NameNode, a master server that manages the file system namespace and regulates access to files by clients.
2. DataNodes, usually one per node in the cluster, which manage the storage attached to the nodes they run on.
HDFS exposes a file system namespace and allows user data to be stored in files. Internally, a file is split into one or more blocks, and these blocks are stored on a set of DataNodes. The NameNode executes file system namespace operations such as opening, closing, and renaming files and directories. It also determines the mapping of blocks to DataNodes. The DataNodes are responsible for serving read and write requests from the file system's clients. They also perform block creation, deletion, and replication upon instruction from the NameNode.
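The division of labour described above can be illustrated with a small in-memory model. This is only a toy sketch, not the HDFS API: the class names `NameNode` and `DataNode` mirror the roles in the explanation, but every method (`allocate_block`, `locate`, `rename`, `write_block`, `read_block`), the helper functions, the 4-byte block size, the replication factor of 2, and the round-robin placement are hypothetical choices made for illustration.

```python
class DataNode:
    """Toy worker: stores block contents and serves reads and writes."""

    def __init__(self, name):
        self.name = name
        self.blocks = {}  # block id -> bytes

    def write_block(self, block_id, data):
        self.blocks[block_id] = data

    def read_block(self, block_id):
        return self.blocks[block_id]


class NameNode:
    """Toy master: holds only metadata (namespace and block locations)."""

    def __init__(self, datanode_names, replication=2):
        self.datanode_names = list(datanode_names)
        self.replication = replication  # assumed value, for illustration
        self.namespace = {}             # path -> ordered list of block ids
        self.block_map = {}             # block id -> list of DataNode names
        self._cursor = 0                # round-robin placement (an assumption)

    def allocate_block(self, path):
        blocks = self.namespace.setdefault(path, [])
        block_id = f"{path}#{len(blocks)}"
        n = len(self.datanode_names)
        targets = [self.datanode_names[(self._cursor + i) % n]
                   for i in range(self.replication)]
        self._cursor = (self._cursor + 1) % n
        blocks.append(block_id)
        self.block_map[block_id] = targets
        return block_id, targets

    def locate(self, path):
        return [(b, self.block_map[b]) for b in self.namespace[path]]

    def rename(self, old_path, new_path):
        # A namespace operation: no block data moves between DataNodes.
        self.namespace[new_path] = self.namespace.pop(old_path)


def write_file(namenode, datanodes, path, data, block_size=4):
    """Client side: split data into blocks and send each to its DataNodes."""
    for offset in range(0, len(data), block_size):
        block_id, targets = namenode.allocate_block(path)
        for name in targets:
            datanodes[name].write_block(block_id, data[offset:offset + block_size])


def read_file(namenode, datanodes, path):
    """Client side: ask the NameNode where blocks live, then read them."""
    return b"".join(datanodes[locations[0]].read_block(block_id)
                    for block_id, locations in namenode.locate(path))


names = ["dn1", "dn2", "dn3"]
namenode = NameNode(names)
datanodes = {name: DataNode(name) for name in names}
write_file(namenode, datanodes, "/logs/a.txt", b"hello hdfs!")
print(namenode.locate("/logs/a.txt"))
print(read_file(namenode, datanodes, "/logs/a.txt"))
```

In this model the NameNode never touches file contents; it only records which blocks make up a file and where their replicas are, which is why a rename changes the namespace without moving any data.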
Question 2
Which of the following is a component of Hadoop?
A
YARN
B
HDFS
C
MapReduce
D
All of the options
Question 2 Explanation: 
YARN, HDFS, and MapReduce are all components of Hadoop.
Question 3
Hadoop (a big data tool) works with a number of related tools. Which of the following lists the common tools used with Hadoop?
A
MySQL, Google API and MapReduce
B
MapReduce, Scala and hummer
C
MapReduce, HBase and Hive
D
MapReduce, hummer and Heron
Question 3 Explanation: 
The common tools used with Hadoop are mainly MapReduce, HBase, and Hive.
Question 4
Criticism-free idea generation is a factor of _____.
A
Decision Support System
B
Group Decision Support System
C
Enterprise Resource Support System
D
Artificial Intelligence
Question 4 Explanation: 
Criticism-free idea generation is a factor of a Group Decision Support System.
Question 5
Which of the following statements is/are true?
(i) Facebook has the world’s largest Hadoop cluster.
(ii) Hadoop 2.0 allows live stream processing of real-time data.
A
Neither (i) nor (ii)
B
Both (i) and (ii)
C
(i) only
D
(ii) only
Question 5 Explanation: 
→ The data warehouse Hadoop cluster at Facebook has become the largest known Hadoop storage cluster in the world.
Here are some of the details about this single HDFS cluster:
1. 21 PB of storage in a single HDFS cluster
2. 2000 machines
3. 12 TB per machine (a few machines have 24 TB each)
4. 1200 machines with 8 cores each + 800 machines with 16 cores each
5. 32 GB of RAM per machine
6. 15 map-reduce tasks per machine
That is a total of more than 21 PB of configured storage capacity, which is larger than Yahoo!'s previously known cluster of 14 PB.
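The per-machine figures in the list can be turned into cluster-wide totals with simple arithmetic. The short sketch below only multiplies the numbers quoted above; the variable names are made up for illustration, and it assumes every machine has exactly the listed RAM and task count.

```python
# Figures quoted in the list above.
machines_8_core = 1200
machines_16_core = 800
ram_gb_per_machine = 32
tasks_per_machine = 15

total_machines = machines_8_core + machines_16_core
total_cores = machines_8_core * 8 + machines_16_core * 16
total_ram_gb = total_machines * ram_gb_per_machine
total_task_slots = total_machines * tasks_per_machine

print(f"machines:   {total_machines}")
print(f"cores:      {total_cores}")
print(f"RAM (GB):   {total_ram_gb}")
print(f"task slots: {total_task_slots}")
```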
→ Hadoop 2.0 allows live stream processing of real-time data, because YARN lets processing engines other than MapReduce run on the same cluster.