Login
Remember
Register
Ask a Question
Recent questions in Hadoop
0
votes
1
answer
Point out the wrong statement. a) Replication Factor can be configured at a cluster level (Default is set to 3) and also at a file level
asked
Oct 23, 2022
in
Hadoop
by
DavidAnderson
hadoop-filesystem-hdfs-questions-answers
0
votes
1
answer
________ NameNode is used when the Primary NameNode goes down.
asked
Oct 23, 2022
in
Hadoop
by
DavidAnderson
hadoop-filesystem-hdfs-questions-answers
0
votes
1
answer
HDFS works in a __________ fashion.
asked
Oct 23, 2022
in
Hadoop
by
DavidAnderson
hadoop-filesystem-hdfs-questions-answers
0
votes
1
answer
Point out the correct statement. a) DataNode is the slave/worker node and holds the user data in the form of Data Blocks
asked
Oct 23, 2022
in
Hadoop
by
DavidAnderson
hadoop-filesystem-hdfs-questions-answers
0
votes
1
answer
A ________ serves as the master and there is only one NameNode per cluster.
asked
Oct 23, 2022
in
Hadoop
by
DavidAnderson
hadoop-filesystem-hdfs-questions-answers
0
votes
1
answer
Hadoop has a library class, org.apache.hadoop.mapred.lib.FieldSelectionMapReduce, that effectively allows you to process text data like the unix ______ utility
asked
Oct 23, 2022
in
Hadoop
by
DavidAnderson
hadoop-questions-answers-hadoop-streaming
0
votes
1
answer
Which of the following class is provided by the Aggregate package?
asked
Oct 23, 2022
in
Hadoop
by
DavidAnderson
hadoop-questions-answers-hadoop-streaming
0
votes
1
answer
Which of the following class provides a subset of features provided by the Unix/GNU Sort?
asked
Oct 23, 2022
in
Hadoop
by
DavidAnderson
hadoop-questions-answers-hadoop-streaming
0
votes
1
answer
______________ class allows the Map/Reduce framework to partition the map outputs based on certain key fields, not the whole keys.
asked
Oct 23, 2022
in
Hadoop
by
DavidAnderson
hadeep-questions-answers
0
votes
1
answer
The ________ option allows you to copy jars locally to the current working directory of tasks and automatically unjar the files. a) archives b) files c) task d) none of the mentioned
asked
Oct 23, 2022
in
Hadoop
by
DavidAnderson
hadoop-questions-answers-hadoop-streaming
0
votes
1
answer
Point out the wrong statement. a) Hadoop has a library package called Aggregate
asked
Oct 23, 2022
in
Hadoop
by
DavidAnderson
hadoop-questions-answers-hadoop-streaming
0
votes
1
answer
To set an environment variable in a streaming command use ____________
asked
Oct 23, 2022
in
Hadoop
by
DavidAnderson
hadoop-questions-answers-hadoop-streaming
0
votes
1
answer
Which of the following Hadoop streaming command option parameter is required? a) output directoryname b) mapper executable c) input directoryname d) all of the mentioned
asked
Oct 23, 2022
in
Hadoop
by
DavidAnderson
hadoop-questions-answers-hadoop-streaming
0
votes
1
answer
Point out the correct statement. a) You can specify any executable as the mapper and/or the reducer
asked
Oct 23, 2022
in
Hadoop
by
DavidAnderson
hadoop-questions-answers-hadoop-streaming
0
votes
1
answer
Streaming supports streaming command options as well as _________ command options.
asked
Oct 23, 2022
in
Hadoop
by
DavidAnderson
hadoop-questions-answers-hadoop-streaming
0
votes
1
answer
__________ represent the logical computations of your Crunch pipelines.
asked
Oct 22, 2022
in
Hadoop
by
DavidAnderson
hadoop-questions-answers
0
votes
1
answer
___________ executes the pipeline as a series of MapReduce jobs.
asked
Oct 22, 2022
in
Hadoop
by
DavidAnderson
hadoop-questions-answer
0
votes
1
answer
Hive, Pig, and Cascading all use a _________ data model.
asked
Oct 22, 2022
in
Hadoop
by
DavidAnderson
hadoop-questions-answers
0
votes
1
answer
Crunch was designed for developers who understand __________ and want to use MapReduce effectively.
asked
Oct 22, 2022
in
Hadoop
by
DavidAnderson
hadoop-questions-answers
0
votes
1
answer
Point out the wrong statement. a) Crunch pipeline written by the development team sessionizes a set of user logs generates are then processed by a diverse collection of Pig scripts and Hive queries
asked
Oct 22, 2022
in
Hadoop
by
DavidAnderson
hadoop-questions-answers
0
votes
1
answer
The Crunch APIs are modeled after _________ which is the library that Google uses for building data pipelines on top of their own implementation of MapReduce.
asked
Oct 22, 2022
in
Hadoop
by
DavidAnderson
hadoop-questions-answers
0
votes
1
answer
Point out the correct statement. a) Scrunch’s Java API is centered around three interfaces that represent distributed datasets
asked
Oct 22, 2022
in
Hadoop
by
DavidAnderson
hadoop-questions-answers
0
votes
1
answer
The Apache Crunch Java library provides a framework for writing, testing, and running ___________ pipelines.
asked
Oct 22, 2022
in
Hadoop
by
DavidAnderson
hadoop-questions-answer
0
votes
1
answer
In which year Apache Mahout started?
asked
Oct 7, 2022
in
Hadoop
by
Robin
hadoop
mahout
0
votes
1
answer
What are the main configuration files in Hadoop?
asked
May 22, 2022
in
Hadoop
by
AdilsonLima
hadoop
0
votes
1
answer
Which of the following options are the characteristics of Sqoop
asked
Mar 2, 2022
in
Hadoop
by
DavidAnderson
sqoop-flume-and-oozie-interview-question-answer
0
votes
1
answer
Which of the following is the component of YARN?
asked
Jul 28, 2021
in
Hadoop
by
SakshiSharma
hadoop-manager
0
votes
1
answer
How many major component Yarn has?
asked
Jul 28, 2021
in
Hadoop
by
SakshiSharma
yarn-component
0
votes
1
answer
YARN is the one who helps to manage the resources across the ________.
asked
Jul 28, 2021
in
Hadoop
by
SakshiSharma
yarn-resource
0
votes
1
answer
What is the full form of YARN?
asked
Jul 28, 2021
in
Hadoop
by
SakshiSharma
yarn
Page:
« prev
1
2
3
4
5
6
7
8
9
10
next »
Recent questions in Hadoop
...