
Free Cloudera CCA-500 Practice Exam with Questions & Answers

Questions 1

On a cluster running MapReduce v2 (MRv2) on YARN, a MapReduce job is given a directory of 10 plain text files as its input directory. Each file is made up of 3 HDFS blocks. How many Mappers will run?

Options:
A.

We cannot say; the number of Mappers is determined by the ResourceManager

B.

We cannot say; the number of Mappers is determined by the developer

C.

30

D.

3

E.

10

F.

We cannot say; the number of mappers is determined by the ApplicationMaster
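The arithmetic this question turns on can be sketched as below. The sketch assumes the default behavior in which each splittable plain-text file yields roughly one input split per HDFS block and one Mapper runs per split. The function name `count_splits` and the 128 MB block size are assumptions made for illustration; real split counts also depend on the configured minimum and maximum split sizes.

```python
import math

BLOCK_SIZE = 128 * 1024 * 1024  # assumed HDFS block size in bytes


def count_splits(file_sizes, split_size=BLOCK_SIZE):
    """Approximate the number of input splits: one per split_size chunk of each file."""
    return sum(math.ceil(size / split_size) for size in file_sizes if size > 0)


# Ten files, each exactly three blocks long, as in the question.
files = [3 * BLOCK_SIZE] * 10
print(count_splits(files))  # prints 30
```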

Questions 2

Why should you run the HDFS balancer periodically? (Choose three)

Options:
A.

To ensure that there is capacity in HDFS for additional data

B.

To ensure that all blocks in the cluster are 128MB in size

C.

To help HDFS deliver consistent performance under heavy loads

D.

To ensure that there is consistent disk utilization across the DataNodes

E.

To improve data locality for MapReduce
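As background for this question, the balancer compares each DataNode's disk utilization with the cluster-wide average and treats a node as out of balance when the difference exceeds a threshold (10 percent by default). The following is a simplified sketch of that comparison, not the balancer's actual implementation; the function name `classify_datanodes` and the node names are hypothetical.

```python
def classify_datanodes(used_bytes, capacity_bytes, threshold_pct=10.0):
    """Label each DataNode relative to the cluster-wide average utilization."""
    cluster_util = 100.0 * sum(used_bytes.values()) / sum(capacity_bytes.values())
    labels = {}
    for node, used in used_bytes.items():
        node_util = 100.0 * used / capacity_bytes[node]
        if node_util > cluster_util + threshold_pct:
            labels[node] = "over-utilized"
        elif node_util < cluster_util - threshold_pct:
            labels[node] = "under-utilized"
        else:
            labels[node] = "balanced"
    return labels


# Hypothetical three-node cluster with equal capacity and uneven usage.
used = {"dn1": 90, "dn2": 50, "dn3": 10}
capacity = {"dn1": 100, "dn2": 100, "dn3": 100}
print(classify_datanodes(used, capacity))
```

In this made-up cluster the average utilization is 50 percent, so `dn1` and `dn3` fall outside the 10-point band and would be candidates for block movement.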

Questions 3

In CDH4 and later, which file contains a serialized form of all the directory and file inodes in the filesystem, giving the NameNode a persistent checkpoint of the filesystem metadata?

Options:
A.

fstime

B.

VERSION

C.

fsimage_N (where N reflects all transactions up to transaction ID N)

D.

edits_N-M (where N-M covers the transactions between transaction ID N and transaction ID M)
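The naming scheme described in the options can be made concrete with a small parser. This is a sketch only: the sample file names are illustrative rather than taken from a real NameNode directory, and the helper `describe` is hypothetical.

```python
import re

FSIMAGE_RE = re.compile(r"^fsimage_(\d+)$")
EDITS_RE = re.compile(r"^edits_(\d+)-(\d+)$")


def describe(name):
    """Return (kind, first_txid, last_txid) for a NameNode metadata file name."""
    m = FSIMAGE_RE.match(name)
    if m:
        # A checkpoint covers every transaction up to and including N.
        return ("fsimage", None, int(m.group(1)))
    m = EDITS_RE.match(name)
    if m:
        # A finalized edit log segment covers transactions N through M.
        return ("edits", int(m.group(1)), int(m.group(2)))
    return ("other", None, None)


for sample in ["fsimage_0000000000000000042",
               "edits_0000000000000000043-0000000000000000057",
               "VERSION"]:
    print(sample, describe(sample))
```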

Questions 4

Your cluster is running MapReduce version 2 (MRv2) on YARN. Your ResourceManager is configured to use the FairScheduler. Now you want to configure your scheduler such that a new user on the cluster can submit jobs into their own queue at application submission. Which configuration should you set?

Options:
A.

You can specify a new queue name when a user submits a job, and the new queue can be created dynamically if the property yarn.scheduler.fair.allow-undeclared-pools = true

B.

yarn.scheduler.fair.user-as-default-queue = false and yarn.scheduler.fair.allow-undeclared-pools = true

C.

You can specify a new queue name when a user submits a job, and the new queue can be created dynamically if yarn.scheduler.fair.user-as-default-queue = false

D.

You can specify a new queue name per application in the allocations.xml file and have new jobs automatically assigned to the application queue
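To see how the two FairScheduler properties named in the options interact, here is a simplified model of queue selection at submission time. It assumes the documented defaults (both properties true) and ignores queue placement policies and the `root.` prefix; the function `resolve_queue` is a hypothetical illustration, not scheduler code.

```python
def resolve_queue(user, requested_queue, declared_queues,
                  user_as_default_queue=True, allow_undeclared_pools=True):
    """Pick the queue an application lands in under a simplified FairScheduler model."""
    if requested_queue:
        queue = requested_queue      # the submitter named a queue explicitly
    elif user_as_default_queue:
        queue = user                 # fall back to a queue named after the user
    else:
        queue = "default"            # everyone shares the default queue
    # Queues missing from the allocations file exist only if undeclared pools are allowed.
    if queue not in declared_queues and not allow_undeclared_pools:
        return "default"
    return queue


declared = {"default", "production"}
print(resolve_queue("alice", None, declared))
print(resolve_queue("alice", None, declared, allow_undeclared_pools=False))
print(resolve_queue("alice", "adhoc", declared, user_as_default_queue=False))
```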

Questions 5

You have just run a MapReduce job to filter user messages to only those of a selected geographical region. The output for this job is in a directory named westUsers, located just below your home directory in HDFS. Which command gathers these into a single file on your local file system?

Options:
A.

hadoop fs -getmerge -R westUsers.txt

B.

hadoop fs -getmerge westUsers westUsers.txt

C.

hadoop fs -cp westUsers/* westUsers.txt

D.

hadoop fs -get westUsers westUsers.txt
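The operation the question describes, gathering a directory of job output files into one local file, can be illustrated with a local-filesystem analogue. This sketch is not a Hadoop command or its implementation: it works on a temporary local directory standing in for `westUsers`, the `part-r-*` file names are assumed, and concatenating in name order is an assumption of the sketch.

```python
import os
import tempfile


def merge_directory(src_dir, dest_file):
    """Concatenate every regular file in src_dir, in name order, into dest_file."""
    with open(dest_file, "wb") as out:
        for name in sorted(os.listdir(src_dir)):
            path = os.path.join(src_dir, name)
            if os.path.isfile(path):
                with open(path, "rb") as part:
                    out.write(part.read())


# Demo with a throwaway directory standing in for the job output directory.
with tempfile.TemporaryDirectory() as tmp:
    src = os.path.join(tmp, "westUsers")
    os.mkdir(src)
    for name, text in [("part-r-00000", "alice\n"), ("part-r-00001", "bob\n")]:
        with open(os.path.join(src, name), "w") as f:
            f.write(text)
    merged = os.path.join(tmp, "westUsers.txt")
    merge_directory(src, merged)
    with open(merged) as f:
        result = f.read()

print(result)
```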

Questions 6

You have a 20-node Hadoop cluster, with 18 slave nodes and 2 master nodes running HDFS High Availability (HA). You want to minimize the chance of data loss in your cluster. What should you do?

Options:
A.

Add another master node to increase the number of nodes running the JournalNode, which increases the number of machines available to HA to create a quorum

B.

Set an HDFS replication factor that provides data redundancy, protecting against node failure

C.

Run a Secondary NameNode on a different master from the NameNode in order to provide automatic recovery from a NameNode failure.

D.

Run the ResourceManager on a different master from the NameNode in order to load-share HDFS metadata processing

E.

Configure the cluster’s disk drives with an appropriate fault tolerant RAID level

Questions 7

You are running a Hadoop cluster with all monitoring facilities properly configured.

Which scenario will go undetected?

Options:
A.

HDFS is almost full

B.

The NameNode goes down

C.

A DataNode is disconnected from the cluster

D.

Map or reduce tasks that are stuck in an infinite loop

E.

MapReduce jobs are causing excessive memory swaps

Questions 8

Your Hadoop cluster is configured with HDFS and MapReduce version 2 (MRv2) on YARN. Can you configure a worker node to run a NodeManager daemon but not a DataNode daemon and still have a functional cluster?

Options:
A.

Yes. The daemon will receive data from the NameNode to run Map tasks

B.

Yes. The daemon will get data from another (non-local) DataNode to run Map tasks

C.

Yes. The daemon will receive Map tasks only

D.

Yes. The daemon will receive Reducer tasks only

Questions 9

A user comes to you, complaining that when she attempts to submit a Hadoop job, it fails. There is a directory in HDFS named /data/input. The JAR is named j.jar, and the driver class is named DriverClass.

She runs the command:

hadoop jar j.jar DriverClass /data/input /data/output

The error message returned includes the line:

PriviligedActionException as:training (auth:SIMPLE) cause:org.apache.hadoop.mapreduce.lib.input.InvalidInputException: Input path does not exist: file:/data/input

What is the cause of the error?

Options:
A.

The user is not authorized to run the job on the cluster

B.

The output directory already exists

C.

The name of the driver has been spelled incorrectly on the command line

D.

The directory name is misspelled in HDFS

E.

The Hadoop configuration files on the client do not point to the cluster
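The `file:` scheme in the error message is worth a closer look. The sketch below models how an unqualified path might be resolved against the default filesystem setting (`fs.defaultFS`), whose built-in value is `file:///` when no cluster configuration is loaded. The function `qualify` and the host name `namenode.example.com` are hypothetical; this is a simplified model, not Hadoop's path resolution code.

```python
from urllib.parse import urlparse

LOCAL_DEFAULT_FS = "file:///"  # built-in default when no cluster config is loaded


def qualify(path, default_fs=LOCAL_DEFAULT_FS):
    """Attach the default filesystem's scheme and authority to an unqualified path."""
    if urlparse(path).scheme:
        return path                  # already fully qualified
    fs = urlparse(default_fs)
    if fs.scheme == "file":
        return "file:" + path        # local filesystem, no authority part
    return f"{fs.scheme}://{fs.netloc}{path}"


print(qualify("/data/input"))
print(qualify("/data/input", "hdfs://namenode.example.com:8020"))
```

Under these assumptions the first call yields `file:/data/input`, matching the form of the path in the error message, while the second yields an `hdfs://` path on the hypothetical NameNode.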

Questions 10

You are configuring a Linux server running HDFS and MapReduce version 2 (MRv2) on YARN. How must you format the underlying file system of each DataNode?

Options:
A.

They must be formatted as HDFS

B.

They must be formatted as either ext3 or ext4

C.

They may be formatted in any Linux file system

D.

They must not be formatted; HDFS will format the file system automatically

Exam Code: CCA-500
Certification Provider: Cloudera
Exam Name: Cloudera Certified Administrator for Apache Hadoop (CCAH)
Last Update: Mar 23, 2025
Questions: 60