Month End Special 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: sale65best

Effective Study Techniques for the EMC E20-065 Exam

Questions 1

What is an intended application of the MapReduce framework?

Options:
A.

Processing can be broken into smaller pieces

B.

Processing a large number of small files

C.

Processing in real time is required

D.

Processing a small subset of data

EMC E20-065 Premium Access
Questions 2

What is a typical use of a UDF in Pig?

Options:
A.

Creating functionality outside of what is provided by the built-in functions

B.

Providing Functional access to user-defined data in HDFS

C.

Providing advanced analytics to Hadoop

D.

Providing an interface from Pig to Microsoft Excel for easier data manipulation

Questions 3

What is a property of a good color model for ordinal data?

Options:
A.

Uses a rainbow-like color map for distinction of categories

B.

Uses a rainbow-like color map for ease of display and printing

C.

Uses perceptually ordinal colors with just-noticeable increments

D.

Uses perceptually ordinal colors with linear, perceptual increments

Questions 4

What is a characteristic of lemmatization?

Options:
A.

Can be performed by calling the synset () function on a lemma in LNTK

B.

Can be performed by calling the lemma() function on a synset in LNTK

C.

Reduces words of variant forms to their base forms based on a set of heuristics

D.

Reduces words of variant forms to their base forms based on a dictionary

Questions 5

What are the major components of the YARN architecture?

Options:
A.

ResourceManager and NodeManager

B.

Task Tracker and NameNode

C.

HDFS, Tez, and Spark

D.

Avro, ZooKeeper, and HDFS

Questions 6

Which metric would be most helpful in identifying a node that may cause network disruption if the node were removed?

Options:
A.

Degree

B.

Closeness

C.

Betweenness

D.

PageRank

Questions 7

Consider dataset that resides in HDFS. Which tool natively provides the capability to run a Random Forests model against this data?

Options:
A.

Mahout

B.

Pig

C.

Hive

D.

HBase

Questions 8

What is a characteristic of stop words?

Options:
A.

Used in term frequency analysis

B.

Include words such as "a", "an", and "the"

C.

Meaningful words requiring a parser to stop and examine them

D.

Don't occur often in text

Questions 9

What is the most likely reason for an HBase table to contain millions of columns?

Options:
A.

Data is imported from a relational database table

B.

Data is stored in the column qualifier

C.

There are thousands of columns families

D.

The column names are randomly generated

Questions 10

What elements are needed to determine the time complexity of finding all the cliques of size k in social network analysis?

Options:
A.

Eigenvector centrality and betwenness

B.

Clique size and total number of nodes in the network

C.

Number of edges in the network and centrality measure of the cliques

D.

Clique size and betweenness centrality