SDS Questions and Answers

Question # 6

Which of the following visualizes variation in the base data and can be used to identify outliers for further investigation?

A. Trend Analysis
B. Box Plots
C. Histogram
D. Scatter Plot
E. None of the above
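
To make "identifying outliers for further investigation" concrete, here is a minimal sketch of one common quartile-based rule: values more than 1.5 interquartile ranges beyond the quartiles are flagged. The function name `iqr_outliers`, the multiplier `k`, and the sample data are illustrative assumptions, not part of the question.

```python
import statistics


def iqr_outliers(values, k=1.5):
    """Return the values lying more than k * IQR outside the quartiles."""
    # quantiles(n=4) returns the three cut points Q1, Q2 (median), Q3.
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


# Hypothetical sample: nine ordinary readings and one suspicious one.
data = [12, 13, 13, 14, 15, 15, 16, 17, 18, 95]
flagged = iqr_outliers(data)
print(flagged)
```

The flagged values are candidates for a closer look, not automatic deletions.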

Question # 7

Machine learning can be used in:

A. Fraud detection
B. Web search results
C. Real-time ads on web pages and mobile devices
D. Pattern and image recognition
E. All of the above

Question # 8

Business Intelligence (BI) is:

A. BI focuses on descriptive analytics
B. BI focuses on "What happened?"
C. BI focuses on reporting on the future state of the business
D. Both A and B
E. Both B and C

Question # 9

Which of the following is NOT an example of a graphical model?

A. Road maps
B. Electrical circuits
C. Computer networks
D. Geographical networks
E. Flow charts

Question # 10

What do the members of an Agile team discuss at a "stand-up" meeting?

A. What they accomplished the previous day
B. What they are planning to do today
C. Any roadblocks they are running into
D. Both A and B
E. All of the above

Question # 11

Which of the following is TRUE for "By" analysis?

A. The "By" analysis technique reinforces the process of "thinking like a data scientist."
B. "By" analysis is a technique by which business subject matter experts (SMEs) and the Data Science team can collaborate to uncover new variables and metrics that might be better predictors of business performance.
C. "By" analysis is a collaborative technique that drives alignment between business users and data scientists to identify and brainstorm variables and metrics that might be better predictors of business performance.
D. Both B and C
E. All of the above

Question # 12

A Bernoulli random variable is a type of:

A. Discrete random variable
B. Continuous random variable
C. Sometimes discrete, sometimes continuous random variable
D. Both A and B

Question # 13

Maximum Likelihood Estimation (MLE) is a way to frame:

A. A large class of problems in Data Science
B. A small class of problems in Data Science
C. A large class of problems in HDFS
D. A small class of problems in HDFS
E. Both A and C
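
As a small illustration of what framing a problem with MLE looks like, the sketch below estimates the success probability of a Bernoulli variable (the kind of variable in Question # 12) by choosing the parameter that makes the observed data most likely. The data, the grid resolution, and the names `log_likelihood`, `p_closed`, and `p_grid` are assumptions made for this example.

```python
import math


def log_likelihood(p, data):
    """Log-probability of observing `data` (0/1 outcomes) when P(1) = p."""
    return sum(math.log(p) if x == 1 else math.log(1 - p) for x in data)


# Hypothetical observations: 7 successes in 10 trials.
data = [1, 0, 1, 1, 0, 1, 1, 0, 1, 1]

# For a Bernoulli variable the maximizer has a closed form: the sample mean.
p_closed = sum(data) / len(data)

# A brute-force search over candidate values should agree with it.
grid = [i / 100 for i in range(1, 100)]
p_grid = max(grid, key=lambda p: log_likelihood(p, data))

print(p_closed, p_grid)
```

The same recipe (write down the likelihood, then maximize it over the parameters) carries over to other models, although the maximization usually needs a numerical optimizer rather than a grid.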

Question # 14

Which of the following is NOT a phase of the Big Data Business Model Maturity Index?

A. Business Monitoring
B. Business Optimization
C. Business Strategy
D. Data Monetization
E. Business Metamorphosis

Question # 15

Exploratory analytic algorithms help the Data Science team to better:

A. Understand the data content
B. Gain a high-level understanding of relationships
C. Understand patterns in the data
D. Both A and B
E. All of the above

Question # 16

In regression, the principle of machine learning is used to optimize the parameters to:

A. Minimize the approximation error
B. Calculate the closest possible outcomes
C. Both A and B
D. None of the above
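
For readers who want to see parameter optimization in regression spelled out, here is a minimal sketch of ordinary least squares for a straight line: the slope and intercept are chosen so that the sum of squared differences between predictions and observations is as small as possible. The helper names `fit_line` and `sse` and the sample points are assumptions for illustration only.

```python
def fit_line(xs, ys):
    """Closed-form least-squares slope and intercept for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def sse(xs, ys, slope, intercept):
    """Sum of squared errors between the line's predictions and the observations."""
    return sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))


# Hypothetical, roughly linear data.
xs = [1, 2, 3, 4, 5]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]

slope, intercept = fit_line(xs, ys)
best_error = sse(xs, ys, slope, intercept)
print(slope, intercept, best_error)
```

Nudging either parameter away from the fitted values increases `sse`, which is what "optimal" means under this error measure.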

Question # 17

Which of the following is FALSE for Social Network Analysis (SNA)?

A. Social Network Analysis (SNA) is an example of graph analysis
B. Social Network Analysis (SNA) is an example of trend analysis
C. SNA is used to investigate social structures and relationships across social networks
D. SNA characterizes networked structures in terms of nodes and the ties or edges that connect them
E. None of the above

Question # 18

Which of the following is TRUE for a Chief Data Monetization Officer (CDMO)?

i. The CDMO should focus on driving and deriving value from the organization's data and analytic assets.

ii. The CDMO should own the organization's investment decisions with respect to data and analytics.

iii. The CDMO should have revenue and margin responsibilities.

A. i, ii
B. ii, iii
C. All of the above

Question # 19

What is Scrumban?

A. It is Scrum
B. It is Kanban
C. It combines the principles of Scrum and Kanban into a pull-based system
D. It combines the principles of Scrum and Kanban into a push-based system

Question # 20

What is TRUE for “rehashing”?

A. Allocate a new, larger hash table in memory
B. It requires a new hash function, which maps values into a larger range of integers
C. Key/value pairs from the original table can be inserted into the new, larger one
D. Both A and B
E. All of the above
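
The operations named in the options (allocating a larger table, hashing into a larger range, and re-inserting the existing key/value pairs) can be sketched in a toy hash table. This is one possible design under stated assumptions: separate chaining, doubling the capacity, and a load-factor threshold of 0.75. The class name `HashTable` and its methods are hypothetical.

```python
class HashTable:
    """Toy separate-chaining hash table that rehashes when it gets too full."""

    def __init__(self, capacity=4, max_load=0.75):
        self.capacity = capacity
        self.max_load = max_load
        self.size = 0
        self.buckets = [[] for _ in range(capacity)]

    def _index(self, key, capacity=None):
        # The bucket index depends on the table size, so a larger table
        # maps keys into a larger range of integers.
        return hash(key) % (capacity or self.capacity)

    def put(self, key, value):
        bucket = self.buckets[self._index(key)]
        for i, (existing_key, _) in enumerate(bucket):
            if existing_key == key:
                bucket[i] = (key, value)  # overwrite an existing key
                return
        bucket.append((key, value))
        self.size += 1
        if self.size / self.capacity > self.max_load:
            self._rehash()

    def get(self, key):
        for existing_key, value in self.buckets[self._index(key)]:
            if existing_key == key:
                return value
        raise KeyError(key)

    def _rehash(self):
        # Allocate a new, larger table and re-insert every stored pair.
        new_capacity = self.capacity * 2
        new_buckets = [[] for _ in range(new_capacity)]
        for bucket in self.buckets:
            for key, value in bucket:
                new_buckets[self._index(key, new_capacity)].append((key, value))
        self.capacity = new_capacity
        self.buckets = new_buckets


table = HashTable()
for i in range(10):
    table.put(f"key{i}", i)
print(table.capacity, table.size, table.get("key7"))
```

Doubling keeps the amortized cost of an insert constant, at the price of an occasional expensive insert that triggers the copy.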

Question # 21

Which of the following is TRUE for a data lake?

A. The data lake can make both the Business Intelligence and Data Science environments more agile and more productive
B. The data lake enables organizations to gather, manage, enrich, and analyze many new sources of data, whether structured or unstructured
C. The data lake enables organizations to treat data as an organizational asset to be gathered and nurtured versus a cost to be minimized
D. The data lake can make both the Business Intelligence and Data Science environments less agile and more productive
E. None of the above

Question # 22

Spark is written in:

A. Scala
B. Java
C. C
D. C++
E. Python

Question # 23

Which of the following errors refers to the incorrect rejection of a true null hypothesis?

A. Type I Error
B. Type II Error
C. Logical Error
D. Hypothesis Error
E. None of the above
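
The situation the question describes, rejecting a null hypothesis that is in fact true, can be observed in a small simulation. The sketch below repeatedly draws samples from a population whose mean really is 0 and runs a two-sided z-test at significance level `alpha`; the fraction of rejections should land near `alpha`. The function name, sample size, trial count, and seed are assumptions for illustration, and the population standard deviation is assumed known and equal to 1.

```python
import random
from statistics import NormalDist


def false_rejection_rate(trials=4000, n=30, alpha=0.05, seed=0):
    """Fraction of trials in which a true null hypothesis (mean = 0) is rejected."""
    rng = random.Random(seed)
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)  # two-sided critical value
    rejections = 0
    for _ in range(trials):
        sample = [rng.gauss(0.0, 1.0) for _ in range(n)]
        sample_mean = sum(sample) / n
        z = sample_mean * n ** 0.5  # z statistic with known sigma = 1
        if abs(z) > z_crit:
            rejections += 1
    return rejections / trials


rate = false_rejection_rate()
print(rate)
```

Lowering `alpha` makes such false rejections rarer, at the cost of making real effects harder to detect.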

Question # 24

Data wrangling is the process of getting the data from:

A. Its raw format into something suitable for more conventional analytics
B. Its modified meaning format into something suitable for more conventional analytics
C. Both A and B
D. None of the above

Question # 25

Spark should be used when:

A. Data is massive
B. Data is not massive
C. Both A and B
D. None of the above
