Big Cyber Monday Sale - Special 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: 70dumps

DA0-001 Questions and Answers

Question # 6

A data analyst is using a two-tailed, independent t-test to determine whether the type of stretching, dynamic or static, has any influence on a dancer's flexibility. Which of the following is the alternative hypothesis?

A.

A dancer's flexibility is improved through static stretching.

B.

The change in a dancer's flexibility is not equal to zero.

C.

There is a difference in a dancer's flexibility between static and dynamic stretching.

D.

The means of the static and dynamic stretching groups do not differ from each other.

Full Access
Question # 7

A business unit made the following modification to the values in a table:

Which of the following data quality dimensions was applied in this scenario?

A.

Integrity

B.

Consistency

C.

Completeness

D.

Accuracy

Full Access
Question # 8

Given the information in the following tables:

Which of the following describes merging these tables to create a master file that includes all transactions for both online and in-store sales?

A.

Data audit

B.

Data completeness

C.

Data validation

D.

Data consolidation

Full Access
Question # 9

Given the image below:

The data should be cleaned because of the presence of:

A.

outlier

B.

non-parametric data.

C.

multicollinearity.

D.

invalid data.

Full Access
Question # 10

When analyzing the values of two variables, you decide to convert both variables so they are on a scale of 0 to 1.

What term describes this action?

A.

Filtering.

B.

Normalization.

C.

Transposition.

D.

Aggregation.

Full Access
Question # 11

An analyst is currently working on a ticket to revamp a company-wide dashboard that has been in use for five years. Which of the following should be the first step in the development process?

A.

Talk to the group that made the request to determine the desired goal.

B.

Make changes to a frequently used report that is already in production.

C.

Build an additional dashboard with fewer views tailored toward each specific team.

D.

Develop a more streamlined dashboard to roll out by the next delivery date.

Full Access
Question # 12

A sales director has requested a report for individual team members within the division be developed. The director would like the report to be shared with all team members, but individual team members should not be identifiable within the report Which of the following access requirements would support the director's needs?

A.

Create an acceptable use policy for the sales data.

B.

Release the report as user-group-based access and include data masking.

C.

Get a data use agreement from the individual team members.

D.

Provide the report based on role and include data encryption.

Full Access
Question # 13

Which of the following is the best reason to use database views instead of tables?

A.

Views reduce the need for repetitive, complex data joins.

B.

Views allow for the storage of temporary data, whereas tables do not.

C.

Views allow for the joining of multiple data sources, whereas tables do not.

D.

Views can be used to restrict anonymous sensitive information.

Full Access
Question # 14

An analyst is reviewing the following data:

Car IDSpeed

123155

566436

564418

650567

546436

645638

Which of the following should the analyst include in the measures of central tendency for speed?

A.

Mode = 38 Range = 31 Mean = 42.5

B.

Range = 49 Max = 67 Min = 18

C.

Mode = 36 Max = 67 Min = 18

D.

Mode = 36 Median = 37 Mean = 41.5

Full Access
Question # 15

What SQL command is used to delete an entire table from a database?

A.

DROP.

B.

MODIFY.

C.

DELETE.

D.

ALTER.

Full Access
Question # 16

A data analyst is reviewing SQL code and sees a query that uses terms such as MIN, SUM, and COUNT. Which of the following types of functions best describes these terms?

A.

Aggregate

B.

Logical

C.

Filtering

D.

System

Full Access
Question # 17

Which of the following describes the method of sampling in which elements of data are selected randomly from each of the small subgroups within a population?

A.

Simple random

B.

Cluster

C.

Systematic

D.

Stratified

Full Access
Question # 18

Which of the following best describes a difference between JSON and XML?

A.

JSON is quicker to read and write.

B.

JSON has to use an end tag.

C.

JSON strings are longer

D.

JSON is much more difficult to parse.

Full Access
Question # 19

A data analyst is building a closed won quarter-over-quarter report for the sales team. Which of the following will be needed to complete this request?

A.

The report create date and closed dollar amount

B.

The closed won quarter and the closed dollar amount

C.

The segment and closed dollar amount

D.

The closed won year and sales leader name

Full Access
Question # 20

Which of the following BEST describes the issue in which character values are mixed with integer values in a data set column?

A.

Duplicate data

B.

Missing data

C.

Data outliers

D.

Invalid data type

Full Access
Question # 21

An analyst wants to test the association between the number of doors in a car and the number of gears in the car. Which of the following is the best test to use?

A.

F-test

B.

Acceptance test

C.

Chi-squared test

D.

Z-test

Full Access
Question # 22

An analyst has been tracking company intranet usage and has been asked to create a chat to show the most-used/most-clicked portions of a homepage that contains more than 30 links. Which of the following visualizations would BEST illustrate this information?

A.

Scatter plot

B.

Heat map

C.

Pie chart

D.

Infographic

Full Access
Question # 23

Which one the following is not considered an aggregate function?

A.

SUM

B.

MIN

C.

SELECT

D.

MAX

Full Access
Question # 24

‘Which of the following is the BEST reason to use database views instead of tables?

A.

Views reduce the need for repetitive, complex data joins.

B.

Views allow for the storage of temporary data. whereas tables do not.

C.

Views allow for the joining of multiple data sources, whereas tables do not.

D.

Views can be used to restrict sensitive information.

Full Access
Question # 25

A data profiling rule checks the quality of all email addresses in a database. The rule returns a value with the number of email addresses that conformed to the rule. Which of the following options describes this value?

A.

Columns passed

B.

Rows passed

C.

Rows failed

D.

Columns failed

Full Access
Question # 26

Which of the following data manipulation techniques is an example of a logical function?

A.

WHERE

B.

AGGREGATE

C.

BOOLEAN

D.

IF

Full Access
Question # 27

Given the following graph:

Which of the following summary statements upholds integrity in data reporting?

A.

Sales are approximately equal for Product A and Product B across all strategies.

B.

Strategy 4 provides the best sales in comparison to other strategies.

C.

While Strategy 2 does not result in the highest sales of Product D, over all products it appears to be the most effective.

D.

Product D should be promoted more than the other products in all strategies.

Full Access
Question # 28

A salesperson who is prospecting potential clients collected the following data:

Which of the following is an issue with this data?

A.

Duplicate data

B.

Invalid data

C.

Missing value

D.

Redundant data

Full Access
Question # 29

What subset of Structured Query Language (SQL) is used to add, remove, modify, or retrieve the information stored within a relational database?

A.

DDL.

B.

DSL.

C.

DQL.

D.

DML.

Full Access
Question # 30

A data analyst needs to create a data visualization that aids in un the cumulative impact of sequentially introduced values that are positive or negative. Which of the following

data visualization methods should the analyst use?

A.

A bubble chart

B.

A waterfall chart

C.

A scatter plot

D.

A line chart

Full Access
Question # 31

A data set for sales per month includes the following data:

Which of the following cleaning and profiling methods should be applied to the data set?

A.

Data outliers

B.

Invalid data

C.

Duplicate data

D.

Data type validation

Full Access
Question # 32

Which of the following is an example of PII?

A.

Age

B.

Name

C.

Ethnicity

D.

Gender

Full Access
Question # 33

Given the customer table below:

Which of the following chart types is the most appropriate to represent the average spending of active customers vs. inactive customers?

A.

Pie chart

B.

Heat graph

C.

Scatter plot

D.

Line chart

Full Access
Question # 34

A data analyst has been asked to create an ad-hoc sales report for the Chief Executive Officer (CEO).

Which of the following should be included in the report?

A.

The sales representatives' home addresses.

B.

Line-item SKU numbers.

C.

YTD total sales.

D.

The customers' first and last names.

Full Access
Question # 35

Given the following athlete workout data (with inconsistent units or formats for time/distance), which of the following best describes the data quality issue?

A.

Duplicate data

B.

Data outlier

C.

Data inconsistency

D.

Invalid data

Full Access
Question # 36

A gambler thinks that a coin is fair and is equally likely to turn up heads or tails when the coin is flipped. Which of the following tests should the gambler use to fest this hypothesis?

A.

t-test

B.

Chi-squared test

C.

Rank sum test

D.

Ratio test

Full Access
Question # 37

Consider the following dataset which contains information about houses that are for sale:

Which of the following string manipulation commands will combine the address and region namecolumns to create a full address?

full_address------------------------- 85 Turner St, Northern Metropolitan 25 Bloomburg St, Northern Metropolitan 5 Charles St, Northern Metropolitan 40 Federation La, Northern Metropolitan 55a Park St, Northern Metropolitan

A.

SELECT CONCAT(address, ' , ' , regionname) AS full_address FROM melb LIMIT 5;

B.

SELECT CONCAT(address, '-' , regionname) AS full_address FROM melb LIMIT 5;

C.

SELECT CONCAT(regionname, ' , ' , address) AS full_address FROM melb LIMIT 5

D.

SELECT CONCAT(regionname, '-' , address) AS full_address FROM melb LIMIT 5;

Full Access
Question # 38

A company wants to know how its customers interact with an e-commerce website based on clicks over items. Which of the following is the primary requirement for this report?

A.

Data content

B.

Frequency

C.

Filtering

D.

Views

Full Access
Question # 39

An analyst wants to extract data from a variety of sources and store the data in a cloud-based environment prior to cleaning. Which of the following integration techniques should the analyst use?

A.

ETL

B.

API

C.

SQL

D.

ELT

Full Access
Question # 40

Which of the following types of data manipulation functions should a data analyst use to implement a YES/NO condition in a spreadsheet?

A.

Text

B.

Statistical

C.

Financial

D.

Logical

Full Access
Question # 41

A database administrator needs to increase performance on a large dimension table. Which of the following is the best way to accomplish this task?

A.

Sampling

B.

Partitioning

C.

Windowing

D.

Sorting

Full Access
Question # 42

A data analyst has been asked to create one table that has each employee's first name, last name, sales, and address. The sales and addresses are listed in the tables below:

Which of the following steps should the analyst take to create the table?

A.

Transpose the first name and last name in both tables. Use lookup to pull the address field from Table 2 into Table 1.

B.

Use lookup with the first name or first name to pull the address field from Table 2 into Table 1.

C.

Use the append formula in both tables for the first name and last name. Use lookup to pull the address field from Table 2 into Table 1.

D.

Create a column that concatenates the first name and last name in each table. Use concatenate and lookup to bring the address field into Table 1.

Full Access
Question # 43

Which of the following best describes an exploratory analysis?

A.

Involves the use of descriptive statistics to understand observations

B.

Involves analysis of exploring data sets for performance tracking

C.

Involves the testing of specific hypotheses

D.

Involves the use of arithmetic algebra to determine the distribution

Full Access
Question # 44

An analyst has written the following code:

SELECT *

FROM Cust_table

WHERE age > 60 AND City = "New York"

Which of the following criteria is the analyst retrieving?

A.

All customers older than age 60 in New York state

B.

All customers aged 60 and older in New York state

C.

All customers older than age 60 in New York City

D.

All customers younger than age 60 in New York City

Full Access
Question # 45

Which of the following data analysis tools increases the efficiency of data visualizations?

A.

SQL

B.

Microsoft Excel

C.

SAS

D.

RapidMiner

Full Access
Question # 46

Which of the following BEST describes standard deviation?

A.

A measure that is used to establish a relationship between two variables

B.

A measure of how data is distributed

C.

A measure of the amount of dispersion of a set of values

D.

A measure that is used to find the significant difference between variables

Full Access
Question # 47

An analyst for a small business with multiple locations is using each location’s quarterly sales reports from last year to create a single revenue report for the year. Which of the following data mining techniques should the analyst use to complete this task?

A.

Data merge

B.

Data append

C.

Data blending

D.

Data imputation

Full Access
Question # 48

Given the following data sample:

Which of the following best describes the data quality issue?

A.

Data outlier

B.

Consistent data

C.

Duplicate data

D.

Invalid data

Full Access
Question # 49

An analysts building a monthly report for production and wants to ensure the audience is aware of its once-a-month cadence. Which of the following is the MOST important to convey that information?

A.

The date of the dashboard build

B.

The data refresh date

C.

A report summary

D.

Frequently asked questions

Full Access
Question # 50

Which of the following actions should be taken when transmitting data to mitigate the chance of a data leak occurring? (Choose two.)

A.

Data identification

B.

Data processing

C.

Data Reporting

D.

Data encryption

E.

Data masking

F.

Fata removal

Full Access
Question # 51

Which of the following is a characteristic of a star schema?

A.

It has a tabular structure.

B.

It stores transactional data.

C.

It stores unstructured data.

D.

It has denormalized dimension tables.

Full Access
Question # 52

An analyst needs to join two tables of data together for analysis. All the names and cities in the first table should be joined with the corresponding ages in the second table, if applicable.

Which of the following is the correct join the analyst should complete. and how many total rows will be in one table?

A.

INNER JOIN, two rows

B.

LEFT JOIN. four rows

C.

RIGHT JOIN. five rows

D.

OUTER JOIN, seven rows

Full Access
Question # 53

Which of the following concepts should be applied if a data set with 40 fields needs to be pared down to 20 fields and contains similar data across multiple fields?

A.

Duplication

B.

Consolidation

C.

Compliance

D.

Standardization

Full Access
Question # 54

Which of the following occurs if a 90% confidence interval increases to 95%?

A.

The margin of error does not change.

B.

The interval remains the same.

C.

The interval becomes narrower.

D.

The margin of error doubles.

Full Access
Question # 55

Which of the following contains alphanumeric values?

A.

10.1Ε²

B.

13.6

C.

1347

D.

A3J7

Full Access
Question # 56

The current date is July 14, 2020. A data analyst has been asked to create a report that shows the company's year-over-year Q2 2020 sales. Which of the following reports should the analyst compare?

A.

Q2 2020 and Q4 2019

B.

YTD 2020 and YTD 2019

C.

Q2 2020 and Q2 2019

D.

Q2 2020 and Q2 2021

Full Access
Question # 57

The total values in this month's revenue report are twice as much as last month's. Which of the following most likely occurred during the ETL process?

A.

The data cleansing processes failed to execute.

B.

The database connectivity failed.

C.

The report included the previous month's data.

D.

The data normalization processes failed.

Full Access
Question # 58

Which of the following would be used to store unstructured data from different sources?

A.

A data lake

B.

A database management system

C.

A database

D.

A data warehouse

Full Access
Question # 59

An analyst is working on a project for a director. During this process. the analyst pulled the data. created summarized tables and graphs with descriptions, created a report summary, and inserted all items into a report. After writing the report, which of the following would be the most appropriate next step?

A.

Complete an audit on the data pulled for the report.

B.

Complete a check for quality in the report.

C.

Complete a review of the data and a check for consistency

D.

Complete a trend analysis to be included in the report.

Full Access
Question # 60

Which of the following would a data analyst look for first if 100% participation is needed on survey results?

A.

Missing data

B.

Invalid data

C.

Redundant data

D.

Duplicate data

Full Access
Question # 61

During data cleansing, an analyst conducts measures of central tendency on a data set. Which of the following data is the analyst attempting to identify?

A.

Duplicate

B.

Missing

C.

Outlying

D.

Invalid

Full Access
Question # 62

Five dogs have the following heights in millimeters:

300,430, 170, 470, 600

Which of the following is the standard deviation for the five dogs?

A.

147mm

B.

154mm

C.

394 mm

D.

21,704mm

Full Access
Question # 63

A data analyst reviews the following data set:

Which of the following is the range value?

A.

9

B.

10

C.

12

D.

13

Full Access
Question # 64

Given the following table of student scores (with some values that violate the allowed scoring rules), which of the following is the best reason for cleansing the data?

A.

Invalid data

B.

Redundant data

C.

Data outliers

D.

Missing data

Full Access
Question # 65

Which one of the following is a common data warehouse schema?

A.

Snowflake.

B.

Square.

C.

Spiral.

D.

Sphere.

Full Access
Question # 66

Which of the following types of analyses is best to use when tracking sales revenue against quarterly targets?

A.

Trend

B.

Performance

C.

Link

D.

Scope

Full Access
Question # 67

An e-commerce company recently tested a new website layout. The website was tested by a test group of customers, and an old website was presented to a control group. The table below shows the percentage of users in each group who made purchases on the websites:

Which of the following conclusions is accurate at a 95% confidence interval?

A.

In Germany, the increase in conversion from the new layout was not significant.

B.

In France, the increase in conversion from the new layout was not significant.

C.

In general, users who visit the new website are more likely to make a purchase.

D.

The new layout has the lowest conversion rates in the United Kingdom.

Full Access
Question # 68

Given the table below:

Which of the following variable types BEST describes the “Year” column?

A.

Numeric

B.

Date

C.

Alphanumeric

D.

Text

Full Access
Question # 69

Given the following data:

Which of the following BEST describes the data set?

A.

There is data bias.

B.

The data is incomplete.

C.

The data is inconsistent.

D.

The data is outliers.

Full Access
Question # 70

A data analyst wants to create "Income Categories" that would be calculated based on the existing variable "Income". The "Income Categories" would be as follows:

Income category 1: less than $1.

Income category 2: more than $1 and less than $20,000.

Income category 3: more than $20,001 and less than $40,000.

Income category 4: more than $40,001.

Which of the following data manipulation techniques should the data analyst use to create "Income Categories"?

A.

Data merge

B.

Derived variables

C.

Data blending

D.

Data append

Full Access
Question # 71

An analyst collected data that includes primary account numbers, expiration dates, and service codes. Which of the following data governance classifications is used to describe this data?

A.

PI I

B.

PCI

C.

PBI

D.

PHI

Full Access
Question # 72

Which of the following analysis techniques is an unsupervised data mining process?

A.

Clustering

B.

Descriptive

C.

Regression

D.

Predictive

Full Access
Question # 73

Which of the following roles is responsible for ensuring an organization's data quality, security, privacy, and regulatory compliance?

A.

Data owner.

B.

Data steward.

C.

Data custodian.

D.

Data processor.

Full Access
Question # 74

A company’s marketing department wants to do a promotional campaign next month. A data analyst on the team has been asked to perform customer segmentation, looking at how recently a customer bought the product, at what frequency, and at what value. Which of the following types of analysis would this practice be considered?

A.

Prescriptive

B.

Trend

C.

Gap

D.

Custer

Full Access
Question # 75

An analyst develops an IT document and needs to describe the technical terms used in the document. Which of the following is where the analyst should include descriptions of the technical terms?

A.

Glossary

B.

System diagram

C.

User requirements

D.

Index

Full Access
Question # 76

Given the following grocery store orders:

If a query is made to the table with the following logic:

Order_Total > 132 OR (Order Total >= 25 AND Order_Total < 74)

Which of the following is the number of orders that will be returned by the query?

A.

Four

B.

Five

C.

Six

D.

Seven

Full Access
Question # 77

Given the following report:

Which of the following components need to be added to ensure the report is point-in-time and static? (Choose two.)

A.

A control group for the phrases

B.

A summary of the KPIs

C.

Filter buttons for the status

D.

The date when the report was last accessed

E.

The time period the report covers

F.

The date on which the report was run

Full Access
Question # 78

Which of the following tools would be best to use to calculate the interquartile range, median, mean, and standard deviation of a column in a table that has 5.000.000 rows?

A.

Microsoft Excel

B.

R

C.

Snowflake

D.

SQL

Full Access
Question # 79

Given the below:

Which of the following numbers represents a Type I error?

A.

1

B.

2

C.

3

D.

4

Full Access
Question # 80

An analyst is explaining the company’s financial systems and reporting tools to a new coworker. Which of the following data quality dimensions are the most important? (Select three).

A.

Data formatting

B.

Data accuracy

C.

Data maturity

D.

Data field

E.

Data completeness

F.

Data consistency

G.

Data diversity

Full Access
Question # 81

An analyst must obtain the average daily sales for the following week:

Which of the following must the analyst perform to obtain this value?

A.

Data normalization

B.

Data append

C.

Data aggregation

D.

Data blending

Full Access
Question # 82

A development company is constructing a new unit in its apartment complex. The complex has the following floor plans:

Using the average cost per square foot of the original floor plans, which of the following should be the price of the Rose unit?

A.

$640,900

B.

$690,000

C.

$705,200

D.

$702,500

Full Access
Question # 83

A company notifies its employees that emails will be automatically moved to a cloud-based server in 180 days. Which of the following describes this concept?

A.

Data deletion

B.

Data processing

C.

Data retention

D.

Data constraints

Full Access
Question # 84

Which of the following will MOST likely be streamed live?

A.

Machine data

B.

Key-value pairs

C.

Delimited rows

D.

Flat files

Full Access
Question # 85

Which of the following is the best description of the term "data governance"?

A.

Data governance governs the development of a data visualization dashboard in an organization.

B.

Data governance is the policy that protects against data breaches by cybercriminals.

C.

Data governance is the process of analyzing, manipulating, and reporting data in an organization.

D.

Data governance is the availability, usability, integrity, and security of data in an enterprise.

Full Access
Question # 86

Exhibit.

Which of the following logical statements results in Table B?

A)

B)

C)

D)

A.

Option A

B.

Option B

C.

Option C

D.

Option D

Full Access
Question # 87

A company needs a report that provides executives an overview and regional managers with both an overview and specifics. Which of the following reporting elements will achieve these results?

A.

Observations and insights

B.

Live data feed

C.

Drill-down function

D.

Access permissions

Full Access
Question # 88

A data analyst is creating a report that will provide information about various regions, products, and time periods. Which of the following formats would be themost efficient way to deliver this report?

A.

A workbook with multiple tabs for each region

B.

A daily email with snapshots of regional summaries

C.

A static report with a different page for every filtered view

D.

A dashboard with filters at the top that the user can toggle

Full Access
Question # 89

Which of the following statistical methods requires two or more categorical variables?

A.

Simple linear regression

B.

Chi-squared test

C.

Z-test

D.

Two-sample t-test

Full Access
Question # 90

A data analyst needs to present the results of an online marketing campaign to the marketing manager. The manager wants to see the most important KPIs and measure the return on marketing investment. Which of the following should the data analyst use to BEST communicate this information to the manager?

A.

A real-time monitor that allows the manager to view performance the day the campaign was launched

B.

A sell-service dashboard that allows the manager to look at the company's annual budget performance

C.

A spreadsheet of the raw data from all marketing campaigns and channels

D.

A summary with statistics, conclusions, and recommendations from the data analyst

Full Access
Question # 91

Which of the following types of analysis is used when comparing last week's sales to the previous week's sales?

A.

Trend analysis

B.

Exploratory analysis

C.

Prescriptive analysis

D.

Link analysis

Full Access
Question # 92

An analyst has conducted a review of business questions. Which of the following should the analyst do next to conduct an analysis?

A.

Determine the data needs and review the observations.

B.

Determine the data needs and sources for analysis.

C.

Determine the data needs and schedule interviews.

D.

Determine the data needs and begin the analysis.

Full Access
Question # 93

An analyst reviews the following data:

7

3

5

2

3

7

7

10

Which of the following is the value of the mode?

A.

3

B.

5

C.

7

D.

10

Full Access
Question # 94

A data analyst must separate the column shown below into multiple columns for each component of the name:

Which of the following data manipulation techniques should the analyst perform?

A.

Imputing

B.

Transposing

C.

Parsing

D.

Concatenating

Full Access
Question # 95

A data analyst has a set of data that shows the number of gallons of oil produced each day. The company would like to know the standard deviation for the data set. The variance for the data is 36 gallons. Which of the following is the standard deviation for gallons produced?

A.

1.16

B.

6

C.

36

D.

72

Full Access
Question # 96

Which of the following data manipulation techniques should an analyst use to hide unnecessary data during analysis?

A.

Filtering

B.

Parametrization

C.

Sorting

D.

Indexing

Full Access
Question # 97

Which of the following file formats is best suited to start exploratory analysis within statistical software?

A.

CSV

B.

XLSM

C.

XML

D.

JSON

Full Access
Question # 98

A data analyst is designing a dashboard that will provide a story of sales and determine which site is providing the highest sales volume per customer The analyst must choose an appropriate chart to include in the dashboard. The following data is available:

Which of the following types of charts should be considered?

A.

Include a line chart using the site and average sales per customer.

B.

Include a pie chart using the site and sales to average sales per customer.

C.

Include a scatter chart using sales volume and average sales per customer.

D.

Include a column chart using the site and sales to average sales per customer.

Full Access
Question # 99

A data analyst is attempting to understand how ice cream consumption is affected by different attributes. such as cost, temperature. and income level. Which of the following

regression analyses should the data analyst perform to understand this relationship?

A.

Logistic

B.

Ordinary least squares

C.

Cox

D.

Polynomial

Full Access
Question # 100

A customer survey reveals 90% positive feedback. Which of the following statistical methods would be best to utilize to determine the reliability of a data set and predict how a larger sample of customers over the same time period might respond?

A.

Calculate a high variance on survey responses.

B.

Calculate the maximum range of the survey responses.

C.

Calculate a low standard deviation on survey responses.

D.

Remove any data more than 4 standard deviation from the mean.

Full Access
Question # 101

Which of the following is most likely to be used as a data-mining ETL tool?

A.

SSIS

B.

Stata

C.

SPSS

D.

Cognos

Full Access
Question # 102

Which of the following should an analyst do to best summarize the data on a data set?

A.

Filtering

B.

Aggregation

C.

Sorting

D.

Concatenation

Full Access
Question # 103

Which of the following is an example of a flat file?

A.

CSV file

B.

PDF file

C.

JSON file

D.

JPEG file

Full Access
Question # 104

An analyst reviews the following table:

Which of the following data types is represented in the values in the RefNo column?

A.

Numeric

B.

Real Number

C.

Currency

D.

Alphanumeric

Full Access
Question # 105

Which of the following techniques should an analyst use to analyze a data set to get a snapshot of basic measures of central tendency?

A.

Forecasting

B.

Trend analysis

C.

Gap analysis

D.

Descriptive statistics

Full Access
Question # 106

Which one of the following in NOT a common data integration tool?

A.

XSS

B.

ELT

C.

ETL

D.

APIs

Full Access
Question # 107

An analyst is designing a dashboard to determine which site has the highest percentage of new customers. The analyst must choose an appropriate chart to include in the dashboard. The following data is available:

Which of the following types of charts should be considered to best display the data?

A.

Include a bar chart using the site and the percentage of new customers data.

B.

Include a line chart using the site and the percentage of new customers data.

C.

Include a pie chart using the site and percentage of new custorners data.

D.

Include a scatter chart using the site and the percent of new customers data.

Full Access
Question # 108

Which of the following types of dashboards should a business intelligence engineer develop in order to provide information about failed data pipelines?

A.

Referencing

B.

Strategic

C.

Operational

D.

Technical

Full Access
Question # 109

Which of the following components need to be added to ensure the report is point-in-time and static? (Select two).

A.

A control group for the phrases

B.

A summary of the KPIs

C.

Filter buttons for the status

D.

The date when the report was last accessed

E.

The time period the report covers

F.

The date on which the report was run

Full Access
Question # 110

Which of the following is an example of a at flat file?

A.

CSV file

B.

PDF file

C.

JSON file

D.

JPEG file

Full Access
Question # 111

While reviewing survey data, a research analyst notices data is missing from all the responses to a single question. Which of the following methods would BEST address this issue?

A.

Replace missing data.

B.

Remove duplicate data.

C.

Replace redundant data.

D.

Remove invalid data.

Full Access
Question # 112

An analyst needs to create an analytics dashboard for an employee intranet site to improve the search functionality, display relevant information, and maintain an updated FAQ page. Which of the following visualizations would best represent what employees are searching for?

A.

A word cloud

B.

A histogram

C.

A pie chart

D.

A scatter plot

Full Access
Question # 113

After the daily ETL jobs are completed, the data in the reports does not appear complete, and a lot of data seems to be missing. Which of the following concepts should be used to assess and investigate further?

A.

Cross-validation

B.

Data profiling

C.

Data integrity

D.

Data consistency

Full Access
Question # 114

A Chief Executive Officer (CEO) is requesting more up-to-date sales data for improved visibility prior to month-end. An analyst must determine the frequency of a sales report that was previously distributed on an as-needed basis. Which of the following would be the most appropriate frequency for this report?

A.

Monthly

B.

Quarterly

C.

Weekly

D.

Every other month

Full Access
Question # 115

An analyst is updating a customer contacts database with information obtained from a survey of new customers. Which of the following data manipulation techniques should the analyst use?

A.

Join

B.

Append

C.

Transform

D.

Blend

Full Access
Question # 116

Taylor wants to investigate how manufacturing, marketing, and sales expenditures impact overall profitability for her company.

Which of the following systems is the most appropriate?

A.

OLTP.

B.

OLAP.

C.

Data warehouse.

D.

Data mart.

Full Access
Question # 117

A site reliability team wants to monitor the stability of their website. so they can proactively diagnose issues when they occur Which of the following deliverables would best suit their needs?

A.

A self-serve dashboard of website performance that updates in real time

B.

A weekly log report of site visits and user actions

C.

A portal that is refreshed daily and reports errors classified by type

D.

A daily summary email indicating website outages for the previous day

Full Access
Question # 118

A data analyst needs to present the results of an online marketing campaign to the marketing manager. The manager wants to see the most important KPIs and measure the return on marketing investment. Which of the following should the data analyst use to BEST communicate this information to the manager?

A.

A real-time monitor that allows the manager to view performance the day the campaign was launched

B.

A sell-service dashboard that allows the manager to look at the company’s annual budget performance

C.

A spreadsheet of the raw data from all marketing campaigns and channels

D.

A summary with statistics, conclusions, and recommendations from the data analyst

Full Access