A data analyst is using a two-tailed, independent t-test to determine whether the type of stretching, dynamic or static, has any influence on a dancer's flexibility. Which of the following is the alternative hypothesis?
A business unit made the following modification to the values in a table:
Which of the following data quality dimensions was applied in this scenario?
Given the information in the following tables:
Which of the following describes merging these tables to create a master file that includes all transactions for both online and in-store sales?
Given the image below:
The data should be cleaned because of the presence of:
When analyzing the values of two variables, you decide to convert both variables so they are on a scale of 0 to 1.
What term describes this action?
An analyst is currently working on a ticket to revamp a company-wide dashboard that has been in use for five years. Which of the following should be the first step in the development process?
A sales director has requested a report for individual team members within the division be developed. The director would like the report to be shared with all team members, but individual team members should not be identifiable within the report Which of the following access requirements would support the director's needs?
Which of the following is the best reason to use database views instead of tables?
An analyst is reviewing the following data:
Car IDSpeed
123155
566436
564418
650567
546436
645638
Which of the following should the analyst include in the measures of central tendency for speed?
A data analyst is reviewing SQL code and sees a query that uses terms such as MIN, SUM, and COUNT. Which of the following types of functions best describes these terms?
Which of the following describes the method of sampling in which elements of data are selected randomly from each of the small subgroups within a population?
A data analyst is building a closed won quarter-over-quarter report for the sales team. Which of the following will be needed to complete this request?
Which of the following BEST describes the issue in which character values are mixed with integer values in a data set column?
An analyst wants to test the association between the number of doors in a car and the number of gears in the car. Which of the following is the best test to use?
An analyst has been tracking company intranet usage and has been asked to create a chat to show the most-used/most-clicked portions of a homepage that contains more than 30 links. Which of the following visualizations would BEST illustrate this information?
‘Which of the following is the BEST reason to use database views instead of tables?
A data profiling rule checks the quality of all email addresses in a database. The rule returns a value with the number of email addresses that conformed to the rule. Which of the following options describes this value?
Which of the following data manipulation techniques is an example of a logical function?
Given the following graph:
Which of the following summary statements upholds integrity in data reporting?
A salesperson who is prospecting potential clients collected the following data:
Which of the following is an issue with this data?
What subset of Structured Query Language (SQL) is used to add, remove, modify, or retrieve the information stored within a relational database?
A data analyst needs to create a data visualization that aids in un the cumulative impact of sequentially introduced values that are positive or negative. Which of the following
data visualization methods should the analyst use?
A data set for sales per month includes the following data:
Which of the following cleaning and profiling methods should be applied to the data set?
Given the customer table below:
Which of the following chart types is the most appropriate to represent the average spending of active customers vs. inactive customers?
A data analyst has been asked to create an ad-hoc sales report for the Chief Executive Officer (CEO).
Which of the following should be included in the report?
Given the following athlete workout data (with inconsistent units or formats for time/distance), which of the following best describes the data quality issue?
A gambler thinks that a coin is fair and is equally likely to turn up heads or tails when the coin is flipped. Which of the following tests should the gambler use to fest this hypothesis?
Consider the following dataset which contains information about houses that are for sale:
Which of the following string manipulation commands will combine the address and region namecolumns to create a full address?
full_address------------------------- 85 Turner St, Northern Metropolitan 25 Bloomburg St, Northern Metropolitan 5 Charles St, Northern Metropolitan 40 Federation La, Northern Metropolitan 55a Park St, Northern Metropolitan
A company wants to know how its customers interact with an e-commerce website based on clicks over items. Which of the following is the primary requirement for this report?
An analyst wants to extract data from a variety of sources and store the data in a cloud-based environment prior to cleaning. Which of the following integration techniques should the analyst use?
Which of the following types of data manipulation functions should a data analyst use to implement a YES/NO condition in a spreadsheet?
A database administrator needs to increase performance on a large dimension table. Which of the following is the best way to accomplish this task?
A data analyst has been asked to create one table that has each employee's first name, last name, sales, and address. The sales and addresses are listed in the tables below:
Which of the following steps should the analyst take to create the table?
An analyst has written the following code:
SELECT *
FROM Cust_table
WHERE age > 60 AND City = "New York"
Which of the following criteria is the analyst retrieving?
Which of the following data analysis tools increases the efficiency of data visualizations?
An analyst for a small business with multiple locations is using each location’s quarterly sales reports from last year to create a single revenue report for the year. Which of the following data mining techniques should the analyst use to complete this task?
Given the following data sample:
Which of the following best describes the data quality issue?
An analysts building a monthly report for production and wants to ensure the audience is aware of its once-a-month cadence. Which of the following is the MOST important to convey that information?
Which of the following actions should be taken when transmitting data to mitigate the chance of a data leak occurring? (Choose two.)
An analyst needs to join two tables of data together for analysis. All the names and cities in the first table should be joined with the corresponding ages in the second table, if applicable.
Which of the following is the correct join the analyst should complete. and how many total rows will be in one table?
Which of the following concepts should be applied if a data set with 40 fields needs to be pared down to 20 fields and contains similar data across multiple fields?
Which of the following occurs if a 90% confidence interval increases to 95%?
The current date is July 14, 2020. A data analyst has been asked to create a report that shows the company's year-over-year Q2 2020 sales. Which of the following reports should the analyst compare?
The total values in this month's revenue report are twice as much as last month's. Which of the following most likely occurred during the ETL process?
Which of the following would be used to store unstructured data from different sources?
An analyst is working on a project for a director. During this process. the analyst pulled the data. created summarized tables and graphs with descriptions, created a report summary, and inserted all items into a report. After writing the report, which of the following would be the most appropriate next step?
Which of the following would a data analyst look for first if 100% participation is needed on survey results?
During data cleansing, an analyst conducts measures of central tendency on a data set. Which of the following data is the analyst attempting to identify?
Five dogs have the following heights in millimeters:
300,430, 170, 470, 600
Which of the following is the standard deviation for the five dogs?
A data analyst reviews the following data set:
Which of the following is the range value?
Given the following table of student scores (with some values that violate the allowed scoring rules), which of the following is the best reason for cleansing the data?
Which of the following types of analyses is best to use when tracking sales revenue against quarterly targets?
An e-commerce company recently tested a new website layout. The website was tested by a test group of customers, and an old website was presented to a control group. The table below shows the percentage of users in each group who made purchases on the websites:
Which of the following conclusions is accurate at a 95% confidence interval?
Given the table below:
Which of the following variable types BEST describes the “Year” column?
Given the following data:
Which of the following BEST describes the data set?
A data analyst wants to create "Income Categories" that would be calculated based on the existing variable "Income". The "Income Categories" would be as follows:
Income category 1: less than $1.
Income category 2: more than $1 and less than $20,000.
Income category 3: more than $20,001 and less than $40,000.
Income category 4: more than $40,001.
Which of the following data manipulation techniques should the data analyst use to create "Income Categories"?
An analyst collected data that includes primary account numbers, expiration dates, and service codes. Which of the following data governance classifications is used to describe this data?
Which of the following analysis techniques is an unsupervised data mining process?
Which of the following roles is responsible for ensuring an organization's data quality, security, privacy, and regulatory compliance?
A company’s marketing department wants to do a promotional campaign next month. A data analyst on the team has been asked to perform customer segmentation, looking at how recently a customer bought the product, at what frequency, and at what value. Which of the following types of analysis would this practice be considered?
An analyst develops an IT document and needs to describe the technical terms used in the document. Which of the following is where the analyst should include descriptions of the technical terms?
Given the following grocery store orders:
If a query is made to the table with the following logic:
Order_Total > 132 OR (Order Total >= 25 AND Order_Total < 74)
Which of the following is the number of orders that will be returned by the query?
Given the following report:
Which of the following components need to be added to ensure the report is point-in-time and static? (Choose two.)
Which of the following tools would be best to use to calculate the interquartile range, median, mean, and standard deviation of a column in a table that has 5.000.000 rows?
Given the below:
Which of the following numbers represents a Type I error?
An analyst is explaining the company’s financial systems and reporting tools to a new coworker. Which of the following data quality dimensions are the most important? (Select three).
An analyst must obtain the average daily sales for the following week:
Which of the following must the analyst perform to obtain this value?
A development company is constructing a new unit in its apartment complex. The complex has the following floor plans:
Using the average cost per square foot of the original floor plans, which of the following should be the price of the Rose unit?
A company notifies its employees that emails will be automatically moved to a cloud-based server in 180 days. Which of the following describes this concept?
Which of the following is the best description of the term "data governance"?
Exhibit.
Which of the following logical statements results in Table B?
A)
B)
C)
D)
A company needs a report that provides executives an overview and regional managers with both an overview and specifics. Which of the following reporting elements will achieve these results?
A data analyst is creating a report that will provide information about various regions, products, and time periods. Which of the following formats would be themost efficient way to deliver this report?
Which of the following statistical methods requires two or more categorical variables?
A data analyst needs to present the results of an online marketing campaign to the marketing manager. The manager wants to see the most important KPIs and measure the return on marketing investment. Which of the following should the data analyst use to BEST communicate this information to the manager?
Which of the following types of analysis is used when comparing last week's sales to the previous week's sales?
An analyst has conducted a review of business questions. Which of the following should the analyst do next to conduct an analysis?
An analyst reviews the following data:
7
3
5
2
3
7
7
10
Which of the following is the value of the mode?
A data analyst must separate the column shown below into multiple columns for each component of the name:
Which of the following data manipulation techniques should the analyst perform?
A data analyst has a set of data that shows the number of gallons of oil produced each day. The company would like to know the standard deviation for the data set. The variance for the data is 36 gallons. Which of the following is the standard deviation for gallons produced?
Which of the following data manipulation techniques should an analyst use to hide unnecessary data during analysis?
Which of the following file formats is best suited to start exploratory analysis within statistical software?
A data analyst is designing a dashboard that will provide a story of sales and determine which site is providing the highest sales volume per customer The analyst must choose an appropriate chart to include in the dashboard. The following data is available:
Which of the following types of charts should be considered?
A data analyst is attempting to understand how ice cream consumption is affected by different attributes. such as cost, temperature. and income level. Which of the following
regression analyses should the data analyst perform to understand this relationship?
A customer survey reveals 90% positive feedback. Which of the following statistical methods would be best to utilize to determine the reliability of a data set and predict how a larger sample of customers over the same time period might respond?
Which of the following is most likely to be used as a data-mining ETL tool?
Which of the following should an analyst do to best summarize the data on a data set?
An analyst reviews the following table:
Which of the following data types is represented in the values in the RefNo column?
Which of the following techniques should an analyst use to analyze a data set to get a snapshot of basic measures of central tendency?
An analyst is designing a dashboard to determine which site has the highest percentage of new customers. The analyst must choose an appropriate chart to include in the dashboard. The following data is available:
Which of the following types of charts should be considered to best display the data?
Which of the following types of dashboards should a business intelligence engineer develop in order to provide information about failed data pipelines?
Which of the following components need to be added to ensure the report is point-in-time and static? (Select two).
While reviewing survey data, a research analyst notices data is missing from all the responses to a single question. Which of the following methods would BEST address this issue?
An analyst needs to create an analytics dashboard for an employee intranet site to improve the search functionality, display relevant information, and maintain an updated FAQ page. Which of the following visualizations would best represent what employees are searching for?
After the daily ETL jobs are completed, the data in the reports does not appear complete, and a lot of data seems to be missing. Which of the following concepts should be used to assess and investigate further?
A Chief Executive Officer (CEO) is requesting more up-to-date sales data for improved visibility prior to month-end. An analyst must determine the frequency of a sales report that was previously distributed on an as-needed basis. Which of the following would be the most appropriate frequency for this report?
An analyst is updating a customer contacts database with information obtained from a survey of new customers. Which of the following data manipulation techniques should the analyst use?
Taylor wants to investigate how manufacturing, marketing, and sales expenditures impact overall profitability for her company.
Which of the following systems is the most appropriate?
A site reliability team wants to monitor the stability of their website. so they can proactively diagnose issues when they occur Which of the following deliverables would best suit their needs?
A data analyst needs to present the results of an online marketing campaign to the marketing manager. The manager wants to see the most important KPIs and measure the return on marketing investment. Which of the following should the data analyst use to BEST communicate this information to the manager?