Big Cyber Monday Sale - Special 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: 70dumps

AIP-210 Questions and Answers

Question # 6

Which of the following are true about the transform-design pattern for a machine learning pipeline? (Select three.)

It aims to separate inputs from features.

A.

It encapsulates the processing steps of ML pipelines.

B.

It ensures reproducibility.

C.

It represents steps in the pipeline with a directed acyclic graph (DAG).

D.

It seeks to isolate individual steps of ML pipelines.

E.

It transforms the output data after production.

Full Access
Question # 7

Which of the following models are text vectorization methods? (Select two.)

A.

Lemmatization

B.

PCA

C.

Skip-gram

D.

TF-IDF

E.

Tokenization

F.

t-SNE

Full Access
Question # 8

Which of the following pieces of AI technology provides the ability to create fake videos?

A.

Generative adversarial networks (GAN)

B.

Long short-term memory (LSTM) networks

C.

Recurrent neural networks (RNN)

D.

Support-vector machines (SVM)

Full Access
Question # 9

What is the open framework designed to help detect, respond to, and remediate threats in ML systems?

A.

Adversarial ML Threat Matrix

B.

MITRE ATTandCK® Matrix

C.

OWASP Threat and Safeguard Matrix

D.

Threat Susceptibility Matrix

Full Access
Question # 10

You train a neural network model with two layers, each layer having four nodes, and realize that the model is underfit. Which of the actions below will NOT work to fix this underfitting?

A.

Add features to training data

B.

Get more training data

C.

Increase the complexity of the model

D.

Train the model for more epochs

Full Access
Question # 11

Which of the following regressions will help when there is the existence of near-linear relationships among the independent variables (collinearity)?

A.

Clustering

B.

Linear regression

C.

Polynomial regression

D.

Ridge regression

Full Access
Question # 12

Which of the following approaches is best if a limited portion of your training data is labeled?

A.

Dimensionality reduction

B.

Probabilistic clustering

C.

Reinforcement learning

D.

Semi-supervised learning

Full Access
Question # 13

Why do data skews happen in the ML pipeline?

A.

Test and evaluation data are designed incorrectly.

B.

There Is a mismatch between live input data and offline data.

C.

There is a mismatch between live output data and offline data.

D.

There is insufficient training data for evaluation.

Full Access
Question # 14

Which two of the following decrease technical debt in ML systems? (Select two.)

A.

Boundary erosion

B.

Design anti-patterns

C.

Documentation readability

D.

Model complexity

E.

Refactoring

Full Access
Question # 15

Which of the following is NOT an activation function?

A.

Additive

B.

Hyperbolic tangent

C.

ReLU

D.

Sigmoid

Full Access
Question # 16

Which of the following metrics is being captured when performing principal component analysis?

A.

Kurtosis

B.

Missingness

C.

Skewness

D.

Variance

Full Access
Question # 17

For each of the last 10 years, your team has been collecting data from a group of subjects, including their age and numerous biomarkers collected from blood samples. You are tasked with creating a prediction model of age using the biomarkers as input. You start by performing a linear regression using all of the data over the 10-year period, with age as the dependent variable and the biomarkers as predictors.

Which assumption of linear regression is being violated?

A.

Equality of variance (Homoscedastidty)

B.

Independence

C.

Linearity

D.

Normality

Full Access
Question # 18

Which of the following unsupervised learning models can a bank use for fraud detection?

A.

Anomaly detection

B.

DB5CAN

C.

Hierarchical clustering

D.

k-means

Full Access
Question # 19

A company is developing a merchandise sales application The product team uses training data to teach the AI model predicting sales, and discovers emergent bias. What caused the biased results?

A.

The AI model was trained in winter and applied in summer.

B.

The application was migrated from on-premise to a public cloud.

C.

The team set flawed expectations when training the model.

D.

The training data used was inaccurate.

Full Access
Question # 20

A dataset can contain a range of values that depict a certain characteristic, such as grades on tests in a class during the semester. A specific student has so far received the following grades: 76,81, 78, 87, 75, and 72. There is one final test in the semester. What minimum grade would the student need to achieve on the last test to get an 80% average?

A.

82

B.

89

C.

91

D.

94

Full Access
Question # 21

Which of the following statements are true regarding highly interpretable models? (Select two.)

A.

They are usually binary classifiers.

B.

They are usually easier to explain to business stakeholders.

C.

They are usually referred to as "black box" models.

D.

They are usually very good at solving non-linear problems.

E.

They usually compromise on model accuracy for the sake of interpretability.

Full Access
Question # 22

Which of the following describes a benefit of machine learning for solving business problems?

A.

Increasing the quantity of original data

B.

Increasing the speed of analysis

C.

Improving the constraint of the problem

D.

Improving the quality of original data

Full Access
Question # 23

Which of the following best describes distributed artificial intelligence?

A.

It does not require hyperparemeter tuning because the distributed nature accounts for the bias.

B.

It intelligently pre-distributes the weight of starting a neural network.

C.

It relies on a distributed system that performs robust computations across a network of unreliable nodes.

D.

It uses a centralized system to speak to decentralized nodes.

Full Access
Question # 24

A market research team has ratings from patients who have a chronic disease, on several functional, physical, emotional, and professional needs that stay unmet with the current therapy. The dataset also captures ratings on how the disease affects their day-to-day activities.

A pharmaceutical company is introducing a new therapy to cure the disease and would like to design their marketing campaign such that different groups of patients are targeted with different ads. These groups should ideally consist of patients with similar unmet needs.

Which of the following algorithms should the market research team use to obtain these groups of patients?

A.

k-means clustering

B.

k-nearest neighbors

C.

Logistic regression

D.

Naive-Bayes

Full Access
Question # 25

Which of the following can take a question in natural language and return a precise answer to the question?

A.

Databricks

B.

IBM Watson

C.

Pandas

D.

Spark ML

Full Access
Question # 26

Your dependent variable Y is a count, ranging from 0 to infinity. Because Y is approximately log-normally distributed, you decide to log-transform the data prior to performing a linear regression.

What should you do before log-transforming Y?

A.

Add 1 to all of the Y values.

B.

Divide all the Y values by the standard deviation of Y.

C.

Explore the data for outliers.

D.

Subtract the mean of Y from all the Y values.

Full Access
Question # 27

A product manager is designing an Artificial Intelligence (AI) solution and wants to do so responsibly, evaluating both positive and negative outcomes.

The team creates a shared taxonomy of potential negative impacts and conducts an assessment along vectors such as severity, impact, frequency, and likelihood.

Which modeling technique does this team use?

A.

Business

B.

Harms

C.

Process

D.

Threat

Full Access