Databricks-Machine-Learning-Professional Questions and Answers

Question # 6

A data scientist is utilizing MLflow to track their machine learning experiments. After completing a series of runs for the experiment with experiment ID exp_id, the data scientist wants to programmatically work with the experiment run data in a Spark DataFrame. They have an active MLflow Client client and an active Spark session spark.

Which of the following lines of code can be used to obtain run-level results for exp_id in a Spark DataFrame?

client.list_run_infos(exp_id)

spark.read.format("delta").load(exp_id)

There is no way to programmatically return row-level results from an MLflow Experiment.

mlflow.search_runs(exp_id)

spark.read.format("mlflow-experiment").load(exp_id)

Full Access

Question # 7

A machine learning engineer wants to move their model versionmodel_versionfor the MLflow Model Registry modelmodelfrom the Staging stage to the Production stage using MLflow Clientclient.

Which of the following code blocks can they use to accomplish the task?

Option A

Option B

Option C

Option D

option E

Full Access

Question # 8

A machine learning engineer and data scientist are working together to convert a batch deployment to an always-on streaming deployment. The machine learning engineer has expressed that rigorous data tests must be put in place as a part of their conversion to account for potential changes in data formats.

Which of the following describes why these types of data type tests and checks are particularly important for streaming deployments?

Because the streaming deployment is always on, all types of data must be handled without producing an error

All of these statements

Because the streaming deployment is always on, there is no practitioner to debug poor model performance

Because the streamingdeployment is always on, there is a need to confirm that the deployment can autoscale

None of these statements

Full Access

Question # 9

Which of the following is a simple statistic to monitor for categorical feature drift?

Mode

None of these

Mode, number of unique values, and percentage of missing values

Percentage of missing values

Number of unique values

Full Access

Question # 10

Which of the following MLflow operations can be used to delete a model from the MLflow Model Registry?

client.transition_model_version_stage

client.delete_model_version

client.update_registered_model

client.delete_model

client.delete_registered_model

Full Access

Question # 11

A machine learning engineer is monitoring categorical input variables for a production machine learning application. The engineer believes that missing values are becoming more prevalent in more recent data for a particular value in one of the categorical input variables.

Which of the following tools can the machine learning engineer use to assess their theory?

Kolmogorov-Smirnov (KS) test

One-way Chi-squared Test

Two-way Chi-squared Test

Jenson-Shannon distance

None of these

Full Access

Question # 12

A machine learning engineer is in the process of implementing a concept drift monitoring solution. They are planning to use the following steps:

1. Deploy a model to production and compute predicted values

2. Obtain the observed (actual) label values

3. _____

4. Run a statistical test to determine if there are changes over time

Which of the following should be completed as Step #3?

Obtain the observed values (actual) feature values

Measure the latency of the prediction time

Retrain the model

None of these should be completed as Step #3

Compute the evaluation metric using the observed and predicted values

Full Access

Question # 13

A data scientist has computed updated feature values for all primary key values stored in the Feature Store table features. In addition, feature values for some new primary key values have also been computed. The updated feature values are stored in the DataFrame features_df. They want to replace all data in features with the newly computed data.

Which of the following code blocks can they use to perform this task using the Feature Store Client fs?

Option A

Option B

Option C

Option D

Option E

Full Access

Question # 14

A machine learning engineer needs to deliver predictions of a machine learning model in real-time. However, the feature values needed for computing the predictions are available one week before the query time.

Which of the following is a benefit of using a batch serving deployment in this scenario rather than a real-time serving deployment where predictions are computed at query time?

Batch servinghas built-in capabilities in Databricks Machine Learning

There is no advantage to using batch serving deployments over real-time serving deployments

Computing predictions in real-time provides more up-to-date results

Testing is not possible in real-time serving deployments

Querying stored predictions can be faster than computing predictions in real-time

Full Access

Question # 15

A data scientist has developed a scikit-learn modelsklearn_modeland they want to log the model using MLflow.

They write the following incomplete code block:

Which of the following lines of code can be used to fill in the blank so the code block can successfully complete the task?

mlflow.spark.track_model(sklearn_model, "model")

mlflow.sklearn.log_model(sklearn_model, "model")

mlflow.spark.log_model(sklearn_model, "model")

mlflow.sklearn.load_model("model")

mlflow.sklearn.track_model(sklearn_model, "model")

Full Access

Question # 16

A machine learning engineer wants to move their model versionmodel_versionfor the MLflow Model Registry modelmodelfrom the Staging stage to the Production stage using MLflow Clientclient. At the same time, they would like to archive any model versions that are already in the Production stage.

Which of the following code blocks can they use to accomplish the task?

Option A

Option B

Option C

Option D

Full Access

Question # 17

A data scientist has developed a model to predict ice cream sales using the expected temperature and expected number of hours of sun in the day. However, the expected temperature is dropping beneath the range of the input variable on which the model was trained.

Which of the following types of drift is present in the above scenario?

Label drift

None of these

Concept drift

Prediction drift

Feature drift

Full Access

Question # 18

In a continuous integration, continuous deployment (CI/CD) process for machine learning pipelines, which of the following events commonly triggers the execution of automated testing?

The launch of a new cost-efficient SQL endpoint

CI/CD pipelines are not needed for machine learning pipelines

The arrival of a new feature table in the Feature Store

The launch of a new cost-efficient job cluster

The arrival of a new model version in the MLflow Model Registry

Full Access

Summer Sale - Special Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: dpt65

DumpsTool Header

dumpstool logo

Databricks-Machine-Learning-Professional Questions and Answers

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Quick Links

Why Us

Updated Exams

Site Secure

Footer