New Year Sale - Special 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: 70dumps

Databricks-Machine-Learning-Professional Questions and Answers

Question # 6

A data scientist is utilizing MLflow to track their machine learning experiments. After completing a series of runs for the experiment with experiment ID exp_id, the data scientist wants to programmatically work with the experiment run data in a Spark DataFrame. They have an active MLflow Client client and an active Spark session spark.

Which of the following lines of code can be used to obtain run-level results for exp_id in a Spark DataFrame?

A.

client.list_run_infos(exp_id)

B.

spark.read.format("delta").load(exp_id)

C.

There is no way to programmatically return row-level results from an MLflow Experiment.

D.

mlflow.search_runs(exp_id)

E.

spark.read.format("mlflow-experiment").load(exp_id)

Full Access
Question # 7

A machine learning engineer wants to move their model versionmodel_versionfor the MLflow Model Registry modelmodelfrom the Staging stage to the Production stage using MLflow Clientclient.

Which of the following code blocks can they use to accomplish the task?

A)

B)

C)

D)

E)

A.

Option A

B.

Option B

C.

Option C

D.

Option D

E.

option E

Full Access
Question # 8

A machine learning engineer and data scientist are working together to convert a batch deployment to an always-on streaming deployment. The machine learning engineer has expressed that rigorous data tests must be put in place as a part of their conversion to account for potential changes in data formats.

Which of the following describes why these types of data type tests and checks are particularly important for streaming deployments?

A.

Because the streaming deployment is always on, all types of data must be handled without producing an error

B.

All of these statements

C.

Because the streaming deployment is always on, there is no practitioner to debug poor model performance

D.

Because the streamingdeployment is always on, there is a need to confirm that the deployment can autoscale

E.

None of these statements

Full Access
Question # 9

Which of the following is a simple statistic to monitor for categorical feature drift?

A.

Mode

B.

None of these

C.

Mode, number of unique values, and percentage of missing values

D.

Percentage of missing values

E.

Number of unique values

Full Access
Question # 10

Which of the following MLflow operations can be used to delete a model from the MLflow Model Registry?

A.

client.transition_model_version_stage

B.

client.delete_model_version

C.

client.update_registered_model

D.

client.delete_model

E.

client.delete_registered_model

Full Access
Question # 11

A machine learning engineer is monitoring categorical input variables for a production machine learning application. The engineer believes that missing values are becoming more prevalent in more recent data for a particular value in one of the categorical input variables.

Which of the following tools can the machine learning engineer use to assess their theory?

A.

Kolmogorov-Smirnov (KS) test

B.

One-way Chi-squared Test

C.

Two-way Chi-squared Test

D.

Jenson-Shannon distance

E.

None of these

Full Access
Question # 12

A machine learning engineer is in the process of implementing a concept drift monitoring solution. They are planning to use the following steps:

1. Deploy a model to production and compute predicted values

2. Obtain the observed (actual) label values

3. _____

4. Run a statistical test to determine if there are changes over time

Which of the following should be completed as Step #3?

A.

Obtain the observed values (actual) feature values

B.

Measure the latency of the prediction time

C.

Retrain the model

D.

None of these should be completed as Step #3

E.

Compute the evaluation metric using the observed and predicted values

Full Access
Question # 13

A data scientist has computed updated feature values for all primary key values stored in the Feature Store table features. In addition, feature values for some new primary key values have also been computed. The updated feature values are stored in the DataFrame features_df. They want to replace all data in features with the newly computed data.

Which of the following code blocks can they use to perform this task using the Feature Store Client fs?

A)

B)

C)

D)

E)

A.

Option A

B.

Option B

C.

Option C

D.

Option D

E.

Option E

Full Access
Question # 14

A machine learning engineer needs to deliver predictions of a machine learning model in real-time. However, the feature values needed for computing the predictions are available one week before the query time.

Which of the following is a benefit of using a batch serving deployment in this scenario rather than a real-time serving deployment where predictions are computed at query time?

A.

Batch servinghas built-in capabilities in Databricks Machine Learning

B.

There is no advantage to using batch serving deployments over real-time serving deployments

C.

Computing predictions in real-time provides more up-to-date results

D.

Testing is not possible in real-time serving deployments

E.

Querying stored predictions can be faster than computing predictions in real-time

Full Access
Question # 15

A data scientist has developed a scikit-learn modelsklearn_modeland they want to log the model using MLflow.

They write the following incomplete code block:

Which of the following lines of code can be used to fill in the blank so the code block can successfully complete the task?

A.

mlflow.spark.track_model(sklearn_model, "model")

B.

mlflow.sklearn.log_model(sklearn_model, "model")

C.

mlflow.spark.log_model(sklearn_model, "model")

D.

mlflow.sklearn.load_model("model")

E.

mlflow.sklearn.track_model(sklearn_model, "model")

Full Access
Question # 16

A machine learning engineer wants to move their model versionmodel_versionfor the MLflow Model Registry modelmodelfrom the Staging stage to the Production stage using MLflow Clientclient. At the same time, they would like to archive any model versions that are already in the Production stage.

Which of the following code blocks can they use to accomplish the task?

A)

B)

C)

D)

A.

Option A

B.

Option B

C.

Option C

D.

Option D

Full Access
Question # 17

A data scientist has developed a model to predict ice cream sales using the expected temperature and expected number of hours of sun in the day. However, the expected temperature is dropping beneath the range of the input variable on which the model was trained.

Which of the following types of drift is present in the above scenario?

A.

Label drift

B.

None of these

C.

Concept drift

D.

Prediction drift

E.

Feature drift

Full Access
Question # 18

In a continuous integration, continuous deployment (CI/CD) process for machine learning pipelines, which of the following events commonly triggers the execution of automated testing?

A.

The launch of a new cost-efficient SQL endpoint

B.

CI/CD pipelines are not needed for machine learning pipelines

C.

The arrival of a new feature table in the Feature Store

D.

The launch of a new cost-efficient job cluster

E.

The arrival of a new model version in the MLflow Model Registry

Full Access