Weekend Special 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: Board70

Databricks-Machine-Learning-Professional Exam Dumps - Databricks ML Data Scientist Questions and Answers

Question # 14

A machine learning engineer has developed a model and registered it using the FeatureStoreClient fs. The model has model URI model_uri. The engineer now needs to perform batch inference on customer-level Spark DataFrame spark_df, but it is missing a few of the static features that were used when training the model. The customer_id column is the primary key of spark_df and the training set used when training and logging the model.

Which of the following code blocks can be used to compute predictions for spark_df when the missing feature values can be found in the Feature Store by searching for features by customer_id?

Options:

A.

df = fs.get_missing_features(spark_df, model_uri)

fs.score_model(model_uri, df)

B.

fs.score_model(model_uri, spark_df)

C.

df = fs.get_missing_features(spark_df, model_uri)

fs.score_batch(model_uri, df)

df = fs.get_missing_features(spark_df)

D.

fs.score_batch(model_uri, df)

E.

fs.score_batch(model_uri, spark_df)

Buy Now
Question # 15

A machine learning engineer needs to select a deployment strategy for a new machine learning application. The feature values are not available until the time of delivery, and results are needed exceedingly fast for one record at a time.

Which of the following deployment strategies can be used to meet these requirements?

Options:

A.

Edge/on-device

B.

Streaming

C.

None of these strategies will meet the requirements.

D.

Batch

E.

Real-time

Buy Now
Question # 16

A machine learning engineer is migrating a machine learning pipeline to use Databricks Machine Learning. They have programmatically identified the best run from an MLflow Experiment and stored its URI in themodel_urivariable and its Run ID in therun_idvariable. They have also determined that the model was logged with the name"model". Now, the machine learning engineer wants to register that model in the MLflow Model Registry with the name"best_model".

Which of the following lines of code can they use to register the model to the MLflow Model Registry?

Options:

A.

mlflow.register_model(model_uri, "best_model")

B.

mlflow.register_model(run_id, "best_model")

C.

mlflow.register_model(f"runs:/{run_id}/best_model", "model")

D.

mlflow.register_model(model_uri, "model")

E.

mlflow.register_model(f"runs:/{run_id}/model")

Buy Now
Question # 17

A machine learning engineer is using the following code block as part of a batch deployment pipeline:

Which of the following changes needs to be made so this code block will work when theinferencetable is a stream source?

Options:

A.

Replace "inference" with the path to the location of the Delta table

B.

Replace schema(schema) with option("maxFilesPerTriqqer", 1}

C.

Replace spark.read with spark.readStream

D.

Replace formatfdelta") with format("stream")

E.

Replace predict with a stream-friendly prediction function

Buy Now
Question # 18

Which of the following lists all of the model stages are available in the MLflow Model Registry?

Options:

A.

Development. Staging. Production

B.

None. Staging. Production

C.

Staging. Production. Archived

D.

None. Staging. Production. Archived

E.

Development. Staging. Production. Archived

Buy Now
Question # 19

Which of the following is a reason for using Jensen-Shannon (JS) distance over a Kolmogorov-Smirnov (KS) test for numeric feature drift detection?

Options:

A.

All of these reasons

B.

JS is not normalized or smoothed

C.

None of these reasons

D.

JS is more robust when working with large datasets

E.

JS does not require any manual threshold or cutoff determinations

Buy Now
Question # 20

A machine learning engineer wants to programmatically create a new Databricks Job whose schedule depends on the result of some automated tests in a machine learning pipeline.

Which of the following Databricks tools can be used to programmatically create the Job?

Options:

A.

MLflow APIs

B.

AutoML APIs

C.

MLflow Client

D.

Jobs cannot be created programmatically

E.

Databricks REST APIs

Buy Now
Question # 21

Which of the following describes the purpose of the context parameter in the predict method of Python models for MLflow?

Options:

A.

The context parameter allows the user to specify which version of the registered MLflowModel should be used based on the given application's current scenario

B.

The context parameter allows the user to document the performance of a model after it has been deployed

C.

The context parameter allows the user to include relevant details of the business case to allow downstream users to understand the purpose of the model

D.

The context parameter allows the user to provide the model with completely custom if-else logic for the given application's current scenario

E.

The context parameter allows the user to provide the model access to objects like preprocessing models or custom configuration files

Buy Now
Exam Name: Databricks Certified Machine Learning Professional
Last Update: Feb 23, 2025
Questions: 60
Databricks-Machine-Learning-Professional pdf

Databricks-Machine-Learning-Professional PDF

$25.5  $84.99
Databricks-Machine-Learning-Professional Engine

Databricks-Machine-Learning-Professional Testing Engine

$28.5  $94.99
Databricks-Machine-Learning-Professional PDF + Engine

Databricks-Machine-Learning-Professional PDF + Testing Engine

$40.5  $134.99