Winter Special Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: bigdisc65

Free Databricks-Machine-Learning-Associate Questions Attempt

Page: 4 / 5
Question 16

A data scientist is attempting to tune a logistic regression model logistic using scikit-learn. They want to specify a search space for two hyperparameters and let the tuning process randomly select values for each evaluation.

They attempt to run the following code block, but it does not accomplish the desired task:

Which of the following changes can the data scientist make to accomplish the task?

Options:

A.

Replace the GridSearchCV operation with RandomizedSearchCV

B.

Replace the GridSearchCV operation with cross_validate

C.

Replace the GridSearchCV operation with ParameterGrid

D.

Replace the random_state=0 argument with random_state=1

E.

Replace the penalty= ['12', '11'] argument with penalty=uniform ('12', '11')

Question 17

A machine learning engineer wants to parallelize the training of group-specific models using the Pandas Function API. They have developed thetrain_modelfunction, and they want to apply it to each group of DataFramedf.

They have written the following incomplete code block:

Which of the following pieces of code can be used to fill in the above blank to complete the task?

Options:

A.

applyInPandas

B.

mapInPandas

C.

predict

D.

train_model

E.

groupedApplyIn

Question 18

A data scientist is wanting to explore the Spark DataFrame spark_df. The data scientist wants visual histograms displaying the distribution of numeric features to be included in the exploration.

Which of the following lines of code can the data scientist run to accomplish the task?

Options:

A.

spark_df.describe()

B.

dbutils.data(spark_df).summarize()

C.

This task cannot be accomplished in a single line of code.

D.

spark_df.summary()

E.

dbutils.data.summarize (spark_df)

Question 19

Which of the Spark operations can be used to randomly split a Spark DataFrame into a training DataFrame and a test DataFrame for downstream use?

Options:

A.

TrainValidationSplit

B.

DataFrame.where

C.

CrossValidator

D.

TrainValidationSplitModel

E.

DataFrame.randomSplit

Page: 4 / 5
Exam Name: Databricks Certified Machine Learning Associate Exam
Last Update: Nov 21, 2024
Questions: 74
Databricks-Machine-Learning-Associate pdf

Databricks-Machine-Learning-Associate PDF

$28  $80
Databricks-Machine-Learning-Associate Engine

Databricks-Machine-Learning-Associate Testing Engine

$33.25  $95
Databricks-Machine-Learning-Associate PDF + Engine

Databricks-Machine-Learning-Associate PDF + Testing Engine

$45.5  $130