New Year Special 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: Board70

Microsoft DP-203 Based on Real Exam Environment

Page: 5 / 12
Question 20

You are designing an Azure Databricks table. The table will ingest an average of 20 million streaming events per day.

You need to persist the events in the table for use in incremental load pipeline jobs in Azure Databricks. The solution must minimize storage costs and incremental load times.

What should you include in the solution?

Options:

A.

Partition by DateTime fields.

B.

Sink to Azure Queue storage.

C.

Include a watermark column.

D.

Use a JSON format for physical data storage.

Question 21

You are creating a new notebook in Azure Databricks that will support R as the primary language but will also support Scale and SOL Which switch should you use to switch between languages?

Options:

A.

@

B.

%

C.

\\()

D.

\\()

Question 22

You have an Azure Stream Analytics job that receives clickstream data from an Azure event hub.

You need to define a query in the Stream Analytics job. The query must meet the following requirements:

    Count the number of clicks within each 10-second window based on the country of a visitor.

    Ensure that each click is NOT counted more than once.

How should you define the Query?

Options:

A.

SELECT Country, Avg(*) AS Average

FROM ClickStream TIMESTAMP BY CreatedAt

GROUP BY Country, SlidingWindow(second, 10)

B.

SELECT Country, Count(*) AS Count

FROM ClickStream TIMESTAMP BY CreatedAt

GROUP BY Country, TumblingWindow(second, 10)

C.

SELECT Country, Avg(*) AS Average

FROM ClickStream TIMESTAMP BY CreatedAt

GROUP BY Country, HoppingWindow(second, 10, 2)

D.

SELECT Country, Count(*) AS Count

FROM ClickStream TIMESTAMP BY CreatedAt

GROUP BY Country, SessionWindow(second, 5, 10)

Question 23

Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.

After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.

You are designing an Azure Stream Analytics solution that will analyze Twitter data.

You need to count the tweets in each 10-second window. The solution must ensure that each tweet is counted only once.

Solution: You use a session window that uses a timeout size of 10 seconds.

Does this meet the goal?

Options:

A.

Yes

B.

No

Page: 5 / 12
Exam Code: DP-203
Exam Name: Data Engineering on Microsoft Azure
Last Update: Dec 22, 2024
Questions: 347
DP-203 pdf

DP-203 PDF

$28.5  $94.99
DP-203 Engine

DP-203 Testing Engine

$33  $109.99
DP-203 PDF + Engine

DP-203 PDF + Testing Engine

$43.5  $144.99