Microsoft Exam DP-500 Topic 2 Question 45 Discussion

Actual exam question for Microsoft's DP-500 exam

Question #: 45
Topic #: 2

You use Azure Synapse Analytics and Apache Spark notebooks to You need to use PySpark to gain access to the visual libraries. Which Python libraries should you use?

ASeaborn only

BMatplotlib and Seaborn

CMatplotlib only

DMatplotlib and TensorFlow

ETensorFlow only

FSeaborn and TensorFlow

Show Suggested Answer

Suggested Answer: B

pandas.DataFrame.corr computes pairwise correlation of columns, excluding NA/null values.

Incorrect:

* freqItems

pyspark.sql.DataFrame.freqItems

Finding frequent items for columns, possibly with false positives. Using the frequent element count algorithm described in https://doi.org/10.1145/762471.762473, proposed by Karp, Schenker, and Papadimitriou.'

* summary is used for index.

* There is no panda method for rollup. Rollup would not be correct anyway.

by Susy at Jul 07, 2024, 04:01 AM

Limited Time Offer

25%

Off

Get Premium DP-500 Questions as Interactive Web-Based Practice Test or PDF

Contribute your Thoughts:

Submit Cancel

Roselle

11 months ago

I believe freqItems is used for finding frequent items, not data distribution statistics. So, D) describe is the correct answer.

upvoted 0 times

...

Vonda

11 months ago

I'm not sure, but I think A) freqItems might also be used for data distribution statistics.

upvoted 0 times

...

Huey

11 months ago

The 'describe' method is the way to go! It's like a magic trick - you wave your DataFrame at it, and *poof*, you've got a beautiful table of distribution stats. Saves you from having to do all that number-crunching yourself.

upvoted 0 times

...

Rosendo

11 months ago

Ah, the 'describe' method - the data analyst's best friend! It's like having a personal genie that can summarize your data in a snap. Beats trying to do it all by hand, that's for sure.

upvoted 0 times

Johnathon

9 months ago

'describe' is my go-to method for getting a quick summary of the DataFrame.

upvoted 0 times

...

Arminda

9 months ago

I prefer using 'describe' as well, it gives a quick snapshot of the data distribution.

upvoted 0 times

...

Nina

9 months ago

I agree, 'describe' is definitely a time-saver when it comes to getting an overview of the data.

upvoted 0 times

...

Diane

9 months ago

D) describe

upvoted 0 times

...

Lezlie

10 months ago

Yes, 'describe' is definitely the way to go. It gives you all the key statistics you need at a glance.

upvoted 0 times

...

Gilbert

10 months ago

D) describe

upvoted 0 times

...

Jaime

10 months ago

C) sample

upvoted 0 times

...

Amber

10 months ago

B) corr

upvoted 0 times

...

Devorah

11 months ago

A) freqItems

upvoted 0 times

...

Whitney

11 months ago

I agree with Alecia, describe method gives statistical summary of the DataFrame.

upvoted 0 times

...

Lourdes

11 months ago

Definitely 'describe'! It's the perfect tool for getting a quick overview of your data. Plus, it's way easier than trying to do all that manually. Who's got time for that?

upvoted 0 times

Nadine

11 months ago

Agreed, it's definitely the easiest option.

upvoted 0 times

...

Glory

11 months ago

I think 'describe' is the way to go.

upvoted 0 times

...

Alecia

12 months ago

I think the answer is D) describe.

upvoted 0 times

...

Pamella

12 months ago

Hmm, I think the 'describe' method is the way to go. It's like the Swiss Army knife of data analysis - it gives you a nice summary of the distribution, including measures like mean, standard deviation, and percentiles.

upvoted 0 times