Which of the following tools would be best to use to calculate the interquartile range, median, mean, and standard deviation of a column in a table that has 5.000.000 rows?
B for sure. R is a power tool for statistical analysis, and it can handle huge datasets like no other. Excel would probably start smoking if you tried to run those calculations on 5 million rows!
Hmm, I'd say R would be the way to go for crunching those big numbers. Excel might choke on a dataset that size, and Snowflake is more for warehousing than analysis, right? SQL could work, but R has those built-in stats functions that would make this a breeze.
Blythe
7 days agoSheridan
10 days agoErinn
16 days agoBarrie
17 days agoLeila
22 days ago