Week 3 - Working with data in R - Shuffle Q/A 3 -

Week 3 – Working with data in R – Shuffle Q/A 3

Post author:Devendra Kumar
Post published:
Post comments:0 Comments

37. You are working with the penguins dataset and want to understand the year of data collection for all combinations of species, island, and sex. You write the following code:

penguins %>%

drop_na() %>%

group_by(species) %>%

summarize(min = min(year), max = max(year))

When you run the code in the code box, how many different groups are returned by this code chunk?

Answers

3

10
2

6

38. You are working with the ToothGrowth dataset. You want to use the glimpse() function to get a quick summary of the dataset. Write the code chunk that will give you this summary.

How many different data types are used for the column data types?

Answers

2
3

60
1

39. A data analyst is working with a data frame named customers. It has separate columns for area code (area_code) and phone number (phone_num). The analyst wants to combine the two columns into a single column called phone_number, with the area code and phone number separated by a hyphen. What code chunk lets the analyst create the phone_numbercolumn?

Answers

unite(customers, “phone_number”, area_code, sep=”-”)

unite(customers, “phone_number”, area_code, phone_num, sep=”-”)
unite(customers, “phone_number”, area_code, phone_num)
unite(customers, area_code, phone_num, sep=”-”)

40. You are compiling an analysis of the average monthly costs for your company. What summary statistic function should you use to calculate the average?

Answers

mean()
max()
cor()

min()

41. A data analyst is studying weather data. They write the following code chunk:

bias(actual_temp, predicted_temp)

What will this code chunk calculate?

Answers

The average difference between the actual and predicted values
The maximum difference between the actual and predicted values

The total average of the values
The minimum difference between the actual and predicted values

42. A data analyst is working with a large data frame. It contains so many columns that they don’t all fit on the screen at once. The analyst wants a quick list of all of the column names to get a better idea of what is in their data. What function should they use?

Answers

str()

mutate()
head()
colnames()

43. A data analyst is using the unite() function to combine two columns into a single column. What does the sep parameter of the unite() function represent?

Answers

The strings to place between each column
The vector of columns to join into the final column
The data frame that is the target of the operation

The name of the final column formed from the original columns

44. A data analyst is checking a script for one of their peers. They want to learn more about a specific data frame. What function(s) will allow them to see a subset of data values in the data frame? Select all that apply.

Answers

library()
colnames()

head()
str()

45. A data analyst is working with the penguins dataset. The variable island represents the island on which the sample was collected. The analyst wants to create a data frame that excludes records from the island named “Torgersen”. What code chunk will allow them to create this data frame?

Answers

penguins %>% filter(island == “Torgersen”)

penguins %>% filter(island = “Torgersen”)
penguins %>% filter(island <> “Torgersen”)
penguins %>% filter(island != “Torgersen”)

Devendra Kumar

Project Management Apprentice at Google

Leave a Reply Cancel reply