Module 5: *Course challenge*

Looking for ‘foundations data data everywhere course challenge answers‘?

In this post, I provide accurate answers and detailed explanations for Module 5: *Course challenge*
 of Course 1: Foundations: Data, Data, Everywhere Google Data Analytics Professional Certificate.

Whether you’re preparing for quizzes or brushing up on your knowledge, these insights will help you master the concepts effectively. Let’s dive into the correct answers and detailed explanations for each question.

*Course challenge*

Graded Quiz

1. Scenario 1, question 1-5

You’ve just started a new job as a data analyst. You’re working for a midsized pharmacy chain with 38 stores in the American Southwest. Your supervisor shares a new data analysis project with you.She explains that the pharmacy is considering discontinuing a bubble bath product called Splashtastic. Your supervisor wants you to analyze sales data and determine what percentage of each store’s total daily sales come from that product. Then, you’ll present your findings to leadership.You know that it's important to follow each step of the data analysis process: ask, prepare, process, analyze, share, and act. So, you begin by defining the problem and making sure you fully understand stakeholder expectations.One of the questions you ask is where to find the dataset you’ll be working with. Your supervisor explains that the company database has all the information you need. Next, you continue to the prepare step. You access the database and write a query to retrieve data about Splashtastic. You notice that there are only 38 rows of data, representing the company’s 38 stores. In addition, your dataset contains five columns: Store Number, Average Daily Customers, Average Daily Splashtastic Sales (Units), Average Daily Splashtastic Sales (Dollars), and Average Total Daily Sales (All Products).

You know that spreadsheets work well for processing and analyzing a small dataset, like the one you’re using. To get the data from the database into a spreadsheet, what should you do?

  • Email a copy of the dataset to your company email address.
  • Use Tableau to convert the data into a spreadsheet.
  • Copy and paste the data into a spreadsheet.
  • Download the data as a .CSV file, then import it into a spreadsheet. ✅

2. Scenario 1 continued

You’ve downloaded the data from your company database and imported it into a spreadsheet. To use the dataset for this scenario, click the link below and select “Use Template.”

Course Challenge - Scenario 1

OR

If you don’t have a Google account, you can download the template directly from the attachment below.

Now, it’s time to process the data. As you know, this step involves finding and eliminating errors and inaccuracies that can get in the way of your results. While cleaning the data, you notice that information about Splashtastic is missing in one of the rows. You are unsure of how to proceed, so the best course of action is to ask your supervisor for guidance.

  • True ✅
  • False

3. Scenario 1 continued

Once you’ve found the missing information, you analyze your dataset. During analysis, you create a new column F. At the top of the column, you add the attribute Average Percentage of Total Sales - Splashtastic.

Fill in the blank: An attribute is a _______ or quality of data used to label a column.

  • number
  • characteristic ✅
  • headline
  • response

4. Scenario 1 continued

Next, you determine the average total daily sales over the past 12 months at all stores. The range that contains these sales is E2:E39. To do this, you use a function. You input =AVERAGE(E2-E39), but this returns an error. What is the correct command?

  • =AVERAGE(E2;E39)
  • =AVERAGE(E2+E39)
  • =AVERAGE(E2:E39) ✅
  • =AVERAGE(E2,E39)

5. Scenario 1 continued

You’ve reached the share phase of the data analysis process. What can you do in this phase to highlight the Splashtastic sales insights you've discovered?

  • Mission statement
  • Hypothesis
  • Complaint
  • Business task ✅

6. Scenario 2, questions 6-10

You’ve been working for the nonprofit National Dental Society (NDS) as a junior data analyst for about two months. The mission of the NDS is to help its members advance the oral health of their patients. NDS members include dentists, hygienists, and dental office support staff.

The NDS is passionate about patient health. Part of this involves automatically scheduling follow-up appointments after crown replacement, emergency dental surgery, and extraction procedures. NDS believes the follow-up is an important step to ensure patient recovery and minimize infection.

Unfortunately, many patients don’t show up for these appointments, so the NDS wants to create a campaign to help its members learn how to encourage their patients to take follow-up appointments seriously. If successful, this will help the NDS achieve its mission of advancing the oral health of all patients.

Your supervisor has just sent you an email saying that you’re doing very well on the team, and he wants to give you some additional responsibility. He describes the issue of many missed follow-up appointments. You are tasked with analyzing data about this problem and presenting your findings using data visualizations.

An NDS member with three dental offices in Colorado offers to share its data on missed appointments. So, your supervisor uses a database query to access the dataset from the dental group. The query instructs the database to retrieve all patient information from the member’s three dental offices, located in zip code 81137.

The table is dental_data_table, and the column name is zip_code. You write the following query, but get an error. What statement will correct the problem?

SELECT *
FROM dental_data_table
WHERE zip code = 81137

  • zip_code = 81137
  • WHERE zip_code = 81137 ✅
  • WHERE 81137
  • WHERE_zip code = 81137

7. Scenario 2 continued

The dataset your supervisor retrieved and imported into a spreadsheet includes a list of patients, their demographic information, dental procedure types, and whether they attended their follow-up appointment. To use the dataset for this scenario, click the link below and select “Use Template.”

Course Challenge - Scenario 2

OR

If you don’t have a Google account, you can download the template directly from the attachment below.

The patient demographic information includes data such as age, gender, and home address. The fact that the dataset includes people who all live in the same zip code might get in the way of what?

  • Fairness ✅
  • Spreadsheet formulas or functions
  • Data visualization
  • Future dental procedures

8. Scenario 2 continued

As you’re reviewing the dataset, you notice that there are a disproportionate number of senior citizens. So, you investigate further and find out that this zip code represents a rural community in Colorado with about 800 residents. In addition, there’s a large assisted-living facility in the area. Nearly 300 of the residents in the 81137 zip code live in the facility.

You recognize that’s a sizable number, so you want to find out if age has an effect on a patient’s likelihood to attend a follow-up dental appointment. You analyze the data, and your analysis reveals that older people tend to miss follow-ups more than younger people.

So, you do some research online and discover that people over the age 60 are 50% more likely to miss dentist appointments. Sometimes this is because they’re on a fixed income. Also, many senior citizens lack transportation to get to and from appointments.

With this new knowledge, you write an email to your supervisor expressing your concerns about the dataset. He agrees with your concerns, but he’s also impressed with what you’ve learned and thinks your findings could be very important to the project. He asks you to change the business task. Now, the NDS campaign will be about educating dental offices on the challenges faced by senior citizens and finding ways to help them access quality dental care.

Fill in the blank: Changing the business task involves _____ a new question or problem to be solved.

  • recording
  • defining ✅
  • sharing
  • analyzing

9. Scenario 2 continued

You continue with your analysis. In the end, your findings support what you discovered during your online research: As people get older, they’re less likely to attend follow-up dental visits.

But you’re not done yet. You know that data should be combined with human insights in order to lead to true data-driven decision-making. So, your next step is to share this information with people who are familiar with the problem professionally. They’ll help verify the results of your data analysis.

The people who are familiar with a problem professionally and help verify the results of data analysis include customers and competitors.

  • True
  • False ✅

10. Scenario 2 continued

The subject-matter experts are impressed by your analysis. The team agrees to move to the next step: data visualization. You know it’s important that stakeholders at NDS can quickly and easily understand that older people are less likely to attend important follow-up dental appointments than younger people. This will help them create an effective campaign for members.

It’s time to create your presentation to stakeholders. It will include a data visualization that demonstrates the lifetime trend of people being less likely to attend follow-up appointments as they get older.

Fill in the blank: The type of chart that would be most effective in visualizing this is a _____.

  • line chart ✅
  • pie chart
  • bar chart
  • doughnut chart

Explanation:
A line chart effectively shows trends over time or across ages, making it suitable for visualizing how the likelihood of follow-up attendance decreases with age.

11. Scenario 1, question 1-5

You’ve just started a new job as a data analyst. You’re working for a midsized pharmacy chain with 38 stores in the American Southwest. Your supervisor shares a new data analysis project with you.

She explains that the pharmacy is considering discontinuing a bubble bath product called Splashtastic. Your supervisor wants you to analyze sales data and determine what percentage of each store’s total daily sales come from that product. Then, you’ll present your findings to leadership.

You know that it's important to follow each step of the data analysis process: ask, prepare, process, analyze, share, and act. So, you begin by defining the problem and making sure you fully understand stakeholder expectations.

One of the questions you ask is where to find the dataset you’ll be working with. Your supervisor explains that the company database has all the information you need.

Next, you continue to the prepare step. You access the database and write a query to retrieve data about Splashtastic. You notice that there are only 38 rows of data, representing the company’s 38 stores. In addition, your dataset contains five columns: Store Number, Average Daily Customers, Average Daily Splashtastic Sales (Units), Average Daily Splashtastic Sales (Dollars), and Average Total Daily Sales (All Products).

Considering the size of your dataset, you decide to download the data, then import it into a spreadsheet. What step of the data analysis process are you in?

  • Analyze
  • Destroy
  • Process ✅
  • Copy

12. Scenario 1 continued

You’ve downloaded the data from your company database and imported it into a spreadsheet. To use the dataset for this scenario, click the link below and select “Use Template.”

Link to template: Course Challenge - Scenario 1

OR

If you don’t have a Google account, you can download the template directly from the attachment below.

Now, it’s time to process the data. As you know, this step involves finding and eliminating errors and inaccuracies that can get in the way of your results. While cleaning the data, you notice that information about Splashtastic is missing in row 16. Which of the following would be an appropriate course of action?

  • Sort the spreadsheet so the row with missing data is at the bottom.
  • Delete the row with the missing data point.
  • Investigate previous projects and see how this was dealt with there. ✅
  • Replace the row with the average values of the other data points.

13. Scenario 1 continued

Once you’ve found the missing information, you analyze your dataset.

During analysis, you create a new column F. At the top of the column, you add: Average Percentage of Total Sales - Splashtastic. In data analytics, this column label is called an attribute.

  • True ✅
  • False

14. Scenario 1 continued

Fill in the blank: You’ve reached the share phase of the data analysis process. One of the things that you can do in this phase is to prepare a _____ about Splashtastic’s sales and practice your presentation.

  • slideshow ✅
  • record
  • finding
  • prediction

15. Scenario 2 continued

The dataset your supervisor retrieved and imported into a spreadsheet includes a list of patients, their demographic information, dental procedure types, and whether they attended their follow-up appointment. To use the dataset for this scenario, click the link below and select “Use Template.”

Link to template: Course Challenge - Scenario 2

OR

If you don’t have a Google account, you can download the template directly from the attachment below.

The patient demographic information includes data such as age and gender. As you’re learning, it’s your responsibility as a data analyst to make sure your analysis is fair. Looking at the geographic data, you notice that all the patients live in the same zip code. How might this negatively impact the analysis?

  • It could cause the analysis to be useless.
  • It could cause the analysis to be unbiased.
  • It could cause the analysis to be fair.
  • It could cause the analysis to be biased. ✅

16. Scenario 1, question 1-5

You’ve just started a new job as a data analyst. You’re working for a midsized pharmacy chain with 38 stores in the American Southwest. Your supervisor shares a new data analysis project with you.

She explains that the pharmacy is considering discontinuing a bubble bath product called Splashtastic. Your supervisor wants you to analyze sales data and determine what percentage of each store’s total daily sales come from that product. Then, you’ll present your findings to leadership.

You know that it's important to follow each step of the data analysis process: ask, prepare, process, analyze, share, and act. So, you begin by defining the problem and making sure you fully understand stakeholder expectations.

One of the questions you ask is where to find the dataset you’ll be working with. Your supervisor explains that the company database has all the information you need.

Next, you continue to the prepare step. You access the database and write a query to retrieve data about Splashtastic. You notice that there are only 38 rows of data, representing the company’s 38 stores. In addition, your dataset contains five columns: Store Number, Average Daily Customers, Average Daily Splashtastic Sales (Units), Average Daily Splashtastic Sales (Dollars), and Average Total Daily Sales (All Products).

Considering the size of your dataset, what’s the best way to proceed with the process and analyze steps?

  • Continue using the company database to process and analyze the data.
  • Use SQL to process and analyze the data.
  • Download the data, then use a spreadsheet to process and analyze it. ✅
  • Upload the data, then process and analyze it using Tableau.

17. Scenario 1 continued

You’ve downloaded the data from your company database and imported it into a spreadsheet. To use the dataset for this scenario, click the link below and select “Use Template.”

Link to template: Course Challenge - Scenario 1

OR

If you don’t have a Google account, you can download the template directly from the attachment below.

Now, it’s time to process the data. As you know, this step involves finding and eliminating errors and inaccuracies that can get in the way of your results. While cleaning the data, you notice there’s missing data in one of the rows. What might you do to fix this problem? Select all that apply.

  • Ask a colleague on your team how they’ve handled similar issues in the past ✅
  • Sort the spreadsheet so the row with missing data is at the bottom
  • Delete the row with the missing data point
  • Ask you supervisor for guidance ✅

18. Scenario 1 continued

Once you’ve found the missing information, you analyze your dataset. During analysis, you create a new column F. You label the top of the column Average Percentage of Total Sales - Splashtastic.

Fill in the blank: The column label you add to column F is known as ______.

  • an attribute ✅
  • a reference
  • an observation
  • a title

Explanation:
Column labels in datasets are often referred to as attributes because they describe the data contained in the column.

19. Scenario 1 continued

Next, you determine the average total daily sales over the past 12 months at all stores. The range that contains these sales is E2:E39. To do this, you use a function. Fill in the blank to complete the function correctly: =AVERAGE_____.

  • (E2:E39) ✅
  • E2:E39
  • (E2-E39)
  • E2-E39

20. Scenario 1 continued

You’ve reached the share phase of the data analysis process. What can you do in this phase to highlight the Splashtastic sales insights you've discovered?

  • Compile a record of the data.
  • Create a data visualization. ✅
  • Establish a repository for the data.
  • Revisit the analyze phase.

21. Scenario 2, questions 6-10

You’ve been working for the nonprofit National Dental Society (NDS) as a junior data analyst for about two months. The mission of the NDS is to help its members advance the oral health of their patients. NDS members include dentists, hygienists, and dental office support staff.

The NDS is passionate about patient health. Part of this involves automatically scheduling follow-up appointments after crown replacement, emergency dental surgery, and extraction procedures. NDS believes the follow-up is an important step to ensure patient recovery and minimize infection.

Unfortunately, many patients don’t show up for these appointments, so the NDS wants to create a campaign to help its members learn how to encourage their patients to take follow-up appointments seriously. If successful, this will help the NDS achieve its mission of advancing the oral health of all patients.

Your supervisor has just sent you an email saying that you’re doing very well on the team, and he wants to give you some additional responsibility. He describes the issue of many missed follow-up appointments. You are tasked with analyzing data about this problem and presenting your findings using data visualizations.

An NDS member with three dental offices in Colorado offers to share its data on missed appointments. So, your supervisor uses a database query to access the dataset from the dental group. The query instructs the database to retrieve all patient information from the member’s three dental offices, located in zip code 81137.

The table is dental_data_table, and the column name is zip_code. You have written the following query, but received an error when it ran.

SELECT *
FROM dental_data_table
WHERE dental_data_table = 81137

Given the objective of the query, where is the mistake in this query?

  • In line 3, dental_data_table should be replaced with zip_code. ✅
  • SELECT, FROM, and WHERE should not be capitalized.
  • The third line should be WHERE = 81137
  • In line 2, dental_data_table should be replaced with zip_code 81137.

22. Scenario 2 continued

The dataset your supervisor retrieved and imported into a spreadsheet includes a list of patients, their demographic information, dental procedure types, and whether they attended their follow-up appointment. To use the dataset for this scenario, click the link below and select “Use Template.”

Link to template: Course Challenge - Scenario 2

OR

If you don’t have a Google account, you can download the template directly from the attachment below.

The patient demographic information includes data such as age and gender. As you’re learning, it’s your responsibility as a data analyst to make sure your analysis is fair. The fact that the dataset includes people who all live in the same zip code might get in the way of fairness.

  • True ✅
  • False

23. Scenario 2 continued

As you’re reviewing the dataset, you notice that there are a disproportionate number of senior citizens. So, you investigate further and find out that this zip code represents a rural community in Colorado with about 800 residents. In addition, there’s a large assisted-living facility in the area. Nearly 300 of the residents in the 81137 zip code live in the facility.

You recognize that’s a sizable number, so you want to find out if age has an effect on a patient’s likelihood to attend a follow-up dental appointment. You analyze the data, and your analysis reveals that older people tend to miss follow-ups more than younger people.

So, you do some research online and discover that people over the age 60 are 50% more likely to miss dentist appointments. Sometimes this is because they’re on a fixed income. Also, many senior citizens lack transportation to get to and from appointments.

With this new knowledge, you write an email to your supervisor expressing your concerns about the dataset. He agrees with your concerns, but he’s also impressed with what you’ve learned and thinks your findings could be very important to the project. He asks you to change the business task. Now, the NDS campaign will be about educating dental offices on the challenges faced by senior citizens and finding ways to help them access quality dental care.

Changing the business task involves which of the following?

  • Defining the new question or problem to be solved ✅
  • Conducting a gap analysis
  • Using a database instead of a spreadsheet
  • Creating a graphical representation of the data

Explanation:
Changing the business task involves redefining the problem based on new insights.

24. Scenario 2 continued

You continue with your analysis. In the end, your findings support what you discovered during your online research: As people get older, they’re less likely to attend follow-up dental visits.

But you’re not done yet. You know that data should be combined with human insights in order to lead to true data-driven decision-making. So, your next step is to share this information with people who are familiar with the problem professionally. They’ll help verify the results of your data analysis.

The people who are familiar with a problem and help verify the results of data analysis are called subject-matter experts. What are their roles in the process? Select all that apply.

  • Collect, transform, and organize data
  • Validate the choices being made ✅
  • Offer insights into the business problem ✅
  • Identify inconsistencies in the analysis ✅

25. Scenario 2 continued

The subject-matter experts are impressed by your analysis. The team agrees to move to the next step: data visualization. You know it’s important that stakeholders at NDS can quickly and easily understand that older people are less likely to attend important follow-up dental appointments than younger people. This will help them create an effective campaign for members.

It’s time to create your presentation to stakeholders. It will include a data visualization that demonstrates the lifetime trend of people being less likely to attend follow-up appointments as they get older.

You recognize that this data is given in series. What type of data visualization is most effective to visualize this data?

  • A line chart ✅
  • A doughnut chart
  • A table
  • A box plot

26. Scenario 1 continued

You’ve downloaded the data from your company database and imported it into a spreadsheet. To use the dataset for this scenario, click the link below and select “Use Template.”

Link to template: Course Challenge - Scenario 1

OR

If you don’t have a Google account, you can download the template directly from the attachment below.

Now, it’s time to process the data. As you know, this step involves finding and eliminating errors and inaccuracies that can get in the way of your results. While cleaning the data, you notice that information about Splashtastic is missing in row 16. Which of the following would be an appropriate response?

  • Delete the row with the missing data point.
  • Ask a colleague on your team how they’ve handled similar issues in the past. ✅
  • Replace the row with the average values of the other data points.
  • Sort the spreadsheet so the row with missing data is at the bottom.

27. Scenario 1 continued

You’ve reached the share phase of the data analysis process. What can you do in this phase to share the Splashtastic sales insights you've discovered?

  • Revisit the analyze phase.
  • Present your findings to customers.
  • Establish a repository for the data.
  • Present your findings to stakeholders. ✅

28. Scenario 2 continued

The dataset your supervisor retrieved and imported into a spreadsheet includes a list of patients, their demographic information, dental procedure types, and whether they attended their follow-up appointment. To use the dataset for this scenario, click the link below and select “Use Template.”

Link to template: Course Challenge - Scenario 2

OR

If you don’t have a Google account, you can download the template directly from the attachment below.

The patient demographic information includes data such as age and gender. As you’re learning, it’s your responsibility as a data analyst to make sure your analysis is fair. Which aspect of patient demographics might get in the way of fairness?

  • The dataset indicates which dental procedure the patients had performed.
  • The dataset contains patient identification numbers.
  • The dataset includes people who all live in the same zip code. ✅
  • The dataset represents people who are single.

29. Scenario 2 continued

You continue with your analysis. In the end, your findings support what you discovered during your online research: As people get older, they’re less likely to attend follow-up dental visits.

But you’re not done yet. You know that data should be combined with human insights in order to lead to true data-driven decision-making. So, your next step is to share this information with people who are familiar with the problem professionally. They’ll help verify the results of your data analysis.

Fill in the blank: Subject matter experts are people who are familiar with a problem. They can help by identifying inconsistencies in the analysis, _____, and validating the choices being made.

  • offering insights into the business problem ✅
  • creating a presentation with the data
  • collecting data relevant to the business problem
  • redefining the business problem

Explanation:
Subject matter experts provide valuable insights and validate the decisions made based on data analysis.

30. Scenario 1, question 1-5

You’ve just started a new job as a data analyst for a midsized pharmacy chain with 38 stores in the American Southwest. Your supervisor shares a new data analysis project with you.

She explains that the pharmacy is considering discontinuing a bubble bath product called Splashtastic. Your supervisor wants you to analyze sales data and determine what percentage of each store’s total daily sales come from that product. Then, you’ll present your findings to leadership.

You know that it's important to follow each step of the data analysis process: ask, prepare, process, analyze, share, and act. So, you begin by defining the problem and making sure you fully understand stakeholder expectations.

One of the questions you ask is where to find the dataset you’ll be working with. Your supervisor explains that the company database has all the information you need.

Next, you continue to the prepare step. You access the database and write a query to retrieve data about Splashtastic. You notice that there are only 38 rows of data, representing the company’s 38 stores. In addition, your dataset contains five columns: Store Number, Average Daily Customers, Average Daily Splashtastic Sales (Units), Average Daily Splashtastic Sales (Dollars), and Average Total Daily Sales (All Products). You decide to use a spreadsheet to work with the data because you know that spreadsheets work well for processing and analyzing a small dataset, like the one you’re using.

Fill in the blank: To get the data from the database into a spreadsheet, you would first _____ the data as a .CSV file, then import it into a spreadsheet.

  • print
  • download ✅
  • copy and paste
  • email

31. Scenario 1 continued

Next, you determine the average total daily sales over the past 12 months at all stores. The entire range of cells that contain these sales are E2:E39. The correct syntax is =AVERAGE(E2:E39).

  • True ✅
  • False

Explanation:
The syntax =AVERAGE(E2:E39) is correctly used to calculate the average of values in a range of cells.

32. Scenario 1 continued

Next, you create a slideshow, which includes a data visualization to highlight the Splashtastic sales insights you've discovered. You’ve reached which phase of the data analysis process?

  • Manage
  • Analyze
  • Act
  • Share ✅

33. Scenario 2, questions 6-10

You’ve been working for the nonprofit National Dental Society (NDS) as a junior data analyst for about two months. The mission of the NDS is to help its members advance the oral health of their patients. NDS members include dentists, hygienists, and dental office support staff.

The NDS is passionate about patient health. Part of this involves automatically scheduling follow-up appointments after crown replacement, emergency dental surgery, and extraction procedures. NDS believes the follow-up is an important step to ensure patient recovery and minimize infection.

Unfortunately, many patients don’t show up for these appointments, so the NDS wants to create a campaign to help its members learn how to encourage their patients to take follow-up appointments seriously. If successful, this will help the NDS achieve its mission of advancing the oral health of all patients.

Your supervisor has just sent you an email saying that you’re doing very well on the team, and he wants to give you some additional responsibility. He describes the issue of many missed follow-up appointments. You are tasked with analyzing data about this problem and presenting your findings using data visualizations.

An NDS member with three dental offices in Colorado offers to share its data on missed appointments. So, your supervisor uses a database query to access the dataset from the dental group. The query instructs the database to retrieve all patient information from the member’s three dental offices, located in zip code 81137.

The table is dental_data_table, and the column name is zip_code. You write the following query.

SELECT *
FROM dental_data_table
WHERE zip code = 81137

This query is incorrect. How could it be fixed?

  • Decapitalize SELECT, FROM, and WHERE
  • In line 3, replace zip code with zip_code ✅
  • Rewrite line 3 as WHERE_zip code = 81137
  • Rewrite line 3 as zip_code = 81137

Explanation:
The WHERE clause should filter by the column zip_code, not the table name.

34. Scenario 2 continued

The dataset your supervisor retrieved and imported into a spreadsheet includes a list of patients, their demographic information, dental procedure types, and whether they attended their follow-up appointment. To use the dataset for this scenario, click the link below and select “Use Template.”

Link to template:
Course Challenge - Scenario 2

OR

If you don’t have a Google account, you can download the template directly from the attachment below.

The patient demographic information includes data such as age, gender, and home address. When examining the geographic data, you notice that all the patients live in the same zip code.

Fill in the blank: The fact that the dataset includes people who all live in the same zip code might get in the way of ______.

  • accuracy
  • spreadsheet formulas or functions
  • fairness ✅
  • data visualization

35. Scenario 2 continued

As you’re reviewing the dataset, you notice that there are a disproportionate number of senior citizens. So, you investigate further and find out that this zip code represents a rural community in Colorado with about 800 residents. In addition, there’s a large assisted-living facility in the area. Nearly 300 of the residents in the 81137 zip code live in the facility.

You recognize that’s a sizable number, so you want to find out if age has an effect on a patient’s likelihood to attend a follow-up dental appointment. You analyze the data, and your analysis reveals that older people tend to miss follow-ups more than younger people.

So, you do some research online and discover that people over the age 60 are 50% more likely to miss dentist appointments. Sometimes this is because they’re on a fixed income. Also, many senior citizens lack transportation to get to and from appointments.

With this new knowledge, you write an email to your supervisor expressing your concerns about the dataset. He agrees with your concerns, but he’s also impressed with what you’ve learned and thinks your findings could be very important to the project. He asks you to change the business task. Now, the NDS campaign will be about educating dental offices on the challenges faced by senior citizens and finding ways to help them access quality dental care.

Changing the business task involves defining the new question or problem to be solved.

  • True ✅
  • False

36. Scenario 2 continued

You continue with your analysis. In the end, your findings support what you discovered during your online research: As people get older, they’re less likely to attend follow-up dental visits.

But you’re not done yet. You know that data should be combined with human insights in order to lead to true data-driven decision-making. So, your next step is to share this information with people who are familiar with the problem professionally. They’ll help verify the results of your data analysis.

Fill in the blank: Subject matter experts are people who are familiar with a problem. They can help by _____, offering insights into the business problem, and validating the choices being made.

  • creating a presentation with the data
  • collecting data relevant to the business problem
  • identifying inconsistencies in the analysis ✅
  • redefining the business problem

37. Scenario 2 continued

The subject-matter experts are impressed by your analysis. The team agrees to move to the next step: data visualization. You know it’s important that stakeholders at NDS can quickly and easily understand that older people are less likely to attend important follow-up dental appointments than younger people. This will help them create an effective campaign for members.

It’s time to create your presentation to stakeholders. It will include a data visualization that demonstrates the lifetime trend of people being less likely to attend follow-up appointments as they get older.

For this, a pie chart will be most effective.

  • True
  • False ✅

38. Scenario 1, question 1-5

You’ve just started a new job as a data analyst. You’re working for a midsized pharmacy chain with 38 stores in the American Southwest. Your supervisor shares a new data analysis project with you.

She explains that the pharmacy is considering discontinuing a bubble bath product called Splashtastic. Your supervisor wants you to analyze sales data and determine what percentage of each store’s total daily sales come from that product. Then, you’ll present your findings to leadership.

You know that it's important to follow each step of the data analysis process: ask, prepare, process, analyze, share, and act. So, you begin by defining the problem and making sure you fully understand stakeholder expectations.

One of the questions you ask is where to find the dataset you’ll be working with. Your supervisor explains that the company database has all the information you need.

Next, you continue to the prepare step. You access the database and write a query to retrieve data about Splashtastic. You notice that there are only 38 rows of data, representing the company’s 38 stores. In addition, your dataset contains five columns: Store Number, Average Daily Customers, Average Daily Splashtastic Sales (Units), Average Daily Splashtastic Sales (Dollars), and Average Total Daily Sales (All Products).

Considering the size of your dataset, you decide a spreadsheet will be the best tool for your project. You proceed by downloading the data from the database. Describe why this is the best choice.

  • Spreadsheets are most effective when working with queries.
  • Spreadsheets work well for processing and analyzing a small dataset, like the one you’re using. ✅
  • Databases can’t be used for analysis.
  • Only spreadsheets let you download and upload data.

Explanation:
Spreadsheets are ideal for small datasets, providing easy tools for analysis and visualization.

39. Scenario 1 continued

You’ve downloaded the data from your company database and imported it into a spreadsheet. IMPORTANT: To answer questions using this dataset for the scenario, click the link below and select the “Use Template” button before answering the questions.

Link to template:
Course Challenge - Scenario 1

OR

If you don’t have a Google account, you can download the template directly from the attachment below.

Now, it’s time to process the data. As you know, this step involves finding and eliminating errors and inaccuracies that can get in the way of your results. While cleaning the data, you notice that information about Splashtastic is missing for Store Number 15 in Row 16. The best course of action is to delete the row with missing data from your dataset so it doesn’t get in the way of your results.

  • True
  • False ✅

40. Scenario 1 continued

You’ve reached the share phase of the data analysis process. It involves which of the following? Select all that apply.

  • Create a data visualization to highlight the Splashtastic sales insights you’ve discovered. ✅
  • Stop selling Splashtastic because it doesn’t represent a large percentage of total sales.
  • Prepare a slideshow about Splashtastic’s sales and practice your presentation. ✅
  • Present your findings about Splashtastic to stakeholders. ✅

Explanation:
Sharing involves visualizing data, preparing materials, and presenting findings. The option about stopping sales is an action, not part of sharing.

41. Scenario 2 continued

The dataset your supervisor retrieved and imported into a spreadsheet includes a list of patients, their demographic information, dental procedure types, and whether they attended their follow-up appointment. To use the dataset for this scenario, click the link below and select “Use Template.”

Link to template:
Course Challenge - Scenario 2

OR

If you don’t have a Google account, you can download the template directly from the attachment below.

The patient demographic information includes data such as age, gender, and home address. When examining the geographic data, you notice that all the patients live in the same zip code.

Fill in the blank: The fact that the dataset includes people who all live in the same zip code might get in the way of ______.

  • fairness ✅
  • data visualization
  • accuracy
  • spreadsheet formulas or functions

Explanation:
If all data comes from one zip code, the dataset lacks geographic diversity, potentially affecting fairness and generalizability.

Leave a Reply