Making Inferences and Convincing Arguments about Samples and Populations

Introduction

In October 2012, The New York Times published an interactive graph on their website comparing a person’s income before and after considering the person’s taxes and health care benefits. Click on the image to access the article.

How Health Care Affects Income. Bar graph showing base annual income and income after taxes and government benefits.

Data are frequently displayed in graphs in the media to make it easier to visualize what a set of numbers actually means. Different data displays are used for different purposes.

Data are also used to describe characteristics of a particular population or a group that is being studied. It is usually difficult to survey every member of a population, so frequently a representative sample is used to describe the characteristics of a population. For example, The New York Times reported the following survey that was conducted in 2009. The researchers did not talk to every American resident over the age of 15. Instead, they selected a sample of residents in order to describe the entire population, or all American residents over the age of 15.

Graph: How Different Groups Spend Their Day.

To see the graph as it was in the original interactive, as well as additional graphs from the same survey, click on the above image. (Use Firefox or Internet Explorer as your browser in order to use the interactive.)

In this lesson, you will investigate how a random sample can be used to describe a population. You will also look at different ways that valid conclusions can be drawn from a set of data that is displayed in a visual format such as a table or a graph. 

Samples and Populations

Trees and forests are important to the ecology of any town, city, or state. Urban planners need to know about the different trees present in a city to make sure there is enough green space to promote recreation. Foresters need to know about the types of trees in a forest to combat outbreaks of disease and other pests that harm forests.

student detectiveIn this section, you will use sampling techniques in order to describe a neighborhood forest.

Susana wants to know what types of trees and how many of each type are in her neighborhood. Instead of counting every tree in the neighborhood, Susana randomly selected 25 houses out of the 240 in her neighborhood to survey the types of trees.

Use the interactive below to determine the number of each of the four types of trees that Susana found at the 25 houses. Match the leaf or tree to the number of that type of tree that Susana found. Use the information below to complete the table that follows.

Copy and paste the table below into your notes, and use the information in the interactive to complete the table.

Type of Tree
Number of Trees
White Oak
 
Cedar Elm
 
Loblolly Pine
 
Pecan
 

Click here to check your answer.

Now that Susana has completed her survey, she is ready to use it to make predictions about the population of trees in her neighborhood.

Susana can set up and solve a proportion relating the ratio of each type of tree in her sample to the ratio of each type of tree in her neighborhood.

Watch the video below to see how Susana set up and solved a proportion to determine the number of white oak trees in her entire neighborhood. (Note: There is no audio in the video.)

Use what you have seen so far to answer the following questions.

  1. Based on the data that Susana collected, how many pecan trees are there in her neighborhood?
  2. Based on the data that Susana collected, about how many cedar elm trees are there in her neighborhood?

Pause and Reflect

  1. When using a sample to make predictions about a population, why is it important to make sure that the sample is representative of the population?
  2. What is one way you can help to make sure your sample is representative of the population?

Practice

Reginald wanted to know which pet was most popular among seventh graders at his school. He randomly surveyed 60 students during seventh grade lunch. The results of his survey are shown in the graph below. Use this graph to answer questions 1–3.

  1. How many of the students surveyed said that their favorite pet was a cat?
  2. If there are 340 seventh grade students in Reginald’s school, how many of them should Reginald expect to say that a dog was their favorite pet?
  3. Of the 340 seventh grade students, how many more would Reginald expect to favor dogs as pets over cats?

Interpreting Graphical Displays of Data: Part-Whole Relationships

pink carIn the last section, you used data from a sample to make predictions about the population. In this section, you will use the data presented in graphs to determine if particular conclusions can be made.

Cynthia wanted to know what colors of vehicles the teachers at Northside Middle School drive. So she went to the first row in the teachers’ parking lot and counted the number of different vehicle colors. The data she collected are shown in the bar graph below.

chart of colors of vehicles in parking lot. blue 6, white 8, black 10, green 8, red 12

blue carJacob thought that Cynthia’s car color study was interesting, so he went into the same row of the teachers’ parking lot to look at the age of each car. Jacob determined the car’s age by subtracting the year the car was made from the current year. He represented his data in the dot plot shown below.

Data dot plot of car age in years. Range of years from zero to 12.

Pause and Reflect

  1. How can you use a bar graph to make comparisons of one category compared to the whole set of data represented in the graph?
  2. How is a bar graph like a dot plot? How are they different?

Practice

For questions 1 and 2, the graph below shows how different sources were used to generate electricity in Texas during 2010.

circle graph with generation sources of Texas electricity
 

Determine whether or not each of the following statements can be supported from the information in the graph.

For questions 3 and 4, the box-and-whiskers plot, or box plot, shown below represents the average monthly low temperatures and high temperatures in San Antonio, Texas.

Graph: Temperatures in San Antonio. Box and whisker plot showing the average monthly low and high temperatures.

Data Source: www.weather.com

Determine whether or not each of the following statements can be supported from the information in the graph.

Summary

In this lesson, you used data collected from a random sample to make predictions about a population. You also saw different graphical representations of data and used those representations to determine whether or not given conclusions were valid.

woman with broken down car

 

The graph shown below displays data that Brendan and Emily collected from a recent fishing trip. They caught 15 fish and made the graph showing the lengths of the fish that they caught.

histogram of length of fish that Brendan and Emily caught
 
When they studied the data, Brendan made several conclusions. Assist Emily in sorting the conclusions into those that are valid and those that are invalid.