Graphing Data

Learning Objectives

Introduction

A nurse is collecting blood type data from her patients. When a new patient is checked in, the nurse does a simple finger-prick test to see whether the patient’s blood is type A, B, AB, or O. (These are the four possible blood types. Each one also carries a `+` or `-` to represent the RH factor, but for our purposes, let’s just track the type, not the `+` or `-`.) She tracks her results by creating a two-column table with the patient’s name and blood type.

Name

Blood Type

Dominique

A

Ilya

O

Raul

AB

Madison

O

Philip

AB

Samuel

B

Josefine

O

Brett

O

Paula

B

Leticia

AB

The information in this table is an example of dataMathematical term for information such as values or measurements., or information. In this case, the nurse has gathered a fair amount of data about her patients’ blood types. By analyzing the data, she can learn more about the range of patients that she serves.

Data helps us make many kinds of decisions. Organizing data into graphs can help us get a clear picture of a situation and can often help us make decisions based on the picture. So how do you take data and make a picture out of it? Let’s take a look.

Pictures of Data

Let’s return to the data set used previously. If the nurse wanted to represent the data visually, she could use a pictographA graph that uses small icons or pictures to represent data.. Pictographs represent data using images. This visual presentation helps illustrate that for the data in her table, Type O blood is the most common and Type A blood is the least common.

A pictograph table shows Blood Type and Number of People. The Number of People is represented by a blood drop, with one drop equal to one person. Type A, 1 blood drop. Type B, 2 blood drops. Type AB, 3 blood drops. Type O, 4 blood drops.

Interested by the results of this small survey, the nurse continues to document the blood types of her patients until she has surveyed `100` people. She puts all of this data in a table, but she finds that it is hard for her to quickly identify what the data is telling her. She decides to make another pictograph using a different scale.

A pictograph table shows Blood Type and Number of People. The Number of People is represented by a blood drop, with one drop equal to 5 people. Type A, 6 blood drops. Type B, 5 blood drops. Type AB, 2 blood drops. Type O, 7 blood drops.

To read this pictograph, all you need is the scale, which, in this case, is the number of people that each blood drop symbol represents. In this graph, each blood drop represents `5` people. There are six drops next to Type A, so `5 * 6 = 30` people had Type A blood. The table below shows the rest of the information.

Blood Type

Number of People

Type A

`6" drops" * 5" people" = 30" people"`

Type B

`5" drops" * 5" people" = 25" people"`

Type AB

`2" drops" * 5" people" = 10" people"`

Type O

`7" drops" * 5" people" = 35" people"`

 

Example

Problem

 

 

 

 

 

 

 

 

 

The pictograph below shows the number of medals earned at an international competition. How many medals did Japan earn?

 

Country

Medals

Japan

MMMMM

Argentina

M

Germany

MMMMMM

Egypt

MMMM

                 M `= 4` medals

 

 

 

 

 

 

`5 * 4 = 20`

 

Look at the scale of the pictograph. Each M represents `4` medals.

 

Japan has `5` symbols, so the total number of medals is `5 * 4 = 20`.

Answer

Japan earned `20` medals.

 

       

Which table accurately represents the data shown in the pictograph below?

 

Employee

Hourly wage

Wayne

$ $ $ $ $

Sarah

$ $ $ $ $ $ $

Leigh

$ $ $ $

                          $ `= $4`

 

A)

Employee

Hourly wage

Wayne

`$5`

Sarah

`$7`

Leigh

`$4`

 

B)

Employee

Hourly wage

Wayne

`$20`

Sarah

`$22`

Leigh

`$19`

 

C)

Employee

Hourly wage

Wayne

`$10`

Sarah

`$14`

Leigh

`$8`

 

D)

Employee

Hourly wage

Wayne

`$20`

Sarah

`$28`

Leigh

`$16`

 

 

Bar Graphs

Representing data as pictures doesn’t always make sense either. Bar graphsA graph that uses horizontal or vertical bars to represent data. are an alternative (and popular) way to represent data sets, especially those with large amounts of data or which do not lend themselves well to individual symbols. In a bar graph, the number of items in a data category is represented by the height or length of bars.

As when reading pictographs, paying attention to the scale is essential: small differences in the height of two bars can sometimes represent thousands of dollars, for example!

Let’s consider an example. Here is some information about the average life span of five animals in the wild, presented in a table.

Animal

Average Life Span in the Wild (Years)

Zebra

`25`

Giant Panda

`20`

Cheetah

`11`

Baboon

`30`

African Elephant

`70`

Source: National Geographic, accessed July `2011`

This data is fine in a table, but presenting it as a bar graph helps the viewer compare the different life spans more easily. Look at the bar graph below. In this example, the animals are listed on the left side of the graph (also called the y-axisThe vertical axis of a coordinate plane. Also the vertical axis of a bar graph or histogram.), and the life span in years is listed on the bottom (the x-axisThe horizontal axis of a coordinate plane. Also the horizontal axis of a bar graph or histogram.). The graph shows the information by the length of the bar associated with each animal name.

A horizontal bar graph entitled Average Life Span in Wild shows lifespan, in 10-year intervals, on the x-axis and Zebra, Giant Panda, Cheetah, Baboon, and African Elephant on the y-axis. Zebra is a bar that reaches precisely between 20 and 30 years. Giant Panda reaches 20 years. Cheetah reaches just past 10 years. Baboon reaches 30 years. African Elephant reaches 70 years.

Bar graphs are generally used to compare quantities, not to determine exact quantities, especially when the scale is large, as in the next graph.

The bar graph below shows total revenue for four fictional companies in `2011`. Notice that the scale, on the `y`-axis, is very large; each horizontal line represents an increase of `$500,000`. For this reason, it is difficult to tell exactly how much money each company made in `2011`. However, comparing the bars is straightforward. Glancing at the data, you can tell that Lumbertown earned the most (a little over `$2,500,000`), while Machine Systems earned the least (about one million less, at just over `$1,500,000`).

A bar graph entitled 2011 Revenue in Dollars shows four companies on the x-axis and dollar amounts on the y-axis. Machine Systems earned just past 1.5 million. Lumbertown earned past 2.5 million. Quarry, Inc., earned just past 2.5 million, but is less than Lumbertown. Nuts and Bolts earned just past 2 million.

You can also use bar graphs to showcase multiple pieces of information about a specific situation. For example, let’s show the next year’s projected revenue for each company on the graph that you just looked at. You can leave the existing bars in the graph and just add four more.

The blue columns remain, but now they are accompanied by four new red columns that represent the projected revenue for these companies. Again, this data could be expressed in a table. With a bar graph, you gain ease of quick comparison, but lose the detail of the exact values. Looking at this graph tells you that while Lumbertown has the highest revenue for `2011`, it is projected to decrease. Conversely, Machine Systems is projected to increase its revenue. Seeing data visually can help you understand the story that the data is telling about a situation.

A double bar graph entitled 2011 Revenue in Dollars shows the names of four companies on the x-axis and dollar amounts on the y-axis. Each company shows a blue bar for 2011 revenue and a red bar for projected revenue. Machine Systems’ blue bar reaches just past 1.5 million; its red bar reaches just over 2 million. Lumbertown’s blue bar reaches past 2.5 million; its red bar barely reaches over 2.5 million. Quarry, Inc.’s blue bar reaches just past 2.5 million; its red bar reaches nearly 3 million. Nuts and Bolts’ blue bar reaches just past 2 million. Its red bar reaches 2.5 million.

Example

Problem

 

 

 

 

 

 

 

 

 

Use the graph to list the drinks from the most number of calories to the least number of calories (serving size: `16` ounces).

 

A bar graph entitled Number of Calories shows Type of Drink on the x-axis and Calories on the y-axis. Soda reaches between 200 and 250 calories. Juice Blend reaches between 250 and 300 calories. 2% Milk reaches just below 250. Sports Drink reaches between 100 and 150 calories.

 

 

The `y`-axis shows total calories, and the `x`-axis shows the drink. The taller the bar, the more calories the drink has.

 

Juice Blend `~~275`

 

 

 

`2%` Milk `~~245`

 

Soda `~~230`

 

 

 

Sports Drink `~~125`

The juice blend contains over `250` calories, so it has the most calories per serving.

 

Soda and `2%` milk are both between `200` and `250` calories, but the bar for `2%` milk is taller, so it must contain more calories.

 

Sports drink has the shortest bar; it contains about `125` calories.

Answer

From most to least number of calories per serving:

Juice Blend, `2%` Milk, Soda, Sports Drink

 

Example

Problem

 

 

 

 

 

 

 

 

 

 

Based on the graph below, which player’s rebounding increased the most from `2009` to `2010`?

 

A double bar graph is entitled Rebounds per Game, 2009 and 2010, and shows the names of five players along the x-axis and rebounds on the y-axis. Jackson’s 2009 rebounds reach 7; his 2010 rebounds reach 8. Brunson’s 2009 rebounds reach 7; his 2010 rebounds reach just above 10. Catchings’ 2009 and 2010 rebounds both appear to reach 7, with 2010 being just slightly below 2009. Dupree’s 2009 rebounds reach just under 8, while his 2010 rebounds are slightly below that. De Souza’s 2009 rebounds reach 9, while his 2010 rebounds reach just above 8.

Source: WNBA.com, accessed July `2011`

 

 

The `y`-axis shows rebounds per game, and the `x`-axis shows the player’s name. A taller bar represents more rebounds per game by the player.

 

This graph shows two sets of data: one for `2009`, in blue, and one for `2010`, in red. To compare the data from one year to the next, compare the heights of the two bars for each player.

 

Jackson and Brunson

Two players had higher rebound averages in `2010` than they did in `2009`. This is indicated by the red bar being taller than the blue bar. The other players’ red bars are shorter, so their rebounds decreased.

 

 

Comparing the sizes of the increases, you can tell that Brunson increased her per game rebounding more than Jackson did.

Answer

 The player whose rebounding increased the most from `2009` to `2010` was Brunson. 

Histograms

The data sets you have looked at so far have shown categorical dataData that details non-numerical features of an object. Examples of categorical data include eye color, blood type, and types of computers.. Categorical data is data that can be separated into categories. For instance, data about eye color (brown, blue, green), blood type (O, A, B, AB), and the type of computer you use at work or home (PC or Mac) are all categorical.

Quantitative dataNumerical data. Examples of quantitative data include height, weight, and test scores. is sometimes called numerical data, because it is data that is represented by numbers. Quantitative data sets consist of quantities such as age (`1, 2, 23, 34, 77...`), test scores ( `90``95`, `100`, `72`...), and height (`55` inches, `50` inches, `68` inches…). Notice that all of the data here are numbers.

Continuous quantitative data can be graphed using a histogramA graph using bars to show continuous quantitative data over a series of similar-sized intervals. The height of the bar shows the frequency of the data, and the width of the bar represents the interval for the data.. A histogram resembles a bar graph, but instead of having categories along the axis, it has numbers listed in order and usually grouped in intervals (such as `0` to `10`, `11` to `20`, and so on). While the bars in bar graphs can have space between the bars, the bars of a histogram are usually touching, as the data is continuous.

Let’s look at how you could display the set of test score data listed below.

Name

Score (`0` to `100`)

Alex

`81`

Beatriz

`73`

Celia

`79`

Donnie

`91`

Erykah

`87`

Fred

`79`

Gigi

`81`

Helene

`84`

Irma

`88`

Joelle

`96`

Looking at the scores, you can tell that some people scored in the `70`s,  some scored in the `80`s,  and some scored in the `90`s. You can organize the data in a histogram with the continuous data from `0` to `100`, by using the intervals to `71` to `80``81` to `90`, and `91` to `100`.

A histogram entitled Test Scores has score intervals of 71 to 80, 81 to 90, and 91 to 100 on the x-axis, and Frequency on the y-axis. The interval 71 to 80 has a frequency of 3. The interval 81 to 90 has a frequency of 5. The interval 91 to 100 has a frequency of 2.

There are only three intervals for this histogram and ten data points. For a histogram to be meaningful, it should include `100` data points or more and `7` intervals or more. (For this reason, many histograms are created using a variety of technical tools.) The histogram then takes on interesting shapes that can provide a lot of information.

Even with only `10` data points, you find a much more meaningful histogram by using smaller intervals (such as `71` to `75``76` to `80``81` to `85``86` to `90``91` to `95`, and `96` to `100`).

A histogram entitled Test Scores has Score intervals on the x-axis and Frequency on the y-axis. The intervals 51 to 55, 56 to 60, 61 to 65, 66 to 70 do not have any scores. The interval 71 to 75 has a frequency of 1. The interval 76 to 80 has a frequency of 2. The interval 81 to 85 has a frequency of 3. The interval 86 to 90 has a frequency of 2. The interval 91 to 95 has a frequency of 1. The interval 96 to 100 has an interval of 1.

This second graph shows a bit of a curve to the data. Nobody scored `70` or lower, and the data peaks in the `81` to `85` range. Increasing the number of intervals has an effect on the shape of the graph, and helps us see some trends that are in the data.

Example

Problem

 

 

 

 

 

 

 

 

 

Based on the histogram below, how many students are between `151` and `180` centimeters tall?

 

A histogram entitled Height in Centimeters has Centimeter intervals on the x-axis and Number of Students on the y-axis. The interval 136 to 140 has 2 students; 141 to 145 has 4 students; 146 to 150 has 2 students; 151 to 155 has 1 student; 156 to 160 has 2 students; 161 to 165 has 6 students; 166 to 170 has 2 students; 171 to 175 and 176 to 180 have zero students; 181 to 185 has 1 student.

 

Number of students in each interval:

`151` to `155``1`

`156` to `160``2`

`161` to `165``6`

`166` to `170``2`

`171` to `175``0`

`176` to `180``0`

 

 

Each interval in this histogram is `5` centimeters.

 

`1 + 2 + 6 + 2 + 0 + 0 = 11`

Add the number of students.

Answer

`11` students are between `151` and `180` centimeters tall.

 

A teacher made this histogram to track scores on a recent math test. How many students scored in the `91` to `100` range?

 

A histogram entitled Scores on Math Test has Score intervals on the x-axis and Number of Students on the y-axis. The interval 61 to 65 has 1 student. 66 to 70 has 0 students. 71 to 75 has 2 students. 76 to 80 has 3 students. 81 to 85 has 6 students. 86 to 90 has 1 student. 91 to 95 has 6 students. 96 to 100 has 3 students.

 

A) `3`

 

B) `6`

 

C) `9`

 

D) `22`

 

 

Summary

Data is mathematical information. Mathematical data is often recorded in tables to organize, or spreadsheets to organize and sort. Graphs can help you see the data visually, which can help you to better understand the data. A pictograph is a graph that uses symbols to represent data. Bar graphs show the frequency of categorical data, using bars instead of symbols. Histograms are similar to bar graphs in that they are constructed out of a series of bars. However, histograms represent continuous quantitative (numerical) data, which can illustrate trends as well as other more advanced attributes.