# Basic Statistics & Probability

### Topics

The most commonly used statistic is the **average**, or finding where the middle of the data lies. There are three ways to measure the average: the **mean**, **median**, and **mode**.

Why three ways? Good question. Each will give you a different way of looking at the numbers; depending on the question you're trying to answer (or the argument you're trying to make), any of the three could prove the most useful.

The mean is the most commonly used measure of finding the average. (In fact, in everyday language, people often use the word "average" simply to mean, um, "mean"). Finding the mean is simple: just add up all the numbers in a data set and divide by the number of data entries.

The **median **is the middle number in a data set. However, the data must be in numerical order (least to greatest or greatest to least) before finding this average. If the middle number lies between two numbers, find the mean of those two numbers (add them together and divide by 2).

The **mode **is probably the least common way of finding the average, and in most cases is the least useful. To find the mode, just look for the number that occurs the most. There can be more than one mode, or none at all.

Finally there is the **range**. The range is NOT a measure of the average; however, it is often taught along with averages because it's another helpful way to measure a set of data. The range measures the "spread" of the data, how far apart the smallest and largest values are. To find the range, subtract the smallest value in the data set from the largest.

This will all make a lot more sense with some real examples. For our social networking survey, we found the mean, median, and range for the amount of time each group spends on social networking sites per day.

Time Spent Social Networking(hours/day) | Girls | Boys |

Mean | 2.20 | 1.01 |

Median | 2.17 | 0.98 |

Range | 3.75 | 1.88 |

As we predicted, on average girls spend at least one hour a day more on these sites than boys. Both the mean and median for the girls was significantly higher.

## This Week on Prancing with the B-List Celebrities

Here are the contestants' scores on this week's episode of our favorite show, *Prancing with the B-List Celebrities*:

Evan L | 52 |

Nicole S | 50 |

Pamela A | 47 |

Chad O | 44 |

Erin A | 39 |

Jake P | 38 |

Niecy N | 36 |

Kate G | 32 |

Now, let's find the three averages and the range for the contestants' scores.

Stat | How to Find | Explanation |
---|---|---|

Mean | Add all the scores and divide by 8, the number of contestants. The mean is 42.25. | |

Median | First put the scores in order, then find the middle value. In this set, the middle value lies between 44 and 39, so add these middle numbers together and divide by 2. The median is 41.5. | |

Mode | No mode | No score occurs more than once, so there is no mode for this data set. |

Range | Subtract the smallest score from the largest. The range is 20 points. |

For this data set, there are really only two measures of the average, since there is not a mode. Both the mean and median could be used to describe the average. If you were Evan, would you rather call out the mean or the median? What if you were Kate?

**Look Out**: the range of a data set does NOT measure the average of the data.