Probability and Statistics Terms
Get down with the lingo
Bar GraphA representation of data that uses rectangular bars to show the magnitude of categorical variables. Useful in all sorts of real-life situations.
Bivariate DataData that deals with relationships between two variables. Talk about double trouble.
Box And Whisker PlotA representation of data that displays the range and quartiles of the data set. Looks like a kitty cat when you squint and tilt your head to the left.
Categorical DataData that can be organized into mutually exclusive groups or categories.
CombinationOne of all possible ways of choosing objects out of a larger group where order does not matter. Not Judge Judy's favorite math concept, that's for sure. The formula is .
Continuous DataA type of data for which there's no separation between the possible values.
CorrelationThe measure of the linear relationship between two variables. Can be positive or negative, depending on which side of the bed it woke up on.
Discrete DataA type of data for which there's only a finite number of possible values. For example, the integers are discrete because there's no integer between 1 and 2, or between 2 and 3, etc. Also, they're very good at keeping secrets.
EventA set of outcomes of an experiment.
FactorialThe product of all the integers from n down to 1: n! = n × (n – 1) × (n – 2) . . . × 2 × 1. The most excited of all key terms.
GraphA diagram representing data or relationship(s) between variables.
HistogramA representation of data that uses rectangular bars to show the magnitude of quantitative variables.
Independent EventEvents A and B are independent if the outcome of event A has no effect on the outcome of event B. It's grown up! It can do what it wants!
Interquartile RangeThe difference between the third and the first quartile.
Linear RegressionFitting a straight line to a set of data points to find the linear relationship between the dependent and independent variables.
Mean (Average)The sum of all the data points divided by the number of data points.
MedianThe middle value of a list of data points.
ModeThe data point with highest frequency.
Mutually Exclusive EventEvents A and B are mutually exclusive if the occurrence of event A implies event B can't occur. They had a bad breakup last year, and they just refuse to be in the same room.
OddsThe ratio of the probability that an event will happen to the probability that it will not happen.
OutliersData points that are numerically far away from the rest of the data set. The loners of the group, if you will.
PermutationOne of all possible rearrangements of a collection of objects where the order of the objects is super important. To arrange n objects in r ways, we use the formula .
Pie ChartA circular graph divided into sectors, where the area of each sector is proportional to the relative size of the quantities represented. Also incredibly useful in real life.
ProbabilityLikelihood or chance of the occurrence of an event.
Qualitative DataData that can be represented with qualities instead of numbers, like "soft" or "muscular" or "caffeinated."
Quantitative DataData that can be represented with numbers.
QuartilesA set of points that divide the data set into three equal parts.
Scatter PlotA graph of points showing the relationship between two variables.
Stem And Leaf PlotA representation of data where each data point is split into a leaf and a stem. The leaf is usually the last digit, and the stem consists of the other digits.
Univariate DataData with only one variable. Not to be confused with data on a unicycle.
People who Shmooped this also Shmooped...