Statistics definitions


Data
A collection of values recorded in a context.
Population
A set of individuals that we wish to describe and/or make predictions about.
Individual
A member of a population.
Variable
A characteristic recorded about each individual in a data set.
Categorical variable
A variable that records qualities or characteristics of an individual, such as gender or eye color.
Quantitative variable
A variable that measures a numerical characteristic of an individual, such as height, weight, or age.
Center
The most typical value in a data set.
Mode
The value that occurs most often.
Spread
How much the values typically vary from the center.
Range
The difference between the highest and lowest values.
Outlier
A data value that does not fit the overall pattern.
Shape
The overall form of a distribution: mound-shaped and symmetrical, uniform, skewed left, or skewed right.
Sample
A set of data collected and/or selected from a statistical population by a defined procedure.
Frequency
The number of times an event occurred in an experiment or study.
Frequency table
A table that shows the number of times each value occurs in a data set.
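A frequency table can be built by counting how often each value appears. The sketch below uses Python's `collections.Counter`; the eye-color data are made up for illustration and are not from this set.

```python
from collections import Counter

# Hypothetical data: eye colors recorded for eight individuals.
eye_colors = ["brown", "blue", "brown", "green",
              "blue", "brown", "hazel", "brown"]

# Counter maps each distinct value to its frequency (number of occurrences).
frequency_table = Counter(eye_colors)

# Print the table, most frequent value first.
for value, count in frequency_table.most_common():
    print(f"{value:<6} {count}")
```

The frequencies always add up to the number of data points, which is a quick way to check a hand-made table.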
Interval scale
A numeric scale in which we know not only the order of the values, but also the exact differences between them.
Mean
The sum of the values divided by the number of values.
Median
The middle value, or the average of the two middle values, when the data are arranged in numerical order.
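The measures of center and the range described above can be checked on a small data set. This is a minimal sketch using Python's standard `statistics` module; the test scores are invented for illustration.

```python
import statistics

# Hypothetical test scores, already in numerical order.
scores = [70, 85, 85, 90, 100]

mean = sum(scores) / len(scores)       # add up the values, divide by the count
median = statistics.median(scores)     # middle value of the ordered data
mode = statistics.mode(scores)         # value that occurs most often
value_range = max(scores) - min(scores)  # highest value minus lowest value

# With an even number of values, the median is the average of the two middle values.
even_scores = [70, 85, 90, 100]
even_median = statistics.median(even_scores)

print(mean, median, mode, value_range, even_median)
```

For these scores the mean is 86.0, the median and mode are both 85, the range is 30, and the median of the four-value list is 87.5.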
5 number summary
Minimum, Q1, median, Q3, and maximum.
Interquartile range (IQR)
Q3 minus Q1; the spread of the middle 50% of the data (not sensitive to outliers).
Deviation
The amount by which a single data value differs from the mean.
Standard deviation
A measure of how spread out the values are around the mean.
Mean absolute deviation
The average distance between each data value and the mean.
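Deviation, mean absolute deviation, and standard deviation build on one another, which a short calculation makes visible. The sketch below assumes the population form of the standard deviation (dividing by n); many courses use the sample form (dividing by n - 1) instead. The data values are hypothetical.

```python
import math
import statistics

# Hypothetical data set.
data = [2, 4, 4, 4, 5, 5, 7, 9]

mean = sum(data) / len(data)

# Deviation: how far each value is from the mean (can be negative).
deviations = [x - mean for x in data]

# Mean absolute deviation: average distance from the mean.
mad = sum(abs(d) for d in deviations) / len(data)

# Population standard deviation: square root of the average squared deviation.
# Assumption: dividing by n, not n - 1.
std_dev = math.sqrt(sum(d ** 2 for d in deviations) / len(data))

print(mean, mad, std_dev)
```

Here the mean is 5.0, the mean absolute deviation is 1.5, and the standard deviation is 2.0. The standard deviation is larger because squaring gives more weight to values far from the mean.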
First quartile
The median of the lower half of the data. 25% of the data is below Q1.
Third quartile
The median of the upper half of the data. 25% of the data is above Q3.
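The quartile definitions above lead directly to the five-number summary and the interquartile range. This sketch assumes the "median of each half" convention, leaving the overall median out of both halves when the number of values is odd; other textbooks and software use slightly different quartile rules. The function name `five_number_summary` and the data are hypothetical.

```python
import statistics

def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum) for two or more values."""
    data = sorted(values)
    half = len(data) // 2
    lower = data[:half]
    # Assumption: for an odd count, the middle value belongs to neither half.
    upper = data[half:] if len(data) % 2 == 0 else data[half + 1:]
    return (data[0],
            statistics.median(lower),
            statistics.median(data),
            statistics.median(upper),
            data[-1])

# Hypothetical data set with nine values.
data = [1, 3, 4, 6, 7, 8, 10, 12, 15]
minimum, q1, median, q3, maximum = five_number_summary(data)
iqr = q3 - q1  # spread of the middle 50% of the data

print(minimum, q1, median, q3, maximum, iqr)
```

For this data the summary is 1, 3.5, 7, 11.0, 15 and the interquartile range is 7.5.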
Two-way table
A table that shows the relationship between two categorical variables.
Joint frequencies
The counts in the body of a two-way table, one for each combination of categories.
Marginal frequencies
The totals in the total row and total column of a two-way table.
Sample size
The total number of individuals surveyed.
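The parts of a two-way table can be computed from raw survey responses. In this sketch each response is a pair of categories; the grade levels and yes/no answers are invented for illustration.

```python
from collections import Counter

# Hypothetical survey: (grade level, answer) for each individual.
responses = [
    ("9th", "yes"), ("9th", "no"), ("9th", "yes"),
    ("10th", "no"), ("10th", "no"), ("10th", "yes"), ("10th", "no"),
]

# Joint frequencies: counts for each combination (the body of the table).
joint = Counter(responses)

# Marginal frequencies: totals for each row and each column.
row_totals = Counter(grade for grade, _ in responses)
column_totals = Counter(answer for _, answer in responses)

# Sample size: total number of individuals surveyed.
sample_size = len(responses)

print(joint[("9th", "yes")], row_totals["9th"], column_totals["yes"], sample_size)
```

Each marginal frequency is the sum of the joint frequencies in its row or column, and the marginal totals in either direction add up to the sample size.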
Residual
The difference between the observed value of the dependent variable and the predicted value.
Residual plot
A graph that shows the difference between the actual data (what is provided through a table or graph) and the predicted data (what the model says should happen).
Correlation
A measure of the relationship between two variables.
Causation
One event is the result of the occurrence of the other event.
To go in one direction or the other.
Line of best fit
A straight line drawn through the center of a group of data points plotted on a scatterplot.
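One common way to choose a line of best fit is least squares, which is an assumption here since the definition above does not name a method. The sketch below fits such a line to hypothetical points and then computes each residual as observed minus predicted.

```python
# Hypothetical paired data: x is the independent variable, y the dependent one.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

mean_x = sum(x) / len(x)
mean_y = sum(y) / len(y)

# Least-squares slope and intercept (assumed fitting method).
slope = (sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
         / sum((xi - mean_x) ** 2 for xi in x))
intercept = mean_y - slope * mean_x

# Predicted values from the line, and residuals = observed - predicted.
predicted = [intercept + slope * xi for xi in x]
residuals = [yi - pi for yi, pi in zip(y, predicted)]

print(slope, intercept)
print(residuals)
```

The fitted line is approximately y = 2.2 + 0.6x. Plotting the residuals against x gives a residual plot; for a least-squares line the residuals sum to zero, up to rounding.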
Bivariate statistics
Analyzing two variables to find the relationship between them.
Univariate statistics
Analyzing only one variable.