Terms in this set (65)
statistical procedures used to describe characteristics and responses of groups of subjects
procedures used to draw conclusions about larger populations from small samples of data
A branch of statistics which assumes that sample data comes from a population that follows a probability distribution based on a fixed set of parameters. More precise.x
normal distribution not assumed
Central Limit Theorem
The theory that, as sample size increases, the distribution of sample means of size n, randomly selected, approaches a normal distribution.
normal distribution curve
the bell-shaped curve that results from plotting continuous variation data on a graph.
Determinants of shape on normal distribution
Mean, height. Standard deviation, width
observations are clustered tightly around a central value, less variation
observations are scattered widely from the mean, more variation
area under the normal curve
in a normal model, about 68% of values fall within 1 standard deviation of the mean, about 95% fall within 2 standard deviations of the mean, and about 99.7% fall within 3 standard deviations of the mean
How do you know if data is normally distributed?
Calculate the mean plus or minus twice the standard deviation
D'Agostino & Pearson omnibus normality test, Shapiro-Will normality test
Not very useful but can do
measure of the asymmetry of a probability distribution
Measure of the fatness of the tails of a probability distribution relative to that of a normal distribution. Indicates likelihood of extreme outcomes. (Tailed ness)
a distribution with two modes
A graph of vertical bars representing the frequency distribution of a set of data.
A graph that displays the highest and lowest quarters of data as whiskers, the middle two quarters of the data as a box, and the median
Compare the cumulative probability of our empirical data with an ideal "test" distribution
Compare the quantiles of our empirical data with the ideal
measures of location (central tendency)
mean, median, mode
Measure of dispersion
range, variance, standard deviation, standard error, confidence interval
the arithmetic average of a distribution, obtained by adding the scores and then dividing by the number of scores (data must show normal distribution)
the middle score in a distribution; half the scores are above it and half are below it
The value that occurs most frequently in a given data set.
the difference between the highest and lowest scores in a distribution
The average of the squared differences from the mean.
a computed measure of how much scores vary around the mean score
A measure of how far the sample mean is away from the population mean
When should you use Standard deviation vs. standard error
An estimate of the range that is likely to contain the true population mean, generates an interval I. Which the probability that the sample mean reflects the population mean is high
Confidence interval with known SD
confidence interval with unknown SD
T value instead of z value
SD/SEM/95% CI error bars
a measure of how close a series of measurements are to one another
A description of how close a measurement is to the true value of the quantity measured.
the hypothesis that there is no significant difference between specified populations, any observed difference being due to sampling or experimental error.
The hypothesis that states there is a difference between two or more sets of data, observed effect for the experiment
Type I error (alpha)
incorrectly rejects the null. Think there is a difference in the population when there actually is not. More serious error
When you reduce your significance level you reduce the chance of making this error, as chance of making type I error decreases the chances of making type II error increase.
Type II error (beta)
Incorrectly retaining a false null hypothesis (false negative)
type I vs. type II error
Caused by inherently unpredictable fluctuations in the readings of a measurement apparatus or in the experimenter's interpretation of the instrumental reading, can be in either direction
Is predictable, and typically constant or proportional to the true value. Systematic errors are caused by imperfect calibration of measurement instruments or imperfect methods of observation. Typically occurs in one direction
Variability, detection difference, and power influence on n?
Student t test
Comparison of means, N<30, independent data points (unless paired t-test), normal distribution for equal and unequal variance, random sampling, equal sample size, degrees of freedom (n-1)
Student t-test equation
One tailed vs two tailed t-test
Paired vs. unpaired t-test
Degrees of freedom for t-test
Df= Na-1+Nb-1= 2N-2=N-1
The probability level which forms basis for deciding if results are statistically significant (not due to chance), <0.05 you can reject the null hypothesis
Compares mean values of a contributes variable for multiple categories/groups (F value)
degree of freedom (df) for ANOVA
post hoc tests
additional hypothesis tests that are done after an ANOVA to determine exactly which mean differences are significant and which are not
Pros: make fewer assumptions about the distribution of the data, Cons: less powerful, difficult to detect small differences
Parametric vs. Non-parametric tests
Mann-Whitney U test
Determines whether two uncorrelated means differ significantly when data are nonparametric, uses ranks of the measurements
Mann-Whitney U test equation
Once you calculate U, use the lower of the two to look in the table. NOTE: want Ustat to be less than Ucrit
The non-parametric equivalent to the one-way ANOVA, uses chi-square as calculated value
Pearson correlation coefficient
The most common statistical measure of the strength of linear relationships among variables, 1 total positive correlation, 0 is no correlation, -1 is total negative correlation
a statistical method used to fit a linear model to a given data set
Relationship between x and y, 0 means that knowing x doesn't help in predicting y, 1 means you can perfectly predict x from y
Grubb's test for outliers
YOU MIGHT ALSO LIKE...
Essentials of Business Research | Silver, Stevens, Kernek, Wrenn, Loudon
Module 23 Quiz
Business Statistics Final Exam Review
OTHER SETS BY THIS CREATOR
Adrenergic Agonists and Antagonists
THIS SET IS OFTEN IN FOLDERS WITH...
Global Health Exam 1 review
Global Health Chapter 12: Communicable Diseases