IB Biology: Statistical Analysis - Question Set
Terms in this set (56)
the 3 types of data are...
- ordinal (ranked/relative data)
- interval (on a scale)
a value that is representative of a data set
- 3 ways to measure: mean, median & mode
- shows how similar the data points are in a set of data
an average of data points
the number in the middle
the most frequently occurring measurement
the measure of the spread of data
(a.k.a. largest # - smallest #)
is a line that extends above and below a bar in a chart/data point in a graph
- shows range of data, standard deviation, 95% confidence intervals (CI), & standard error (how close to the means the samples are)
standard deviation (SD)
is used to summarize the spread of values around the mean
- normal distribution = 68% of all values lie within the 1st standard deviation
- compares the means & spreads the data between 2 or more samples
- can indicate that control variables are not constant
- flat-bell shape = the data's spread out widely from the mean
- narrow/tall bell-shape = data's really close & not spread out
- clustered data points = small SD; spread apart = large SD
what does the size of the standard deviation (SD) indicate about the data?
the SD shows the spread of the range
- small SD signifies a small range
- large SD signifies a big range
refers to how probable it is that a relationship is caused by pure chance
is based on probability (p) that chance alone could make a difference
is used to compare the means of 2 samples to determine whether they are statistically different
: data shows normal distribution (follows the bell-curve) & the sample size must be at least 10
3 things needed for the t-test
: df (degrees of freedom), p-value, & t-value
- results will either accept/reject null hypothesis
a hypothesis that says that the IV & DV have no correlation; it's based on chance
if the critical value is ≤ 0.05
(5%) (in the p-value column), then the
data is not reliable
; you must
accept the null hypothesis
difference is due to chance
if the critical value is > 0.05
, then the
data is reliable
; you must
reject the null hypothesis
there isn't a significant difference between the means
correlation v. causation
shows a linear relationship between 2 variables. however, just because there's a pattern doesn't mean one is the cause of the other. if the IV affects the DV, then it's
The degree of closeness of a measured value to its true amount.
A short statement describing the purpose or reason for an experiment.
A statistical test for determining the significance of departures of observed data from an expected results.
A standard (reference) treatment that helps to ensure that the responses to the other treatments can be reliably interpreted.
Variable that is fixed at a specific amount across all treatment groups.
A relationship in which variables vary together in some predictable way, but cause and effect in not implied.
Facts collected for analysis.
a variable whose values are determined by another variable
Calculated values that summarize the main features of a collection of data.
a graphical representation of the variability of data. Used on graphs to indicate the uncertainty in a reported measurement.
A diagram which often displays numerical information in a way that can be used to identify trends in the data.
A tentative explanation of an observation, capable of being tested by experimentation.
A variable whose values are set, or systematically altered, by the investigator.
the central value in a sorted data set
the value that occurs most often in a data set
the repeatability of a measurement
data described in descriptors or terms rather than by numbers
Data able to be expressed in numbers. Numerical values derived from counts or measurements.
a relationship between a dependent variable and one or more independent variables
Estimate of the true population mean based upon data collected by random sampling. Valid for population data that are normally distributed. The sum of the data divided by the number of data entries (n).
sample standard deviation
A calculated statistic expressing the variability of a sample population around its mean.
The use of an ordered, repeatable method to investigate, manipulate, gather, and record data.
A calculated measure of some attribute of a sample (e.g. the arithmetic mean)
A test used to determine if the difference between the two sample means is significant.
A representation of data, often summarized, organized in rows and columns.
trend (of data)
a relationship between variables in a data set.
A factor in an experiment that is subject to change.
What is the standard deviation?
how far a number deviates from the mean
The standard deviation is used to what?
summarize the spread of data around the mean
How is the standard deviation represented?
mean +/- SD
What does a large standard deviation mean?
data is widespread around the mean
What does a small standard deviation mean?
data is clustered around the mean
Name two applications of standard deviation?
1. used to compare the means of 2 populations
2. useful tool in evaluating experimental designs
What do large SDs across treatments show?
that variables may not be controlled
Most biological variation shows a normal distribution which displays what?
a bell-shaped curve
What percentage of data falls between +/- 1 SD of the mean?
What percentage of data falls between +/- 2 SD of the mean?
Error bars are used to show what?
the variability of data graphically
What are error bars used to represent?
standard deviation and the range
Correlation does not establish what?
What do error bars show?
The range of data (all the samples), the standard deviation (68% of the sample), and the 95% confidence intervals