Term
Alternative Hypothesis (Ha) |
|
Definition
aka Researcher hypothesis
The hypothesis that the researcher is trying to show. Can be one-sided or two-sided. |
|
|
Term
|
Definition
The probability of a type II error. Depends on α, n, and μ. It can be calculated if the true value of μ is known. It has a trade-off with α. |
|
|
Term
|
Definition
Even if a population is not normally distributed, the sample mean will be normally distributed, as long as the sample size is large. Sample mean distribution tends towards the normal distribution as sample size increases. If sample size is 30 or more, it can be safe to assume normal distribution. |
|
|
Term
|
Definition
p ± margin of error
A type of statistical inference. Gives a range of plausible values for a parameter where we are a certain percent confident the parameter occurs. Based on sample data. There is a trade-off between confidence level and margin of error. |
|
|
Term
|
Definition
A percentage. The probability that p-hat is found within the confidence interval. The most commonly used confidence level is 95%. The larger the confidence level, the greater the margin of error. Related to α. |
|
|
Term
|
Definition
The numbers on the edge(s) of the rejection region. |
|
|
Term
|
Definition
|
|
Term
Distribution-free procedure |
|
Definition
aka Nonparametric procedure
Does not require a normal distribution. |
|
|
Term
|
Definition
Made about parameters, such as μ. Never about statistics such as x bar. |
|
|
Term
|
Definition
We reject the null hypothesis if there is significant evidence against it.
1. Formulate a null hypothesis and alternative hypothesis, not based on research data.
2. Calculate an appropriate test statistic (z or t), based on research data.
3. Assess the significance of the test statistic to determine significance of evidence against the null hypothesis. |
|
|
Term
|
Definition
The value that μ is said to be equal to in the null hypothesis. |
|
|
Term
|
Definition
The part of the confidence interval below x bar. |
|
|
Term
|
Definition
aka Bound on error
aka Error bound
The distance on either side of p that forms the confidence interval. Maximum error of estimate. |
|
|
Term
Minimum variance unbiased estimator |
|
Definition
An unbiased estimator with the smallest possible variance. |
|
|
Term
Normal distribution table |
|
Definition
A list of values of the area under the normal distribution curve. |
|
|
Term
|
Definition
aka Status quo hypothesis
The hypothesis of no effect. No difference. |
|
|
Term
|
Definition
aka One-tailed test
An alternative hypothesis used in a z or t test used when there is an interest to only one side of μ. P-value is the area to the left of z or to the right of z.
μ < μ0
or
μ > μ0 |
|
|
Term
|
Definition
The probability of getting the observed value of the test statistic or a more extreme value, if the null hypothesis is true. The smaller the p-value, the stronger the evidence against the null hypothesis. The null hypothesis is rejected if the p value is ≤ α. If the null hypothesis is true, the distribution of the p-value is uniform between 0 and 1. If the null hypothesis is false, the distribution tends towards 0. The tendency towards 0 depends on sample size, magnitude of difference between hypothesized and true mean, and variance.
p < 0.01 : Very strong evidence against H0
0.01 < p < 0.05 : Strong evidence against H0
0.05 < p < 0.1 : Some weak evidence against H0
0.1 < p < 0.2 : Litter or no evidence against H0
p > 0.2 : No evidence against H0 |
|
|
Term
|
Definition
A method of determining significance. Measures the strength of the evidence against the null hypothesis. |
|
|
Term
|
Definition
We don't actually know the value of parameters. We used statistical inferece to estimate parameters. |
|
|
Term
|
Definition
The R command that gives the area to the left of x in a normal distribution curve. |
|
|
Term
|
Definition
The single value that is an estimate of a parameter. Based on a single sample. |
|
|
Term
|
Definition
Power = 1 - β
The probability of rejecting the null hypothesis, given that it is false. Increases as α increases, as n increases, as σ decreases, and as the true value of μ is farther from μ0. |
|
|
Term
|
Definition
A plot of power vs. values of μ. |
|
|
Term
|
Definition
Real-world significance. A finding with statistical significance may be unimportant, uninteresting, or not practically useful in real life. |
|
|
Term
Probability density function (PDF) |
|
Definition
The formula for a normal distribution. |
|
|
Term
|
Definition
The distribution of the statistic in all possible samples of the same size. The sampling distribution of a statistic is the probability distribution of that statistic if samples of the same size were repeatedly drawn from the population. |
|
|
Term
|
Definition
Area equal to α. In the rejection region approach, if the test statistic falls in the rejection region, the null hypothesis is rejected. It is bound by critical values. |
|
|
Term
Rejection region approach |
|
Definition
A method of deterining significance. The chosen α value is used to calculate a rejection region. If the test statistic falls in the rejection region, the null hypothesis is rejected. Downside is that we get the same result if the test statistic is very close or very far from critical values. |
|
|
Term
|
Definition
When a procedure still performs reasonably well when assumptions are violated. T procedures are robust to many violations of the normality assumption. |
|
|
Term
Sample proportion (p-hat) |
|
Definition
The parameter that p is an estimate of. We do not actually know the value of p-hat. |
|
|
Term
|
Definition
The greater the sample size, the smaller the margin of error. Diminishing returns. |
|
|
Term
Sampling distribution (X-bar) |
|
Definition
The probability distribution of a statistic in all possible samples. μ stays the same; but σ is smaller, according to the equation
σX-bar = (σ/√n) |
|
|
Term
|
Definition
α = P(type I error | H0 is true)
The probability that the null hypothesis is true. Usually chosen to be 0.05. The probability of a type I error. It has a trade-off with β. Related to confidence level. |
|
|
Term
|
Definition
The greater the standard deviation, the greater the margin of error. Forms a linear relationship with margin of error. |
|
|
Term
Standard error of the sample mean (SE(X-bar)) |
|
Definition
SE(X-bar) = s / √n
Because σ is unknown, SE(X-bar) is the estimate of the standard deviation of a sample. Used in a t test. |
|
|
Term
Standard normal distribution (z) |
|
Definition
Normal distribution where μ = 0, and σ = 1. |
|
|
Term
|
Definition
Determined using the rejection region approach or the p-value approach. The effect observed in the sample was unlikely to have occured due to chance. Strongly affected by sample size. |
|
|
Term
|
Definition
aka Student's t distribution
Has heavier tails and a lower peak than the standard normal distribution. Measured in degrees of freedom. As degrees of freedom increases, the t distribution tends towards the standard normal distribution. Infinite degrees of freedom is the standard normal distribution. |
|
|
Term
|
Definition
Used when σ is known. Works well if n > 40. Does not work well if n < 15. Same as a z-test, but uses a t distribution.
t = (x-bar - μ0) / (s/√n) |
|
|
Term
|
Definition
If the null hypothesis is true, z will have normal distribution.
If the null hypothesis is true, t will have a t distribution of n - 1 degrees of freedom. |
|
|
Term
|
Definition
Using the logs or square roots of data to create a normal distribution. |
|
|
Term
|
Definition
aka Two-tailed test
An alternative hypothesis used in a z test. P-value is the area to the left of -z and to the right of +z.
μ ≠ μ0 |
|
|
Term
|
Definition
Rejecting the null hypothesis when actually it is true. The probability of a type I error is α. |
|
|
Term
|
Definition
Not rejecting the null hypothesis when actually it is false. The probability of a type II error is β. |
|
|
Term
|
Definition
Estimates a parameter when the expected value is μ. A good estimate if variability (s) is low. |
|
|
Term
|
Definition
The part of the confidence interval above x bar. |
|
|
Term
|
Definition
Used when σ is known. This is a rare occurrance. Must be normally distributed population.
z = x-bar - μ0 / (σ / √n) |
|
|