Shared Flashcard Set

Details

Statistics I: Term Test 2
University of Guelph STAT*2040
48
Mathematics
Undergraduate 2
03/10/2015

Cards

Term
Alternative Hypothesis (Ha)
Definition

aka Researcher hypothesis

The hypothesis that the researcher is trying to show. Can be one-sided or two-sided.

Term
Beta (β)
Definition
The probability of a type II error. Depends on α, n, σ, and the true value of μ. It can be calculated only if a true value of μ is specified. It has a trade-off with α: for a fixed n, lowering α raises β.
Term
Central Limit Theorem
Definition
Even if a population is not normally distributed, the sample mean will be approximately normally distributed, as long as the sample size is large. The sampling distribution of the sample mean tends towards the normal distribution as sample size increases. As a rule of thumb, if the sample size is 30 or more, it is usually safe to assume approximate normality.
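
A minimal simulation sketch of this idea, in Python. The exponential population, the sample size of 40, and the 5000 repetitions are illustrative assumptions, not values from the course material.

```python
import random
import statistics

random.seed(1)  # fixed seed so the run is repeatable

n = 40       # size of each sample (assumed for illustration)
reps = 5000  # number of samples drawn

# Exponential population with mean 1 and standard deviation 1: strongly right-skewed.
sample_means = [
    statistics.fmean(random.expovariate(1.0) for _ in range(n))
    for _ in range(reps)
]

mean_of_means = statistics.fmean(sample_means)
sd_of_means = statistics.stdev(sample_means)
print(mean_of_means)  # close to the population mean, 1
print(sd_of_means)    # close to sigma / sqrt(n) = 1 / sqrt(40), about 0.158
```

A histogram of sample_means would look roughly bell-shaped even though the population itself is skewed.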
Term
Confidence interval
Definition

point estimate ± margin of error

A type of statistical inference. Gives a range of plausible values for a parameter, together with a confidence level describing how sure we are that the range contains the parameter. Based on sample data. There is a trade-off between confidence level and margin of error.
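
As a worked sketch, a confidence interval for μ when σ is known could be computed as follows. The sample summary values are hypothetical, and the z-based interval is only one of several interval types.

```python
from math import sqrt
from statistics import NormalDist

# Hypothetical sample summary (illustrative values only).
x_bar = 52.0       # sample mean, used as the point estimate
sigma = 8.0        # population standard deviation, assumed known
n = 64             # sample size
conf_level = 0.95

alpha = 1 - conf_level
z_crit = NormalDist().inv_cdf(1 - alpha / 2)  # z_(alpha/2), about 1.96 for 95%
margin = z_crit * sigma / sqrt(n)             # margin of error

lower, upper = x_bar - margin, x_bar + margin
print(round(lower, 2), round(upper, 2))       # 50.04 53.96
```

Raising conf_level to 0.99 widens the interval, which is the trade-off between confidence level and margin of error.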

Term
Confidence level (1 - α)
Definition
A percentage, equal to (1 - α) × 100%. The proportion of confidence intervals, over repeated sampling, that would contain the true value of the parameter. The most commonly used confidence level is 95%. The larger the confidence level, the greater the margin of error, because the multiplier zα/2 grows. Related to α.
Term
Critical value
Definition
The numbers on the edge(s) of the rejection region.
Term
Degrees of freedom (DF)
Definition
n - 1 for one-sample t procedures.
Term
Distribution-free procedure
Definition

aka Nonparametric procedure

Does not require a normal distribution.

Term
Hypothesis
Definition
A statement about a parameter, such as μ. Never about a statistic, such as x-bar.
Term
Hypothesis testing
Definition

We reject the null hypothesis if there is significant evidence against it. 

1. Formulate a null hypothesis and alternative hypothesis, not based on research data.

2. Calculate an appropriate test statistic (z or t), based on research data.

3. Assess the significance of the test statistic to determine significance of evidence against the null hypothesis.

Term
Hypothesized value (μ0)
Definition
The value that μ is said to be equal to in the null hypothesis.
Term
Lower bound
Definition
The lower endpoint of the confidence interval: point estimate - margin of error.
Term
Margin of error
Definition

aka Bound on error

aka Error bound

The distance on either side of the point estimate that forms the confidence interval. Maximum error of estimate at the stated confidence level.

Term
Minimum variance unbiased estimator
Definition
An unbiased estimator with the smallest possible variance.
Term
Normal distribution table
Definition
A table of areas under the standard normal curve, usually the area to the left of a given z value.
Term
Null hypothesis (H0)
Definition

aka Status quo hypothesis

The hypothesis of no effect. No difference.

Term
One-sided alternative
Definition

aka One-tailed test

An alternative hypothesis used in a z or t test when interest lies in only one direction from μ0. The p-value is the area to the left of the test statistic (for μ < μ0) or to the right of it (for μ > μ0).

μ < μ0

or

μ > μ0

Term
P-value
Definition

The probability of getting the observed value of the test statistic or a more extreme value, if the null hypothesis is true. The smaller the p-value, the stronger the evidence against the null hypothesis. The null hypothesis is rejected if the p-value is ≤ α. If the null hypothesis is true, the distribution of the p-value is uniform between 0 and 1. If the null hypothesis is false, the distribution of the p-value is concentrated towards 0. How strongly it concentrates towards 0 depends on sample size, the magnitude of the difference between the hypothesized and true mean, and the variance.

p < 0.01 : Very strong evidence against H0

0.01 < p < 0.05 : Strong evidence against H0

0.05 < p < 0.1 : Some weak evidence against H0

0.1 < p < 0.2 : Little or no evidence against H0

p > 0.2 : No evidence against H0
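
A short sketch of how a p-value could be computed from a z statistic in Python. The function name p_value and its alternative argument are hypothetical, chosen for this example only.

```python
from statistics import NormalDist

def p_value(z, alternative):
    """P-value for a z statistic; alternative is 'less', 'greater', or 'two-sided'."""
    std_normal = NormalDist()  # mean 0, standard deviation 1
    if alternative == "less":       # Ha: mu < mu0, area to the left of z
        return std_normal.cdf(z)
    if alternative == "greater":    # Ha: mu > mu0, area to the right of z
        return 1 - std_normal.cdf(z)
    if alternative == "two-sided":  # Ha: mu != mu0, area in both tails
        return 2 * (1 - std_normal.cdf(abs(z)))
    raise ValueError("alternative must be 'less', 'greater', or 'two-sided'")

print(round(p_value(2.1, "greater"), 4))    # 0.0179
print(round(p_value(2.1, "two-sided"), 4))  # 0.0357
```

By the guidelines above, both results fall in the 0.01 < p < 0.05 band, so they count as strong evidence against H0.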

Term
P-value approach
Definition
A method of determining significance. Measures the strength of the evidence against the null hypothesis.
Term
Parameter
Definition
A numerical characteristic of a population, such as μ or σ. We don't actually know the value of parameters. We use statistical inference to estimate parameters.
Term
pnorm(x,μ,σ)
Definition
The R command that gives the area to the left of x in a normal distribution curve.
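
For readers working in Python rather than R, the same area can be obtained from the standard library. The wrapper below is a sketch; the name pnorm is reused only to mirror the R command and is not a built-in Python function.

```python
from statistics import NormalDist

def pnorm(x, mu=0.0, sigma=1.0):
    """Area to the left of x under a normal curve with mean mu and standard deviation sigma."""
    return NormalDist(mu, sigma).cdf(x)

print(pnorm(0))                       # 0.5, half the area lies below the mean
print(round(pnorm(1.96), 4))          # 0.975
print(round(pnorm(110, 100, 10), 4))  # 0.8413, same area as pnorm(1)
```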
Term
Point estimate (e.g. x-bar, p-hat)
Definition
The single value that is an estimate of a parameter. Based on a single sample.
Term
Power
Definition

Power = 1 - β

The probability of rejecting the null hypothesis, given that it is false. Increases as α increases, as n increases, as σ decreases, and as the true value of μ is farther from μ0.
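
A sketch of a power calculation for one specific case: a z test of H0: μ = μ0 against the one-sided alternative μ > μ0, with σ known. The function name and the numbers passed to it are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

def power_upper_tail(mu0, mu_true, sigma, n, alpha=0.05):
    """Power of a z test of H0: mu = mu0 against Ha: mu > mu0."""
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1 - alpha)       # reject H0 when z > z_crit
    shift = (mu_true - mu0) / (sigma / sqrt(n))  # distance of the true mean from mu0, in standard errors
    return 1 - std_normal.cdf(z_crit - shift)

base = power_upper_tail(mu0=100, mu_true=103, sigma=10, n=25)
print(round(base, 3))
print(power_upper_tail(100, 103, 10, 100) > base)             # larger n
print(power_upper_tail(100, 103, 10, 25, alpha=0.10) > base)  # larger alpha
print(power_upper_tail(100, 103, 5, 25) > base)               # smaller sigma
print(power_upper_tail(100, 106, 10, 25) > base)              # true mean farther from mu0
```

Each of the four comparisons prints True, matching the four ways of increasing power listed in this card.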

Term
Power function
Definition
A plot of power vs. values of μ.
Term
Practical significance
Definition
Real-world significance. A finding with statistical significance may be unimportant, uninteresting, or not practically useful in real life.
Term
Probability density function (PDF)
Definition
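The function that describes the curve of a continuous distribution; areas under the curve give probabilities. The formula for the normal curve is one example.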
Term
Probability distribution
Definition
For a statistic, the distribution of its values across all possible samples of the same size. The sampling distribution of a statistic is the probability distribution of that statistic if samples of the same size were repeatedly drawn from the population.
Term
Rejection region(s)
Definition
The tail area(s) of the test statistic's distribution under the null hypothesis, with total area equal to α. In the rejection region approach, if the test statistic falls in the rejection region, the null hypothesis is rejected. It is bounded by critical values.
Term
Rejection region approach
Definition
A method of determining significance. The chosen α value is used to calculate a rejection region. If the test statistic falls in the rejection region, the null hypothesis is rejected. The downside is that we reach the same conclusion whether the test statistic is very close to or very far from the critical values.
Term
Robust
Definition
Describes a procedure that still performs reasonably well when its assumptions are violated. t procedures are robust to many violations of the normality assumption.
Term
Sample proportion (p-hat)
Definition
The statistic that estimates the population proportion p. It is calculated from sample data; the value of the parameter p itself is not actually known.
Term
Sample size (n)
Definition
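The greater the sample size, the smaller the margin of error. Diminishing returns: the margin of error is proportional to 1/√n, so the sample size must be quadrupled to cut the margin of error in half.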
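
The diminishing returns can be seen numerically. This sketch assumes a z-based margin of error with σ known; the function name and the value σ = 10 are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

def margin_of_error(sigma, n, conf_level=0.95):
    """z-based margin of error for a mean, with sigma assumed known."""
    z_crit = NormalDist().inv_cdf(1 - (1 - conf_level) / 2)
    return z_crit * sigma / sqrt(n)

for n in (25, 100, 400, 1600):
    print(n, round(margin_of_error(10.0, n), 2))
# 25 3.92
# 100 1.96
# 400 0.98
# 1600 0.49
```

Each fourfold increase in n only halves the margin of error.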
Term
Sampling distribution (X-bar)
Definition

The probability distribution of a statistic in all possible samples of the same size. For the sample mean, the mean of the sampling distribution is still μ, but its standard deviation is smaller than σ, according to the equation

σX-bar = σ / √n

Term
Significance level (α)
Definition

α = P(reject H0 | H0 is true)

The probability of a type I error, that is, of rejecting the null hypothesis when it is true. It is not the probability that the null hypothesis is true. Usually chosen to be 0.05. It has a trade-off with β. Related to confidence level.

Term
Standard deviation (σ)
Definition
The greater the standard deviation, the greater the margin of error. Forms a linear relationship with margin of error.
Term
Standard error of the sample mean (SE(X-bar))
Definition

SE(X-bar) = s / √n

Because σ is unknown, SE(X-bar) is the estimate of the standard deviation of the sampling distribution of the sample mean. Used in a t test.

Term
Standard normal distribution (z)
Definition
Normal distribution where μ = 0, and σ = 1.
Term
Statistical significance
Definition
Determined using the rejection region approach or the p-value approach. The effect observed in the sample would be unlikely to have occurred by chance alone if the null hypothesis were true. Strongly affected by sample size.
Term
T distribution
Definition

aka Student's t distribution

Has heavier tails and a lower peak than the standard normal distribution. Its shape is determined by its degrees of freedom. As degrees of freedom increase, the t distribution tends towards the standard normal distribution. With infinite degrees of freedom it is the standard normal distribution.

Term
T test
Definition

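Used when σ is unknown and is estimated by s. Works well if n > 40, even for populations that are not normal. With n < 15 it is reliable only if the population is close to normal. Same form as a z test, but uses s in place of σ and a t distribution with n - 1 degrees of freedom.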

t = (x-bar - μ0) / (s/√n)
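
A sketch of the calculation in Python. The function name and the eight measurements are made up for illustration.

```python
from math import sqrt
from statistics import mean, stdev

def t_statistic(data, mu0):
    """Return (t, degrees of freedom) for a one-sample t test of H0: mu = mu0."""
    n = len(data)
    x_bar = mean(data)
    s = stdev(data)               # sample standard deviation (divides by n - 1)
    standard_error = s / sqrt(n)  # SE(X-bar)
    return (x_bar - mu0) / standard_error, n - 1

sample = [9.8, 10.4, 10.1, 9.6, 10.6, 10.3, 9.9, 10.5]  # made-up measurements
t, df = t_statistic(sample, mu0=10.0)
print(round(t, 3), df)  # 1.183 7
```

Python's standard library has no t distribution, so the p-value would come from a t table or statistical software, using df degrees of freedom.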

Term
Test statistic (z or t)
Definition

If the null hypothesis is true, z will have the standard normal distribution.

If the null hypothesis is true, t will have a t distribution of n - 1 degrees of freedom.

Term
Transformation
Definition
Applying a function such as the log or square root to the data so that the transformed values are closer to normally distributed.
Term
Two-sided alternative
Definition

aka Two-tailed test

An alternative hypothesis used in a z or t test when a difference from μ0 in either direction is of interest. The p-value is the area to the left of -|z| plus the area to the right of +|z|.

μ ≠ μ0

Term
Type I error
Definition
Rejecting the null hypothesis when actually it is true. The probability of a type I error is α.
Term
Type II error
Definition
Not rejecting the null hypothesis when actually it is false. The probability of a type II error is β.
Term
Unbiased estimator
Definition
An estimator whose expected value equals the parameter it estimates. For example, x-bar is an unbiased estimator of μ because E(x-bar) = μ. A good estimator also has low variability.
Term
Upper bound
Definition
The upper endpoint of the confidence interval: point estimate + margin of error.
Term
Z test
Definition

Used when σ is known. This is a rare occurrence. Requires a normally distributed population, or a sample large enough for the central limit theorem to apply.

z = (x-bar - μ0) / (σ / √n)
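
A sketch of a complete two-sided z test in Python, showing the rejection region approach and the p-value approach side by side. All numbers are made up for illustration.

```python
from math import sqrt
from statistics import NormalDist

# Hypothetical study: H0: mu = 50 against Ha: mu != 50.
mu0, sigma, n, x_bar, alpha = 50.0, 6.0, 36, 52.5, 0.05

z = (x_bar - mu0) / (sigma / sqrt(n))       # test statistic, here 2.5

std_normal = NormalDist()
z_crit = std_normal.inv_cdf(1 - alpha / 2)  # critical values are -z_crit and +z_crit
p = 2 * (1 - std_normal.cdf(abs(z)))        # two-sided p-value

reject_by_region = abs(z) > z_crit          # rejection region approach
reject_by_p = p <= alpha                    # p-value approach
print(z, round(p, 4), reject_by_region, reject_by_p)  # 2.5 0.0124 True True
```

The two approaches always reach the same decision at a given α; the p-value additionally shows how strong the evidence is.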
