Shared Flashcard Set

Details

Title

Statistics I: Term Test 2

Description

University of Guelph STAT*2040

Subject

Mathematics

Level

Undergraduate 2

Created

03/10/2015

Term

Alternative Hypothesis (H_a)

Definition

aka Researcher hypothesis

The hypothesis that the researcher is trying to show. Can be one-sided or two-sided.

Term

Beta (β)

Definition

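The probability of a type II error. Depends on α, n, σ, and the true value of μ. It can be calculated only if the true value of μ is known (or assumed). It has a trade-off with α: lowering one raises the other when everything else is held fixed.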

Term

Central Limit Theorem

Definition

Even if a population is not normally distributed, the sample mean will be approximately normally distributed, as long as the sample size is large. The distribution of the sample mean tends towards the normal distribution as sample size increases. As a rule of thumb, if the sample size is 30 or more, it is usually safe to assume an approximately normal distribution, although strongly skewed populations may need larger samples.

Term

Confidence interval

Definition

p ± margin of error

A type of statistical inference. Gives a range of plausible values for a parameter, built so that we are a stated percent confident the parameter lies within it. Based on sample data. There is a trade-off between confidence level and margin of error.
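
As a minimal sketch of "point estimate ± margin of error", assuming a sample mean with known σ (the function name and the numbers are made up for illustration), a z-based interval could be computed with Python's standard library:

```python
from math import sqrt
from statistics import NormalDist

def z_confidence_interval(x_bar, sigma, n, confidence=0.95):
    """Return (lower, upper) for a population mean when sigma is known."""
    alpha = 1 - confidence
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)  # critical value z_(alpha/2)
    margin = z_crit * sigma / sqrt(n)             # margin of error
    return x_bar - margin, x_bar + margin

# Hypothetical data: x-bar = 50, sigma = 10, n = 25
lower, upper = z_confidence_interval(50, 10, 25)
print(round(lower, 2), round(upper, 2))  # 46.08 53.92
```

Raising the confidence argument widens the interval, which is the trade-off described on this card.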

Term

Confidence level (1 - α)

Definition

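A percentage, equal to 1 - α. The long-run proportion of confidence intervals, built this way from repeated samples, that contain the true parameter. It is not the probability that the point estimate falls within the interval; the point estimate is always at the centre. The most commonly used confidence level is 95%. The larger the confidence level, the greater the margin of error. It determines the critical value z_α/2.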

Term

Critical value

Definition

The numbers on the edge(s) of the rejection region.

Term

Degrees of freedom (DF)

Definition

n - 1 for one-sample t procedures, where n is the sample size.

Term

Distribution-free procedure

Definition

aka Nonparametric procedure

Does not require a normal distribution.

Term

Hypothesis

Definition

A statement about a parameter, such as μ. Never about a statistic, such as x-bar.

Term

Hypothesis testing

Definition

We reject the null hypothesis if there is significant evidence against it.

1. Formulate a null hypothesis and alternative hypothesis, not based on research data.

2. Calculate an appropriate test statistic (z or t), based on research data.

3. Assess the significance of the test statistic to determine significance of evidence against the null hypothesis.

Term

Hypothesized value (μ₀)

Definition

The value that μ is said to be equal to in the null hypothesis.

Term

Lower bound

Definition

The smallest value in the confidence interval: point estimate - margin of error.

Term

Margin of error

Definition

aka Bound on error

aka Error bound

The distance on either side of p that forms the confidence interval. Maximum error of estimate.

Term

Minimum variance unbiased estimator

Definition

An unbiased estimator with the smallest possible variance.

Term

Normal distribution table

Definition

A list of values of the area under the normal distribution curve.

Term

Null hypothesis (H₀)

Definition

aka Status quo hypothesis

The hypothesis of no effect. No difference.

Term

One-sided alternative

Definition

aka One-tailed test

An alternative hypothesis used in a z or t test when we are interested in a difference in only one direction from μ₀. The p-value is the area to the left of the test statistic (for μ < μ₀) or to the right of it (for μ > μ₀).

μ < μ₀

or

μ > μ₀

Term

P-value

Definition

The probability of getting the observed value of the test statistic or a more extreme value, if the null hypothesis is true. The smaller the p-value, the stronger the evidence against the null hypothesis. The null hypothesis is rejected if the p value is ≤ α. If the null hypothesis is true, the distribution of the p-value is uniform between 0 and 1. If the null hypothesis is false, the distribution tends towards 0. The tendency towards 0 depends on sample size, magnitude of difference between hypothesized and true mean, and variance.

p < 0.01 : Very strong evidence against H₀

0.01 < p < 0.05 : Strong evidence against H₀

0.05 < p < 0.1 : Some weak evidence against H₀

0.1 < p < 0.2 : Little or no evidence against H₀

p > 0.2 : No evidence against H₀
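
As an illustrative sketch (the function name and the labels "less", "greater", and "two-sided" are assumptions, not course notation), the p-value for an observed z statistic could be computed like this:

```python
from statistics import NormalDist

def z_p_value(z, alternative="two-sided"):
    """P-value for an observed z statistic under the standard normal."""
    std = NormalDist()  # mean 0, standard deviation 1
    if alternative == "less":        # H_a: mu < mu0, area to the left of z
        return std.cdf(z)
    if alternative == "greater":     # H_a: mu > mu0, area to the right of z
        return 1 - std.cdf(z)
    if alternative == "two-sided":   # H_a: mu != mu0, both tails
        return 2 * (1 - std.cdf(abs(z)))
    raise ValueError("alternative must be 'less', 'greater', or 'two-sided'")

p = z_p_value(1.96)  # close to 0.05
print(round(p, 3))
```

Comparing the result with α gives the reject or do-not-reject decision described on this card.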

Term

P-value approach

Definition

A method of determining significance. Measures the strength of the evidence against the null hypothesis.

Term

Parameter

Definition

A numerical characteristic of a population, such as μ or σ. We usually don't know the value of a parameter, so we use statistical inference to estimate it.

Term

pnorm(x,μ,σ)

Definition

The R command that gives the area to the left of x in a normal distribution curve.
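
For readers working outside R, a rough Python equivalent can be sketched with the standard library. The wrapper below simply reuses the name pnorm for illustration; it is not part of any Python package, and the example values are made up.

```python
from statistics import NormalDist

def pnorm(x, mu=0.0, sigma=1.0):
    """Area to the left of x under a normal curve with mean mu and sd sigma."""
    return NormalDist(mu, sigma).cdf(x)

# Hypothetical example: area to the left of 110 when mu = 100 and sigma = 15
area_left = pnorm(110, 100, 15)

# The area to the right is the complement
area_right = 1 - area_left
```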

Term

Point estimate (p)

Definition

The single value that is an estimate of a parameter. Based on a single sample.

Term

Power

Definition

Power = 1 - β

The probability of rejecting the null hypothesis, given that it is false. Increases as α increases, as n increases, as σ decreases, and as the true value of μ is farther from μ₀.
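
A minimal sketch of a power calculation, assuming an upper-tailed z test (H_a: μ > μ₀) with known σ; the function name and the numbers are hypothetical:

```python
from math import sqrt
from statistics import NormalDist

def power_upper_tailed_z(mu0, mu_true, sigma, n, alpha=0.05):
    """Power of the z test of H0: mu = mu0 against Ha: mu > mu0."""
    std = NormalDist()
    z_crit = std.inv_cdf(1 - alpha)               # reject H0 when z > z_crit
    shift = (mu_true - mu0) / (sigma / sqrt(n))   # true mean in standard-error units
    return 1 - std.cdf(z_crit - shift)            # P(reject H0 | mu = mu_true)

# Hypothetical values: mu0 = 100, true mu = 105, sigma = 15
for n in (25, 100):
    print(n, round(power_upper_tailed_z(100, 105, 15, n), 3))
```

Under these assumptions β is 1 minus the returned value, and when the true mean equals μ₀ the function returns α.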

Term

Power function

Definition

A plot of power vs. values of μ.

Term

Practical significance

Definition

Real-world significance. A finding with statistical significance may be unimportant, uninteresting, or not practically useful in real life.

Term

Probability density function (PDF)

Definition

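The function that describes the density curve of a continuous random variable, such as the normal distribution. Probabilities are areas under the curve, and the total area is 1.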

Term

Probability distribution

Definition

The distribution of the statistic in all possible samples of the same size. The sampling distribution of a statistic is the probability distribution of that statistic if samples of the same size were repeatedly drawn from the population.

Term

Rejection region(s)

Definition

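The set of test statistic values for which the null hypothesis is rejected. Its area under the null distribution is equal to α. In the rejection region approach, if the test statistic falls in the rejection region, the null hypothesis is rejected. It is bounded by critical values.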

Term

Rejection region approach

Definition

A method of determining significance. The chosen α value is used to calculate a rejection region. If the test statistic falls in the rejection region, the null hypothesis is rejected. The downside is that we reach the same conclusion whether the test statistic is barely past a critical value or far beyond it.
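
As a sketch of this approach for a two-sided z test (the function name is hypothetical), the critical value comes from α and the decision is a simple comparison:

```python
from statistics import NormalDist

def reject_two_sided(z, alpha=0.05):
    """Return True if z falls in the two-sided rejection region."""
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)  # about 1.96 when alpha = 0.05
    return abs(z) > z_crit

# A statistic just past the critical value and one far past it give
# the same decision, which is the downside noted on this card.
print(reject_two_sided(1.97), reject_two_sided(5.0))  # True True
print(reject_two_sided(1.95))                         # False
```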

Term

Robust

Definition

When a procedure still performs reasonably well when assumptions are violated. T procedures are robust to many violations of the normality assumption.

Term

Sample proportion (p-hat)

Definition

The statistic that estimates the population proportion. It is calculated from sample data, so its value is known; it is the population proportion whose value we do not actually know.

Term

Sample size (n)

Definition

The greater the sample size, the smaller the margin of error. There are diminishing returns, because the margin of error shrinks in proportion to 1/√n: quadrupling n only halves it.
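
To illustrate the diminishing returns, here is a small sketch assuming a z-based margin of error with known σ (the function name and values are made up):

```python
from math import sqrt
from statistics import NormalDist

def margin_of_error(sigma, n, confidence=0.95):
    """z-based margin of error for a mean when sigma is known."""
    z_crit = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    return z_crit * sigma / sqrt(n)

# Hypothetical sigma = 10; each step multiplies n by 4
for n in (25, 100, 400):
    print(n, round(margin_of_error(10, n), 2))
# 25 3.92
# 100 1.96
# 400 0.98
```

Each fourfold increase in n only halves the margin of error.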

Term

Sampling distribution (X-bar)

Definition

The probability distribution of a statistic in all possible samples of the same size. For the sample mean, the mean of the sampling distribution is still μ, but its standard deviation is smaller than σ, according to the equation

σ_X-bar = (σ/√n)
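
A small simulation can make this concrete. The sketch below assumes an exponential population with mean 1 and standard deviation 1 (chosen only because it is clearly non-normal) and compares the spread of many sample means with σ/√n:

```python
import random
from math import sqrt
from statistics import mean, stdev

random.seed(0)  # fixed seed so the run is repeatable

n = 25              # size of each sample
num_samples = 5000  # number of repeated samples

# Assumed population: exponential with mean 1 and standard deviation 1
sample_means = [
    mean(random.expovariate(1.0) for _ in range(n))
    for _ in range(num_samples)
]

observed_mean = mean(sample_means)  # should be close to mu = 1
observed_sd = stdev(sample_means)   # should be close to sigma / sqrt(n)
predicted_sd = 1.0 / sqrt(n)        # 0.2

print(round(observed_mean, 3), round(observed_sd, 3), predicted_sd)
```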

Term

Significance level (α)

Definition

α = P(reject H₀ | H₀ is true)

The probability of a type I error. It is not the probability that the null hypothesis is true. Usually chosen to be 0.05. It has a trade-off with β. Related to confidence level: confidence level = 1 - α.

Term

Standard deviation (σ)

Definition

The greater the standard deviation, the greater the margin of error. The margin of error is directly proportional to σ.

Term

Standard error of the sample mean (SE(X-bar))

Definition

SE(X-bar) = s / √n

Because σ is unknown, the sample standard deviation s is used in its place. SE(X-bar) is the estimate of the standard deviation of the sampling distribution of X-bar. Used in a t test.

Term

Standard normal distribution (z)

Definition

Normal distribution where μ = 0, and σ = 1.

Term

Statistical significance

Definition

Determined using the rejection region approach or the p-value approach. The effect observed in the sample was unlikely to have occurred due to chance alone. Strongly affected by sample size.

Term

T distribution

Definition

aka Student's t distribution

Has heavier tails and a lower peak than the standard normal distribution. Its shape is determined by the degrees of freedom. As the degrees of freedom increase, the t distribution tends towards the standard normal distribution. With infinite degrees of freedom it is the standard normal distribution.

Term

T test

Definition

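Used when σ is unknown and is estimated by s. Assumes a normally distributed population, but as a rough guideline it works well even for non-normal populations if n > 40, and may not work well if n < 15 unless the data look close to normal. Same form as a z test, but uses s in place of σ and a t distribution with n - 1 degrees of freedom.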

t = (x-bar - μ₀) / (s/√n)
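
As a sketch, the t statistic can be computed directly from raw data with the standard library. The function name and the data values are made up for illustration. Python's standard library has no t distribution, so the p-value would still come from a t table or a statistics package.

```python
from math import sqrt
from statistics import mean, stdev

def t_statistic(sample, mu0):
    """Return (t, degrees of freedom) for a one-sample t test."""
    n = len(sample)
    se = stdev(sample) / sqrt(n)  # SE(X-bar) = s / sqrt(n), with s the sample sd
    t = (mean(sample) - mu0) / se
    return t, n - 1

# Hypothetical measurements, testing H0: mu = 5.0
data = [5.1, 4.9, 5.6, 5.8, 5.2, 5.4]
t, df = t_statistic(data, 5.0)
print(round(t, 2), df)
```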

Term

Test statistic (z or t)

Definition

If the null hypothesis is true, z will have a standard normal distribution.

If the null hypothesis is true, t will have a t distribution of n - 1 degrees of freedom.

Term

Transformation

Definition

Using the logs or square roots of data to create a normal distribution.

Term

Two-sided alternative

Definition

aka Two-tailed test

An alternative hypothesis used in a z or t test when a difference from μ₀ in either direction is of interest. The p-value is the area to the left of -|z| plus the area to the right of +|z|.

μ ≠ μ₀

Term

Type I error

Definition

Rejecting the null hypothesis when actually it is true. The probability of a type I error is α.

Term

Type II error

Definition

Not rejecting the null hypothesis when actually it is false. The probability of a type II error is β.

Term

Unbiased estimator

Definition

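An estimator whose expected value is equal to the parameter it estimates. For example, x-bar is an unbiased estimator of μ because the expected value of x-bar is μ. It is a good estimator if its variability is also low.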

Term

Upper bound

Definition

The largest value in the confidence interval: point estimate + margin of error.

Term

Z test

Definition

Used when σ is known. This is a rare occurrence. The population must be normally distributed, or the sample size must be large enough for the central limit theorem to apply.

z = (x-bar - μ₀) / (σ / √n)
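
The z statistic divides the whole difference between x-bar and μ₀ by σ/√n. A minimal sketch with hypothetical numbers, including the two-sided p-value:

```python
from math import sqrt
from statistics import NormalDist

def z_statistic(x_bar, mu0, sigma, n):
    """z = (x_bar - mu0) / (sigma / sqrt(n)); note the grouping of the numerator."""
    return (x_bar - mu0) / (sigma / sqrt(n))

# Hypothetical values: x-bar = 52, mu0 = 50, sigma = 10, n = 25
z = z_statistic(52, 50, 10, 25)             # (52 - 50) / (10 / 5) = 1.0
p = 2 * (1 - NormalDist().cdf(abs(z)))      # two-sided p-value
print(z, round(p, 4))  # 1.0 0.3173
```

With p well above 0.05, this made-up sample would give no evidence against H₀.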
