Shared Flashcard Set

Details

Definitions--Chapter 3
Introductory Statistics definitions (descriptive measures)
37
Mathematics
Undergraduate 2
02/18/2013

Additional Mathematics Flashcards

 


 

Cards

Term
descriptive measures
Definition
numbers that are used to describe data sets
Term
measure of centeral tendency (measures of center)
Definition
descriptive measures that indicates where the center, or most typical value, in a data set lie

often called averages
Term
mean
Definition
sum of the observations divided by the number of observations
Term
median
Definition
arrange the data in increasing order.

If the number of observations is odd, then the median is the observation exactly in the middle of the ordered list.

If the number of observations is even, then the median is the mean of the two middle observations in the ordered list

In both cases, if we let n denote the number of observations, then the median is at position (n+1)/2 in the ordered list
Term
mode
Definition
find the frequency of each value in the data set

if no value occurs more than once, then the data set has no mode

otherwise, any value that occurs with the greatest frequency is a mode of the data set
Term
resistant measure
Definition
not sensitive to the influence of a few extreme observations (e.g. median, but not mean)
Term
trimmed mean
Definition
more resistant mean

created by removing a percentage of the smallest and largest observations before computing the mean
Term
sample mean
Definition
for a variable x, the mean of the observations for a sample is called a sample mean and is denoted as an x with a line over it

mean of the sample data
Term
measures of variation (measures of spread)
Definition
descriptive measures that indicate the amount of variation or spread in a data set
Term
range
Definition
difference between the maximum (largest) and minimum (smallest) observations
Term
standard deviation
Definition
measures variation by indicating how far, on average, the observations are from the mean
Term
deviations from the mean
Definition
how far each observation is from the mean
Term
sum of squared deviations
Definition
the sum of the squared deviations from the mean

gives a measure of the total deviation from the mean for all the observations
Term
Chebychev's Rule
Definition
valid for all data sets and implies, in particular, that at least 89% of the observations lie within three standard deviations to either side of the mean
Term
Empirical Rule
Definition
If the distribution of the data set is approximately bell shaped, then we can apply this rule, which implies, in particular, that roughly 99.7% of the observations lie within 3 standard deviations to either side of the mean
Term
percentiles
Definition
divide a data set into hundredths, or 100 equal parts
Term
deciles
Definition
divide a data set into tenths, or 10 equal parts
Term
quintiles
Definition
divide a data set into fifths, or 5 equal parts
Term
quartiles
Definition
divide a data set into quarters, or 4 equal parts
Term
first quartile
Definition
the number that divides the bottom 25% from the top 75%
Term
second quartile
Definition
the number that divides the bottom 50% from the top 50%

median
Term
third quartile
Definition
number that divides the bottom 75% from the top 25%
Term
interquartile range (IQR)
Definition
the difference between the first and third quartiles (Q3 - Q1)
Term
Five-Number Summary
Definition
min, Q1, Q2, Q3, max
Term
outliers
Definition
observations that fall well outside the overall pattern of data
Term
lower limit
Definition
Q1 -1.5 * IQR
Term
upper limit
Definition
Q3 + 1.5 * IQR
Term
potential outliers
Definition
observations that fall below the lower limit or above the upper limit
Term
boxplot (box-and-whisker diagram)
Definition
based on the five-number summary and can be used to provide a graphical display of the center and variation of a data set
Term
constructing a boxplot procedure
Definition
1. determine the quartiles

2. determine potential outliers and the adjacent values

3. draw a horizontal axis on which the numbers obtained in steps 1 and 2 can be located. above this axis, mark the quartiles and the adjacent values with vertical lines.

4. connect the quartiles to make a box, and then connect the ox to the adjacent values with lines

5. plot each potential outlier with an asterisk
Term
whiskers
Definition
two lines emanating from the box in a boxplot
Term
population mean (mean of a variable)
Definition
for a variable, x, the mean of all possible observations for the entire population
Term
population standard deviation (standard deviation of a variable)
Definition
for a variable, x, the standard deviation of all possible observations for the entire population
Term
parameter
Definition
a descriptive measure for a population
Term
statistic
Definition
a descriptive measure for a sample
Term
standardized variable
Definition
always has a mean of 0 and standard deviation of 1

the standardized version of a variable x is obtained by first subtracting from x its mean and then dividing by its standard deviation
Term
z-score
Definition
for an observed value of a variable, x, the corresponding value of the standarized variable z is called the z-score of the observation. The term standard score is often used instead of z-score.

A negative z-score indicates that the observation is below the mean, whereas a positive score indicates that the observation is above the mean
Supporting users have an ad free experience!