Term
What are the 4 steps for data-driven problem solving? |
|
Definition
1. Practical problem 2. Statistical problem 3. Statistical solution 4. Practical solution |
|
|
Term
What is the practical problem? |
|
Definition
The problem is stated in practical terms.
Example: billing errors cost us millions |
|
|
Term
What is the statistical problem? |
|
Definition
Express the practical problem as a statistical one.
Example: what factors are significant predictors of billing errors? |
|
|
Term
What is a statistical solution? |
|
Definition
Obtain a statistical solution using data analysis tools.
Example: inconsistency in the order-filling process causes billing errors |
|
|
Term
What is a practical solution? |
|
Definition
Convert the statistical solution to practical terms.
Example: simplify the billing process and train |
|
|
Term
|
Definition
The science (and art) of data-based decision-making |
|
|
Term
Population |
|
Definition
The entire process output about which we wish to draw a conclusion |
|
|
Term
Sample |
|
Definition
The subset used to represent the population |
|
|
Term
|
Definition
A sample drawn such that every member of the population has an equal chance of being selected |
|
|
Term
|
Definition
A quantity associated with the population |
|
|
Term
|
Definition
A quantity calculated from the sample |
|
|
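The definitions above (a population, a subset drawn with equal chance of selection, a quantity of the population versus a quantity calculated from the sample) can be sketched in a few lines. This is only an illustration: the invoice amounts are made-up data, and the sample size of 30 and the fixed seed are arbitrary assumptions.

```python
import random
import statistics

rng = random.Random(0)  # fixed seed (assumption) so the sketch is repeatable

# Hypothetical population: billing amounts for 1,000 invoices.
population = [round(rng.uniform(50, 500), 2) for _ in range(1000)]

# Draw without replacement; every invoice has an equal chance of selection.
sample = rng.sample(population, 30)

# A quantity associated with the population.
population_mean = statistics.mean(population)
# A quantity calculated from the sample; it estimates the population mean.
sample_mean = statistics.mean(sample)

print(f"population mean: {population_mean:.2f}")
print(f"sample mean:     {sample_mean:.2f}")
```

The two printed values will usually be close but not equal; that gap is the sampling error that inferential tools try to quantify.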
Term
What are the 2 types of statistical tools? |
|
Definition
Descriptive and inferential |
|
|
Term
Descriptive statistics |
|
Definition
Aim to describe and summarize the important features of a population or process.
Graphical: charts, graphs, etc.
Numerical: mean, median, mode |
|
|
Term
Inferential statistics |
|
Definition
Use sample data to help make comparisons or draw inferences about the overall population.
Analysis tools: hypothesis testing, data modeling |
|
|
Term
What is an enumerative study? |
|
Definition
A study that aims to answer questions about the current population, such as how many or in what proportion.
Focuses on the historical rather than the predictive |
|
|
Term
Which type of statistic is used in enumerative studies? |
|
Definition
Descriptive |
|
|
Term
What is an analytical study? |
|
Definition
Answers questions such as why something happens or what its causes are.
Generalizes results to future states of the population |
|
|
Term
Which statistical tools do analytical studies use? |
|
Definition
both descriptive and inferential |
|
|
Term
What are the 2 types of measurement? |
|
Definition
Discrete and continuous |
|
|
Term
Discrete measurement |
|
Definition
Representations of categories or attributes.
Examples: people, cars, animals; good/bad, boy/girl |
|
|
Term
Continuous measurement |
|
Definition
Derived from a scale or continuum that is infinitely divisible.
Units: seconds, minutes, inches.
Examples: measurements of time, temperature, weight |
|
|
Term
|
Definition
Data that is counted.
A term for discrete measurements, because they sort or count items based on attributes.
-defects |
|
|
Term
|
Definition
Data that is measured.
A term for continuous measurements, since they can take on infinitely many values between any 2 points |
|
|
Term
|
Definition
Groups are labels with no order
-profession
-color of car |
|
|
Term
|
Definition
Groups have a logical order
-small
-medium
-large |
|
|
Term
|
Definition
Number of items or events
-# of new car sales in a week
-# of accidents in a year |
|
|
Term
|
Definition
Measurements made along a continuum
-gas mileage
-speed of a pitched ball |
|
|
Term
|
Definition
Shows the relative frequency of defects in rank order.
Used to pick the low-hanging fruit |
|
|
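A rank-ordered frequency table like the one described above can be built by counting each defect category, sorting from most to least frequent, and adding a running cumulative percentage. The sketch below is one possible way to do it; the defect log and category names are invented for illustration.

```python
from collections import Counter

# Hypothetical defect log; the categories are made up for illustration.
defects = [
    "wrong price", "wrong address", "wrong price", "duplicate invoice",
    "wrong price", "wrong address", "missing PO", "wrong price",
]

# most_common() returns (category, count) pairs, highest count first.
counts = Counter(defects).most_common()
total = sum(n for _, n in counts)

pareto = []      # rows of (category, count, percent, cumulative percent)
cumulative = 0
for category, n in counts:
    cumulative += n
    pareto.append((category, n, 100 * n / total, 100 * cumulative / total))

for category, n, pct, cum_pct in pareto:
    print(f"{category:<18} {n:>2} {pct:5.1f}% {cum_pct:6.1f}%")
```

The categories at the top of the table, which account for most of the cumulative percentage, are the natural first targets.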
Term
|
Definition
Takes into account severity ratings in addition to frequency |
|
|
Term
|
Definition
Simple visual that conveys a lot of information |
|
|
Term
|
Definition
Divide data into groups to reveal sub-patterns to help pinpoint cause of variation. |
|
|
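As a small illustration of dividing data into groups, the sketch below splits a set of observations by one factor and compares the group averages with the overall average. The records and the choice of "shift" as the grouping factor are hypothetical.

```python
from collections import defaultdict
import statistics

# Hypothetical records: (shift, billing errors found that day).
records = [
    ("day", 2), ("day", 3), ("night", 9), ("day", 1),
    ("night", 11), ("night", 10), ("day", 2),
]

# Divide the data into groups by shift.
groups = defaultdict(list)
for shift, errors in records:
    groups[shift].append(errors)

overall_mean = statistics.mean([errors for _, errors in records])
group_means = {shift: statistics.mean(values) for shift, values in groups.items()}

print(f"overall mean: {overall_mean:.2f}")
for shift, mean in group_means.items():
    print(f"{shift:<6} mean: {mean:.2f}")
```

The overall average hides the sub-pattern; the per-group averages show that, in this made-up data, the variation is concentrated in one group.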
Term
|
Definition
Targets that have both upper and lower limits |
|
|
Term
|
Definition
Average that is close to the target |
|
|
Term
|
Definition
|
|
Term
Mean |
|
Definition
Average of a distribution |
|
|
Term
|
Definition
Sum of all observations
----------------------------
Total number of observations |
|
|
Term
Median |
|
Definition
Divides the sorted data into 2 halves; it is the value in the middle |
|
|
Term
Mode |
|
Definition
Most frequently occurring value in the set |
|
|
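The three measures of central tendency above can be computed with Python's standard `statistics` module. The data set is made up; the mean is written out by hand to mirror the "sum of all observations over total number of observations" formula.

```python
import statistics

data = [2, 3, 3, 5, 7, 10]  # hypothetical observations

# Sum of all observations divided by the total number of observations.
mean = sum(data) / len(data)

# Middle value of the sorted data; with an even count, the average
# of the two middle values.
median = statistics.median(data)

# Most frequently occurring value in the set.
mode = statistics.mode(data)

print(mean, median, mode)  # 5.0 4.0 3
```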
Term
Common measurements of dispersion |
|
Definition
Range, variance, standard deviation |
|
|
Term
Range |
|
Definition
Difference between the largest and smallest values in the set |
|
|
Term
Variance |
|
Definition
Average squared distance between the mean and the individual observations |
|
|
Term
Standard deviation |
|
Definition
Positive square root of the variance |
|
|
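The three dispersion measures can be sketched as follows. The data set is made up. The definition above, an average of squared distances, divides by the number of observations, which matches the population variance (`statistics.pvariance`); the sample variance (`statistics.variance`) divides by one less than the number of observations instead.

```python
import statistics

data = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical observations, mean = 5

# Difference between the largest and smallest values.
data_range = max(data) - min(data)

# Average squared distance from the mean (divides by n).
variance = statistics.pvariance(data)

# Positive square root of the variance.
std_dev = statistics.pstdev(data)

print(data_range, variance, std_dev)
```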
Term
|
Definition
68-95-99.7% rule
About 68% of values under the normal curve fall within 1 SD of the mean, 95% fall within 2 SD, and 99.7% fall within 3 SD |
|
|
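The three percentages in the rule can be checked against the normal cumulative distribution function. This sketch uses `statistics.NormalDist` from the Python standard library (Python 3.8 or later); a standard normal curve (mean 0, SD 1) is assumed, although the percentages are the same for any normal curve.

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0, sigma=1)

# Probability of falling within k standard deviations of the mean.
coverage = {
    k: standard_normal.cdf(k) - standard_normal.cdf(-k)
    for k in (1, 2, 3)
}

for k, p in coverage.items():
    print(f"within {k} SD: {p:.1%}")
# within 1 SD: 68.3%
# within 2 SD: 95.4%
# within 3 SD: 99.7%
```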
Term
|
Definition
The x is the mean, median and mode |
|
|
Term
|
Definition
As sample size increases, the distribution of the sample mean tends toward a normal shape, regardless of the shape of the actual population |
|
|
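A small simulation can make this behavior visible. The sketch below draws many samples from a deliberately right-skewed population and measures how lopsided the distribution of sample means is. The exponential population, the sample sizes (2 and 50), the 2,000 trials, and the seed are all arbitrary assumptions; skewness is used here as a rough stand-in for "how far from the symmetric normal shape".

```python
import random
import statistics

def skewness(values):
    """Mean cubed z-score; 0 for a perfectly symmetric distribution."""
    mu = statistics.fmean(values)
    sigma = statistics.pstdev(values)
    return sum(((v - mu) / sigma) ** 3 for v in values) / len(values)

def sample_means(n, trials, rng):
    """Means of `trials` samples of size n from a right-skewed population."""
    return [
        statistics.fmean([rng.expovariate(1.0) for _ in range(n)])
        for _ in range(trials)
    ]

rng = random.Random(1)  # fixed seed (assumption) for repeatability

skew_small = skewness(sample_means(2, 2000, rng))
skew_large = skewness(sample_means(50, 2000, rng))

print(f"skewness of sample means, n=2:  {skew_small:.2f}")
print(f"skewness of sample means, n=50: {skew_large:.2f}")
```

With the larger sample size the skewness of the sample means moves much closer to 0, even though the population itself stays just as skewed.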
Term
|
Definition
how many standard deviations the observed value is from the mean |
|
|
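The quantity described above is the observed value minus the mean, divided by the standard deviation. A minimal sketch, with made-up numbers and a hypothetical helper name `z_score`:

```python
def z_score(observed, mean, std_dev):
    """Number of standard deviations the observed value lies from the mean."""
    return (observed - mean) / std_dev

# Hypothetical example: a process with mean 100 and SD 15.
z = z_score(130, mean=100, std_dev=15)
print(z)  # 2.0
```

A positive result means the observation is above the mean; a negative result means it is below.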