Term
data that is measurable or countable |
|
Definition
|
|
Term
data with relative zero...this consider zero has value. Give example: |
|
Definition
interval data.
Temperature |
|
|
Term
data with absolute zero-- zero means no value is __________ . example: |
|
Definition
ratio data grade, height, weight, length, money |
|
|
Term
__________ is one that can take only a finite or countable set of values.
give example: |
|
Definition
discrete.
Example: # of blocks (can not have 1/2 of a block) # of attacks of asthma per week. # of children (cant have 1/2 of the kid) |
|
|
Term
Unit of measurement that can take on an infinite set of values. for example : weight blood pressure height age |
|
Definition
|
|
Term
meaningfully ordered categories is ___________. ( best to worst ranking, age categories) |
|
Definition
Ordinal data another defi.. ---are categorical data where there is a logical ordering to the categories. A good example is the Likert scale that you see on many surveys: 1=Strongly disagree; 2=Disagree; 3=Neutral; 4=Agree; 5=Strongly agree. |
|
|
Term
categories without a meaningful order identify ______________ . political affiliation, industry classification, ethnic/cultural groups) |
|
Definition
Nominal data
another defi:::: are categorical data where the order of the categories is arbitrary. A good example is race/ethnicity has values 1=White, 2=Hispanic, 3=American Indian, 4=Black, 5=Other. Note that the order of the categories is arbitrary.
Certain statistical concepts are meaningless for nominal data. For example it would be silly to ask what are the mean and standard deviation are for race/ethnicity |
|
|
Term
two category of qualitative ? two category of quantitative ? |
|
Definition
qualitative: nominal and ordinal
quantitative : discrete and continuous data |
|
|
Term
give more example of nominal data |
|
Definition
sex, alive or dead blood group o a b ab |
|
|
Term
give more example of ordinal data |
|
Definition
Copy paste from side 32..shit doesnt make sense but here it is...
Grade (i.e. of breast cancer) Better, same, worse Disagree, neutral, agree |
|
|
Term
in measurements out of 4 choice nominal ordinal interval ratio
which one is ...Ordinal measurement but the differences between the scores are equal? |
|
Definition
|
|
Term
in measurements out of 4 choice nominal ordinal interval ratio
which one is ..When the amt of variable is placed in order of magnitude? |
|
Definition
|
|
Term
n measurements out of 4 choice nominal ordinal interval ratio
which one is classification of variable into diff categories |
|
Definition
|
|
Term
n measurements out of 4 choice nominal ordinal interval ratio
which one is... measurement has a true zero point, absence of that variable |
|
Definition
|
|
Term
|
Definition
A subset or subgroup that should be representative of the population of interest. The scientific notation for sample is n. |
|
|
Term
__________ : generalization about a population based on sample data |
|
Definition
Statistical Inference
From wiki: Statistical inference is the process of drawing conclusions from data that are subject to random variation, for example, observational errors or sampling variation |
|
|
Term
what is Measure of reliability ? |
|
Definition
statement about the uncertainty associated with an inference |
|
|
Term
Question – a hospital conducts exit surveys from their emergency department, selecting every 10th person who exits the hospital. Is this a random sample? |
|
Definition
|
|
Term
|
Definition
Population – the set of units we are interested in learning about example – all 1450 emergency room patients seen |
|
|
Term
|
Definition
Variable – characteristic of an individual population unit (What are we measuring?) example – age, gender, admission diagnosis |
|
|
Term
Define Value in inferential statistics |
|
Definition
Value – what unit of measurement are we using? |
|
|
Term
end of Chap 1...god i want to committee suicide.... |
|
Definition
|
|
Term
in Pareto Diagram bars are arranged in which order |
|
Definition
Bars are arranged in decreasing order from Left to Right. Used for targeting major causes. |
|
|
Term
to achieve our goal.. what are 3 main aspects of distribution of data is shown on graph? |
|
Definition
- think : SCS
Shape – look at the graphs
Center – what number is in the middle?
Spread – how much variation? |
|
|
Term
what kind of graph of a distribution that when folded in half, produces two identical shapes? |
|
Definition
|
|
Term
in what shape graph a distribution in which scores are clustered at one end, and rarity of scores (the tail) occur on the other end. |
|
Definition
|
|
Term
the graph is ___________ if the tail (rare scores) occurs for the high scores to the right |
|
Definition
|
|
Term
the graph is ___________ if the tail (rare scores) occurs for the low scores to the left |
|
Definition
negatively skewed .. /\ / / |
|
|
Term
tendency of data to center about certain numerical values is called ________ name 3 that are used commonly |
|
Definition
central tendency
Mean Median Mode |
|
|
Term
Define mean, median and mode |
|
Definition
Mean: the sum of scores divided by the number of scores (average)
Median: the score with an equal amount of scores above and below it (50th percentile)
Mode: the score that occurs the most often |
|
|
Term
|
Definition
The Median Middle number when observations are arranged in order (not always the “number in the middle” - repeat values change relation to real limits) Median denoted by m Identified as : the observation if n is odd the mean of the middle two observations if n is even |
|
|
Term
|
Definition
The most frequently occurring value in the data set If all values are different, there is no mode. Mode can be unimodal, bimodal,or multimodal
Data displayed in a histogram will have a modal class – the class with the largest frequency |
|
|
Term
what do you know about dispersion?
know 5 things |
|
Definition
Refers to the variety exhibited by a set of observations
If all values in a data set are the same, there is no dispersion,
If the values are not all the same, dispersion is present
If values are different and close together, the dispersion is small
If the values are different and far apart, the dispersion is greater |
|
|
Term
Other terms used synonymously with dispersion are: |
|
Definition
oo this is a good test question ..lol
-Variation -variability - Spread - Scatter |
|
|
Term
variability the spread of the data across possible values 3 commonly used measures of variability are ? |
|
Definition
1) Range 2) variance 3) standard deviation. |
|
|
Term
|
Definition
largest measurement minus the smallest measurement. |
|
|
Term
what is Useful as a measure of variation within a given set of data |
|
Definition
|
|
Term
A Normal distribution with a mean of zero and a standard deviation of 1 is also known as the |
|
Definition
Standard Normal Distribution |
|
|
Term
z-core --dont know what it is..look it up |
|
Definition
Z-score – the distance between a measurement x and the mean, expressed in standard units |
|
|
Term
what is percentile rankings |
|
Definition
Percentile rankings make use of the pth percentile The median is an example of percentiles. Median is the 50th percentile (50% of observations lie above it, and 50% lie below it)
For any p, the pth percentile has p% of the measures lying below it, and (100-p)% above it |
|
|
Term
|
Definition
measurements between inner and outer fences are suspect measurements beyond outer fences are highly suspect |
|
|
Term
to detect outliers using z score |
|
Definition
Z-scores Scores of 3 in mounded distributions (2 in highly skewed distributions) are considered outliers |
|
|
Term
In statistical significance testing, the p-value is the probability of obtaining a test statistic at least as extreme as the one that was actually observed, assuming that the null hypothesis is true |
|
Definition
|
|
Term
|
Definition
The null hypothesis is stated, “There is NO difference between the two study groups note : More scientifically stated, the null hypothesis would be: “In patients with CAD, there is no difference in the cumulative five-year incidence of MIs between patients assigned to take ASA daily and patients assigned to take Med B daily”. |
|
|
Term
Alternated hypothesis
note : Statistical testing is always done as an exercise to disprove the null hypothesis |
|
Definition
n research, an alternate hypothesis, or research hypothesis, is generated which disagrees with the null hypothesis .
The alternate hypothesis would be stated: “In patients with CAD, there IS a difference in the cumulative five-year incidence of MIs between patients assigned to take ASA daily and patients assigned to take Med B daily”. |
|
|
Term
two tailed alternate hypothesis .. |
|
Definition
There is a difference between ASA and Med B regarding the incidence of MIs in CAD patients. (Perhaps ASA is better than Med B, or perhaps Med B is better than ASA. The two-tailed hypothesis does NOT state the direction of difference). One-Tailed Alternate Hypothesis: ASA is better than Med B. One-Tailed Alternate Hypothesis: Med B is better than ASA. |
|
|
Term
which tailed hypothesis should be chosen? |
|
Definition
Most authorities advocate a two-tailed approach unless there exists strong evidence that the one-tailed alternative hypothesis could not go in any other direction. |
|
|
Term
ooo just know this ..this is red
Another guidepost is needed to help the researcher decide if a result is likely due to chance alone, meaning it is not statistically significant, or if it is unlikely due to chance alone, meaning it is statistically significant |
|
Definition
|
|
Term
That point, or the line in the sand, is most often set at 0.05 and it is know as Alpha (a ). |
|
Definition
Alpha region: The most commonly accepted alpha criterion is 5%. (0.05)--this might be on test. she repeated 3 times in the powerpoint |
|
|
Term
|
Definition
When the results of our statistical test lead us to arrive at a false conclusion we have made a type I (alpha) or type II (beta) error.
When we falsely reject the null hypothesis, we have committed a type I error. The type I error corresponds to the alpha, or the criterion for rejection of the null.
When we falsely accept the null hypothesis, we have committed a type II error. This can happen when our sample size is too small to detect the differences in the variables being studied. |
|
|
Term
example of type 1 and type II error |
|
Definition
Type I Error – (false alarm) innocent person is found guilty
Type II Error – (missing a detection) guilty person is found innocent |
|
|
Term
Parametric tests are significance tests for what what kind of data? |
|
Definition
data from interval or ratio scales |
|
|
Term
Nonparametric tests are used to test hypotheses with what kind of data? |
|
Definition
nominal and ordinal data. |
|
|
Term
slide no 10 chart in chap 11 impt.
Decision tree for hypothesis testing |
|
Definition
population SD known ?
if yes: z-score
if no: t-score |
|
|
Term
Z test is used with what size sample |
|
Definition
|
|
Term
T-test is used with what kind of sample size |
|
Definition
The t-test is appropriate when the population variances are not known but the population distribution is normal |
|
|