Term
|
Definition
True score variance / total variance |
|
|
Term
|
Definition
|
|
Term
Sources of Error Variance |
|
Definition
Test construction; test administration; test scoring/interpretation; other |
|
|
Term
|
Definition
Good for stable traits, not for emotional states. Drawbacks: practice effects, memory changes, fatigue, and motivation |
|
|
Term
|
Definition
Test-retest after 6 months |
|
|
Term
Coefficient of Equivalence |
|
Definition
Used with alternate or parallel forms |
|
|
Term
|
Definition
An estimate of inter-item consistency. A poor choice for heterogeneous tests and speed tests |
|
|
Term
|
Definition
Used to adjust split-half reliability. Also useful for estimating how many items to add to improve reliability, or for estimating internal consistency |
|
|
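The adjustment this card describes is commonly made with the Spearman-Brown formula, r_new = n*r / (1 + (n - 1)*r), where n is the factor by which test length changes. The notes do not name the formula, so treat the following as a minimal sketch under that assumption; the function names and example values are illustrative.

```python
def spearman_brown(r, n):
    """Predicted reliability when test length is multiplied by n."""
    return (n * r) / (1 + (n - 1) * r)


def length_factor_needed(r_current, r_target):
    """Factor by which the test must be lengthened to reach r_target."""
    return (r_target * (1 - r_current)) / (r_current * (1 - r_target))


# Correct a split-half correlation of 0.70 to full test length (n = 2).
full_length = spearman_brown(0.70, 2)

# How much longer must a test with reliability 0.70 be to reach 0.90?
factor = length_factor_needed(0.70, 0.90)
```

Multiplying the current item count by `factor` gives a rough estimate of how many items the lengthened test would need, assuming the added items are comparable to the existing ones.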
Term
|
Definition
Measures different factors, content areas, and topics. Example: MMPI |
|
|
Term
|
Definition
One factor/topic/content area |
|
|
Term
|
Definition
Statistic of choice for determining inter-item consistency. Good for dichotomous items (right or wrong) |
|
|
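This description matches the Kuder-Richardson formula 20 (KR-20), although the notes do not name it. A minimal sketch assuming that formula, KR-20 = (k / (k - 1)) * (1 - sum(p*q) / variance of total scores), with the population variance used for the totals; the data set is made up for illustration.

```python
def kr20(responses):
    """responses: one row per testtaker, each row a list of 0/1 item scores."""
    n = len(responses)
    k = len(responses[0])
    # Proportion of testtakers passing each item.
    p = [sum(row[i] for row in responses) / n for i in range(k)]
    sum_pq = sum(pi * (1 - pi) for pi in p)
    # Population variance of total scores.
    totals = [sum(row) for row in responses]
    mean_total = sum(totals) / n
    var_total = sum((t - mean_total) ** 2 for t in totals) / n
    return (k / (k - 1)) * (1 - sum_pq / var_total)


# Hypothetical right/wrong data: 5 testtakers, 4 items.
data = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]
r_kr20 = kr20(data)
```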
Term
|
Definition
Cronbach's. Most widely used for estimating internal consistency. Can be used with nondichotomous items such as Likert scales. A value of .9 or higher may indicate redundancy |
|
|
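Cronbach's coefficient alpha is usually computed as alpha = (k / (k - 1)) * (1 - sum of item variances / variance of total scores). The sketch below assumes that formula and uses made-up Likert responses; the names are illustrative.

```python
from statistics import variance


def cronbach_alpha(responses):
    """responses: one row per respondent, each row a list of item scores."""
    k = len(responses[0])
    item_vars = [variance([row[i] for row in responses]) for i in range(k)]
    total_var = variance([sum(row) for row in responses])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical 5-point Likert data: 5 respondents, 3 items.
likert = [
    [4, 5, 4],
    [2, 3, 2],
    [5, 5, 4],
    [1, 2, 1],
    [3, 3, 3],
]
alpha = cronbach_alpha(likert)
```

Because the same variance estimator is used for the items and the totals, the ratio (and so alpha) is the same whether sample or population variance is chosen.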
Term
All indices of reliability provide an index that is? |
|
Definition
Characteristic of a particular group of test scores, NOT the test |
|
|
Term
Need more reliability depending on: |
|
Definition
The test's implications; how many tests are being used in the situation; the purpose of the reliability estimate |
|
|
Term
Does a homogeneous or a heterogeneous test have high inter-item consistency? |
|
Definition
Homogeneous. Use test-retest with heterogeneous tests |
|
|
Term
Does test-retest work with dynamic or static characteristics? |
|
Definition
Static characteristics |
|
|
Term
|
Definition
Example: the effect of 11th-grade ACT scores on first-year grades in grad school. You only get scores from people who made it to 11th grade and then on to grad school |
|
|
Term
Criterion-referenced test |
|
Definition
Compares a testtaker to an objective or standard. Traditional reliability estimates are not appropriate for this type of test |
|
|
Term
|
Definition
Need to take a representative sample out of the universe of items |
|
|
Term
|
Definition
Your score generalizes to the universe |
|
|
Term
Standard Error of Measurement |
|
Definition
Provides a measure of the precision of an observed test score. Higher SEM = lower reliability. Mean plus or minus error; e.g., a poll is accurate plus or minus 3% |
|
|
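One common way to compute the standard error of measurement is SEM = SD * sqrt(1 - r), where SD is the standard deviation of the test scores and r is the reliability coefficient. A minimal sketch under that assumption; the 1.96 multiplier (a 95% band) and the example values are illustrative choices, not from the notes.

```python
import math


def standard_error_of_measurement(sd, reliability):
    """SEM grows as reliability falls."""
    return sd * math.sqrt(1 - reliability)


def score_band(observed, sem, z=1.96):
    """Plus-or-minus band around an observed score (z = 1.96 for ~95%)."""
    return (observed - z * sem, observed + z * sem)


sem = standard_error_of_measurement(sd=15, reliability=0.91)
low, high = score_band(100, sem)
```

Lowering `reliability` in the call above widens the band, which is the "higher SEM = lower reliability" relationship on the card.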
Term
|
Definition
The "plus or minus 3" part |
|
|
Term
Standard Error of Difference |
|
Definition
Scores can change from one administration to the next for reasons other than error. Use this statistic to see whether the difference is significant |
|
|
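The standard error of the difference is often computed from the two scores' standard errors of measurement as SED = sqrt(SEM1^2 + SEM2^2). The sketch below assumes that formula and a 1.96 cutoff for significance; both the cutoff and the example scores are illustrative.

```python
import math


def standard_error_of_difference(sem1, sem2):
    """Combine two standard errors of measurement."""
    return math.sqrt(sem1 ** 2 + sem2 ** 2)


def is_significant(score1, score2, sed, z=1.96):
    """True if the score difference exceeds z standard errors of difference."""
    return abs(score1 - score2) > z * sed


sed = standard_error_of_difference(3.0, 4.0)
big_change = is_significant(100, 112, sed)
small_change = is_significant(100, 108, sed)
```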
Term
|
Definition
A process of gathering and evaluating evidence of validity |
|
|
Term
The Big Three (trinitarian view) |
|
Definition
Content, criterion, construct |
|
|
Term
|
Definition
Feels the Big Three is incomplete; he wants to include societal values and the consequences of how the test is being used |
|
|
Term
|
Definition
How adequately a test samples behavior and represents the universe of behavior it was designed to measure |
|
|
Term
|
Definition
Developed a method for giving courts evidence that a test is work related. A panel of experts rates each item as essential, useful, or not necessary. (Content validity) |
|
|
Term
|
Definition
Content validity ratio: CVR = (ne - N/2) / (N/2). Positive is good |
|
|
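The content validity ratio can be worked through in a few lines. This sketch assumes ne is the number of panelists rating the item "essential" and N is the total number of panelists; the panel counts are made up.

```python
def content_validity_ratio(n_essential, n_panelists):
    """CVR = (ne - N/2) / (N/2); ranges from -1 to +1."""
    half = n_panelists / 2
    return (n_essential - half) / half


# Hypothetical 10-person panel.
cvr_strong = content_validity_ratio(9, 10)   # most panelists say "essential"
cvr_neutral = content_validity_ratio(5, 10)  # exactly half say "essential"
cvr_weak = content_validity_ratio(2, 10)     # few say "essential"
```

The ratio is positive only when more than half of the panel rates the item essential.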
Term
|
Definition
How well a test score can be used to infer an individual's probable standing on some measure of interest (the criterion) |
|
|
Term
|
Definition
Reliable, relevant, valid, uncontaminated |
|
|
Term
|
Definition
The relationship between test scores and a criterion measured at the same time. Good for depression |
|
|
Term
|
Definition
A test score obtained at one time, checked against a criterion measured in the future |
|
|
Term
Three things to look at when evaluating validity |
|
Definition
Appropriate sample; attrition rates; appropriate criterion |
|
|
Term
|
Definition
Using multiple predictors: each added predictor should explain something new |
|
|
Term
|
Definition
The likelihood that a testtaker will score within some interval on the criterion. Chi-squared |
|
|
Term
|
Definition
Tells you whether including a new measure will improve your base rate |
|
|
Term
|
Definition
Tells the difference in average criterion scores for the selected group versus the original group |
|
|
Term
|
Definition
Base rate, hit rate, and miss rate |
|
|
Term
|
Definition
A construct is developed to describe or explain behavior. Construct validity means that high and low scorers behave as predicted |
|
|
Term
All types of validity evidence are forms of... |
|
Definition
|
|
Term
|
Definition
Statistical procedure that groups people or things together |
|
|
Term
|
Definition
Systematic variance: something in the test favors one group over another |
|
|
Term
|
Definition
Leniency; severity; central tendency; halo effect |
|
|
Term
Five steps of Test Construction |
|
Definition
1. Test conceptualization 2. Test construction 3. Test tryout 4. Item analysis 5. Test revision |
|
|
Term
|
Definition
Use known masters and known novices to do preliminary research |
|
|
Term
|
Definition
The process of setting rules for assigning numbers in measurement |
|
|
Term
|
Definition
Nominal, ordinal, interval, ratio; age scale; grade scale; stanine scale |
|
|
Term
|
Definition
1 to 10, with 1 being happiest and 10 being least happy |
|
|
Term
|
Definition
Sum up responses. Good for Likert scales |
|
|
Term
|
Definition
|
|
Term
|
Definition
Organize from most to least like you |
|
|
Term
|
Definition
Predetermined categories already named |
|
|
Term
|
Definition
|
|
Term
|
Definition
|
|
Term
|
Definition
Short answer, fill in the blank, essay |
|
|
Term
Class or category scoring |
|
Definition
Credit toward placement in a class or category. Example: DSM checklist for depression |
|
|
Term
|
Definition
Compare your scores to another one of your scores |
|
|
Term
|
Definition
5 to 10 people per item. Use the target population |
|
|
Term
|
Definition
p = (number of students who got the item right) / (total test takers) |
|
|
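The proportion on this card is simple to compute from scored responses. A minimal sketch, assuming each response is coded 1 for right and 0 for wrong; the data are made up.

```python
def item_difficulty(item_scores):
    """Proportion of testtakers who answered the item correctly."""
    return sum(item_scores) / len(item_scores)


# Hypothetical item: 10 testtakers, 1 = right, 0 = wrong.
scores = [1, 1, 0, 1, 0, 1, 1, 0, 1, 1]
p = item_difficulty(scores)
```

A larger p means more testtakers got the item right, so the item is easier.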
Term
|
Definition
s = sqrt(p * (1 - p)); IVI = s * rc |
|
|
Term
|
Definition
s * rt, where rt = correlation between item score and total test score |
|
|
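The two indices on the preceding cards share the same shape: the item's standard deviation times a correlation. The sketch below assumes rc is the correlation between item score and a criterion score (the notes define only rt), uses a hand-written Pearson correlation, and runs on made-up data.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def item_sd(item_scores):
    """s = sqrt(p * (1 - p)) for a 0/1 item."""
    p = sum(item_scores) / len(item_scores)
    return math.sqrt(p * (1 - p))


def item_index(item_scores, other_scores):
    """s times the correlation of the item with another set of scores."""
    return item_sd(item_scores) * pearson(item_scores, other_scores)


item = [1, 1, 0, 1, 0, 0]                    # hypothetical 0/1 item scores
totals = [9, 8, 4, 7, 5, 3]                  # hypothetical total test scores
criterion = [3.5, 3.0, 2.0, 3.2, 2.4, 2.1]   # hypothetical criterion scores

item_reliability = item_index(item, totals)   # s * rt
item_validity = item_index(item, criterion)   # s * rc (rc meaning is assumed)
```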
Term
Item-discrimination Index |
|
Definition
d = (U - L) / N. How well an item separates high scorers from low scorers. A negative number is bad |
|
|
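A short worked version of the discrimination index follows. It assumes U and L are the numbers of testtakers in the upper and lower scoring groups who got the item right, and that N is the size of each group; the notes do not spell this out, and the counts are made up.

```python
def item_discrimination(upper_correct, lower_correct, group_size):
    """d = (U - L) / N, with N taken as the size of each group (assumed)."""
    return (upper_correct - lower_correct) / group_size


# Hypothetical upper and lower groups of 32 testtakers each.
d_good = item_discrimination(20, 16, 32)  # more high scorers got it right
d_bad = item_discrimination(10, 25, 32)   # more low scorers got it right
```

A negative result, as in `d_bad`, flags an item that low scorers pass more often than high scorers.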
Term
Item-characteristic curve |
|
Definition
Graph of ability (low to high) versus probability of a correct response |
|
|
Term
|
Definition
Like the illness causality scale for children--what they know about children assumes their level of cognitive development |
|
|