Term
Define what Stepwise Regression is |
|
Definition
Stepwise – very old and flawed approach to figuring out a step of predictors that is going to be good. Not systematic
Example: Variable Rxy (validities) Age .17 Gender .25 Education .09 Training .37 Motivation .03 Conscientious .28
Training would go in first since it has the highest correlation, then compute all partial correlations. Based off the partial correlations, the highest correlation is then selected and so on… |
|
|
Term
Define Hierarchical Regression |
|
Definition
look at gain in prediction from adding additional predictors. Systematic buildup of an argument.
Example:
• Yhat = b1(age) + b2(gender) + b0 • Yhat = b1(age) + b2(gender) + b3(education) + b4(training) + b0 • Yhat = b1(age) + b2(gender) + b3(education) + b4(training) + b5(motivation) + b6(conscientiousness) + b0 |
|
|
Term
Suppose you have a categorical variable with four groups, e.g. four regions of the country. You wish to use it as a predictor in a regression analysis. What is the general strategy for employing a categorical variable as a predictor in a regression analysis. |
|
Definition
• Create a set of covariables and use the coveriables in the regression analysis so that you are not implying any order. |
|
|
Term
In any coding scheme for g groups, how many codes are required to characterize the g groups? |
|
Definition
|
|
Term
A dummy variable coding scheme with group 3 as the baseline group |
|
Definition
G D1 D2 1 1 0 2 0 1 3 0 0
YHAT = B1d1 + B2D2 + B0 |
|
|
Term
an unweighted effects coding scheme with group 3 as the baseline group; |
|
Definition
G UE1 UE2 1 1 0 2 0 1 3 -1 -1
YHAT = B1UE1 + B2UE2 + B0 |
|
|
Term
a series of orthogonal contrast codes. For example, be able to code these contrasts: contrast code that contrasts the mean of the first group with the average of the means of the second and third groups; a contrast code that contrasts the mean of the second group with the mean of the third group. |
|
Definition
G C1 C2 1 2 0 2 -1 1 3 -1 -1 |
|
|
Term
For dummy coding, be able to take the general regression equation and the codes and explain what each of the coefficients in the equation is measuring
Example: YHAT = b1D1 + b2D2 + b0 |
|
Definition
B0 = the mean of the base group
B1 = the difference between (the mean of group 1) – (the mean of group 3 [base group])
B2 = the difference between (the mean of group 2) – (the mean of group 3 [base group]) |
|
|
Term
Are dummy codes centered? |
|
Definition
|
|
Term
Are the pairs of dummy codes in a dummy variable coding scheme orthogonal? |
|
Definition
|
|
Term
What sort of data configuration lends itself to coding with dummy codes? |
|
Definition
A definite control group and a definite base group |
|
|
Term
Suppose you run a one factor analysis of variance on a data set like that in the DATA LAYOUT above, q.6 (three groups). What will be the relationship of the resulting ANOV summary table to a regression analysis in which a set of dummy codes are used to code the three groups, and the criterion is the same as the dependent variable in the regression analysis? Explain the relationship of hypothesis for the overall F test in ANOV and the F test from this corresponding regression analysis. (Handout 10, page 5). |
|
Definition
ANOVA AND REGRESSION TABLE ARE EXACTLY THE SAME
Regression Anova Ss regression Ss between SS residual SS within Df Df (same) H0 = p2Y1d1d2 Ho = M1 = M2 = M3
null hypothesis for regression: Group members has no relationship to the criterion in both hypothesis |
|
|
Term
For unweighted effects coding, be able to take the general regression equation and the codes and explain what each of the coefficients in the equation is measuring if the groups are of equal size (see Handout 10, page 8). Be able to show how to find the deviation of the baseline group from the grand mean when unweighted effects codes are applied in the equal group case.
Example: YHAT = b1UE1 + b2UE2 + b0 |
|
Definition
B0 = the unweighted grand mean (4 + 8 + 6) / 3 = 6 B1 = the difference between (the mean of group 1) – (the grand mean) B2 = the difference between (the mean of group 2) – (the mean of group 3 [base group]) o Example: • GR1 = MEAN 4 ; GR2 = MEAN 8 ; GR3 = MEAN 6 • B0 = 6 • B1 = 4 – 6 = 2 • B2 = 8 – 6 = 2
Group 3 – the grand mean = -(b1 + b2) = -(-2 + 2) |
|
|
Term
How does the ANOV summary table for the data in the DATA LAYOUT relate to the analysis of regression summary table when the groups are coded with a set of unweighted effects codes? |
|
Definition
Identical in every respect |
|
|
Term
Are unweighted effects codes centered for equal group size? for unequal group size? |
|
Definition
Are centered for equal sample size
Not centered for unequal sample size |
|
|
Term
Are unweighted effects codes orthogonal? |
|
Definition
|
|
Term
Consider gender as a dummy coded varaible, 1=male, 0=female. Suppose you have the coefficients for the overall regression equation, where X is continuous and D is a dummy code
Yhat = .4X + .3 D + .2XD + 1.5
Explain what each of the four coefficients (including the intercept) measure. |
|
Definition
|
|
Term
Write the simple regression equation:
Yhat = .4X + .3 D + .2XD + 1.5 |
|
Definition
Yhat = .4X + 1.5
note: B1X & b0 only apply to the group coded 0 (females) |
|
|
Term
How would you get a simple regression for the group coded one (males) |
|
Definition
The easiest thing would be to reverse the codes, so that 1=female, 0=male, and rerun the regression equation. |
|
|
Term
What is a design matrix? What does it contain? |
|
Definition
A design matrix is a set of UE Codes
It captures all the effects in a design, all the factors and their interactions. |
|
|
Term
In a design matrix, what is the relationship of the degrees of freedom for the ANOV summary table and the number of effect codes. |
|
Definition
they are both the same
Each code has 1 degree of freedom. |
|
|
Term
What does it mean to say that with effects codes, the ANOVA partitions of variation are orthogonal? |
|
Definition
The variation is not overlapping |
|
|
Term
Now suppose you have unequal n's. Are the partitions of variation still orthogonal |
|
Definition
|
|
Term
What is Type III partialing in SAS GLM, in SPSS GLM? Regression or "unique" partialing in SPSS MANOVA? |
|
Definition
Each effect with all other effects partialed out |
|
|
Term
Show the regression equation for a two-group experiment with a continuous variable included. In the analysis of covariance, what is the categorical variable called? the continuous control variables? |
|
Definition
ANCOVA: YHAT = B1X [covariate] + b2C [categorical] + b0 |
|
|
Term
Show the difference in the data configuration for ANCOVA in a true experiment versus in a quasi-experiment. Why is there no correlation between the covariate and the variate in the true experiment? With what two things can the covariate be correlated in a quasi-experiment with nonrandom assignment? |
|
Definition
• In true experiment, categorical variable of treatment is randomly assigned. No correlation between X and C o X [Y] C (X and C are independent while overlapping with Y) o X = math ability of parent; Y = math achievement ; C = random treatment o Partial out individual difference with math ability based on parent ability. Gives more power because it partials out X and leaves [Y] and C overlap with the X taken out.
• Quasi-experiment, non-random assignment so that the treatment group membership can relate to the covariate o XC[Y] (all are overlapping circles) o X = math ability; Y = math achievement ; C = non-randomized treatment o School by intel facility example |
|
|
Term
What is a within class regression line in the ANCOVA? What assumption is made about within class regression lines in analysis of covariance? If the assumption is met, is the treatment effect constant over all levels of the covariate? |
|
Definition
Take the covariate (parent math ability) with criterion (child math achievement) and plot the regression of online group, within class regression. Plot the regression of the classroom group, within class regression.
Assumption: Slope is homogeneous
The treatment effect is constant |
|
|
Term
If the covariate (continuous variable) interacts with the categorical variable in ANCOVA, what does this tell you about the slopes of the within class regression lines in the groups of the ANCOVA? What is the difficulty in coming up with an estimate of the treatment effect in ANCOVA if the covariate interacts with the categorical treatment variable? |
|
Definition
Not equal (lines not parallel, interaction between the covariate X and the treatment C). Treatment effect is conditional on the covariate. |
|
|
Term
Write a regression equation for a categorical predictor "C", a continuous predictor "X", and their interaction. Rearrange this equation into the simple regression equation for the regression of Y on X at values of the categorical variable C. |
|
Definition
1. Yhat = b1X + b2C + b3XC + b0 2. Yhat = (b1 + b3C)X [simple slope] + (b2C + b0) [simple intercept] |
|
|
Term
What is meant by a fixed effects regression model? Under what condition will all inferences be correct in a fixed effects multiple regression, even if we merely sample cases and observe both X and Y? |
|
Definition
1: all the predictors are measured without error. 2: they take on a fix range of values (i.e. we take 1 year olds, two year olds, etc… not a continuum).
If we have multivariate normality of all variables, what that does gives us normally distributed residuals. We are find to inference, as long as we have normally distributed data. |
|
|