PA 705
Fall 2004
Homework Assignment # 7
Regression Analysis and Correlation
Due Nov. 17
I. Please do ORB problem #2 (p. 456) and #7 (p. 458).
II. Lab Work
Please attach your output.
Open Employee.sav
1. Choose two variables that you expect to be positively or negatively correlated. State your hypothesis and run the procedure to test it.
Was your hypothesis supported?
How strong is the relationship?
H1 As education increases, so does salary.
Ho There is no relationship between education and salary.
Test this hypothesis with a scatterplot. What does it suggest?
Test the hypothesis with a correlation procedure. What does your
finding suggest?
Run a bivariate regression procedure.
Is the hypothesis supported?
What would be your best guess as to what the salary would be for
an employee with no education?
For each year of education, how much salary is earned?
After doing additional research, we find that salary is often related to one's seniority in the agency. Add this second independent variable (months since hire).
How much, if any, does seniority add to the explanatory power of
our model?
Does education or seniority have a greater impact on salary?
What makes you say this?
We then find, to our dismay, that there is an error in recording months since hire and so the variable is not reliable. Meanwhile, our ongoing research has suggested that women and people of color are often paid less than men and nonminorities.
Re-run the regression procedure. Delete months since hire and
including gender and race. (Hint: you will need to look at Utilities/Variables
to see how they are coded before interpreting your output)
Did adding these variables improve the goodness of fit of the
model? On what evidence do you base your model?
What effect does being a women have on salary?
What effect does race have on salary?
Of the three independent variables, which has the greatest
impact on salary? What number tells you this?
The agency HR director argues that the effect that gender and race have on salary is really due to women and minorities holding different (i.e., lower paid) jobs than men and nonminorities
Is there a variable in this dataset that could be used to test
this assumption? Can we include it in the model? Why or why not?
Is there another means we could use to examine whether the difference between men/women's minorities/nonminorities is a function of occupational differences? If so, please do so and report your results.