Friday, October 4, 2013

Regression, Part 1


Jamie asks: When you did blogs on several of the statistical analyses, you did not do one on regression. Would that be one you would consider blogging about? I've been getting stuck on that.  

Of course, Jamie! I am going to assume that you are interested in multiple regression. This is a little more complicated to try to cover in a blog post, but let's give it a try! Multiple regression is a more sophisticated extension of correlation and is used when you want to explore the predicative ability of a set of independent variables (IV) on one continuous dependent measure (DV). There are different types of multiple regression that allow you to compare the predictive ability of particular independent variables ad find the best set of variables to predict a dependent variable. I will be looking at a standard multiple regression. 

An example might be the time to complete an exam (IV1), the person's grade on the exam (IV2), and perceived stress (DV). Our research questions are – 1) How well does time on the exam and grade on exam predict perceived stress? How much variance in perceived stress scores can be explained by scores on these two IVs? 2) Which is the best predictor of perceived stress: time or grade? 

Let's do an example of a multiple regression together. So open SPSS, first go to Edit on the menu, select Options and make sure there is a check in the box No scientific notation for small numbers in tables. Enter the following data for your sample: 

Under Variable view (see tab at bottom of page), It should look like: 

Name
Type
Width
Decimals
Label
Values
Ignore the rest
Examtime
numeric
8
0
Time to complete exam
None
Ignore the rest
Grade
numeric
8
0
Exam grade
None
Ignore the rest
Stress
numeric
8
0
Perceived stress
None
Ignore the rest

 
Go back to Data View and enter the following: 

Examtime
Grade
Stress
20
63
85
45
89
65
36
75
82
59
92
45
56
96
50
27
66
90
39
70
77
52
89
70
43
82
85
55
99
47

 
Go to Analyze/ Regression/Linear. Move your continuous DV (stress) into the Dependent box. 
Move your IVs (exam time and grade) into the independent box 
For Method, make sure Enter is selected.
Click on the Statistics button
             Select the following: Estimates, Confidence Intervals, Model fit, Part and partial correlations, and Collinearity diagnostics
             In the Residuals section, select Casewise diagnostics and Outliers outside 3 standard deviations. Click on Continue.
Click on the Plots button
             Click on *XRESID and move to the Y box
             Click on *ZPRED and move to the X box
In the section labeled Standardized Residual Plots, tick the normal probability plot option. Click on continue
Click on the Save button
             In the section labeled Distances, select Mahalanobis box and Cook's
Click on Continue and then OK 

Next time we will look at the output and interpretation. Do you have an issue or a question that you would like me to discuss in a future post? Would you like to be a guest writer? Send me your ideas! Send me an email with your ideas. leann.stadtlander@waldenu.edu 

No comments:

Post a Comment