:wave: Hi... This is my first post. After searching the talkstats archives for information on regression analysis, I am turning to talkstats members for help. The last stats class I took was about 5 years ago. I am currently working on a research study using a secondary data set (non-random sample), and I am looking for variables that predict program outcomes. My planned method of analysis is: (1) examine the associations between all variables, using Pearson's chi-square tests, and t-tests to compare sample means between the two ethnic groups in the study; (2) clean up the data, i.e. deal with missing data and outliers among the IVs, all of which may affect the regression analysis; (3) enter all variables at once in a backward logistic regression.
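
In case it helps to see step (1) outside of SPSS, here is a rough sketch in Python with SciPy. Everything in it is made up for illustration: the data are randomly generated, and the names `group`, `outcome`, and `age` are placeholders, not variables from my data set. It only shows the shape of the two tests (a Pearson chi-square on a 2x2 table and a t-test comparing the two groups), not my actual analysis.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)

# Hypothetical data (not from the real study): 200 participants,
# two ethnic groups coded 0/1, a binary program outcome, and one
# continuous predictor called "age".
n = 200
group = rng.integers(0, 2, size=n)
outcome = rng.integers(0, 2, size=n)
age = rng.normal(loc=35, scale=8, size=n)

# Build the 2x2 contingency table of group by outcome.
table = np.zeros((2, 2), dtype=int)
np.add.at(table, (group, outcome), 1)

# Pearson chi-square test of association. correction=False turns off
# the Yates continuity correction that SciPy applies to 2x2 tables
# by default, so the statistic is the plain Pearson chi-square.
chi2, p_chi2, dof, expected = stats.chi2_contingency(table, correction=False)

# Two-sample t-test comparing mean age between the two groups.
# equal_var=False gives Welch's version, which does not assume
# equal variances in the two groups.
t_stat, p_t = stats.ttest_ind(age[group == 0], age[group == 1], equal_var=False)

print(f"chi-square = {chi2:.3f}, df = {dof}, p = {p_chi2:.3f}")
print(f"t = {t_stat:.3f}, p = {p_t:.3f}")
```

With random data like this the p-values mean nothing; the point is only which test goes with which pair of variables (categorical vs. categorical for the chi-square, continuous vs. group for the t-test).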

Questions: Am I on the right track? Should I clean up the data before comparing variables? So far I have been running Pearson chi-square tests and t-tests on the data set in SPSS. What criteria does an independent variable have to meet (e.g. strength of association) to be entered into the regression analysis?

Any help will be greatly appreciated,

Thanks,

dataquest

