We work with whole populations not samples (I have access to the population data). There are normally thousands or tens of thousands of cases.
I could validate the model, I am not sure how to really. I use hold out data sets but only for time series.
LASSO is something I should consider. I worked on it a year or so ago and it exists in SAS, but I have never actually used it.
My original question, and this was not clear, is the validity of running a model and then throwing out variables that are not statistically significant. Based on Karabiner's comments I realize this may make no sense because I have the population (although I guess you could argue effects can vary over time so I have a chronological subsample - something I have never seen addressed).
Hlsmith although I have worked here a long time, I have never been a counselor and am not a SME in what we do. I have little first hand experience in this. I do data queries largely, and sometimes analysis. I have spent a lot of time reading the literature, which is what I base my analysis on.