There may indeed be a better way to do this, but here's my current thinking. I have a survey of 500 students at a university and have performed principal component analysis. Two factors are relevant for my research question, and I have output a score for each student so that there are now 2 new variables in my data (one I've named compliance, one I've named strategising).
I now have an extra 100 students who responded to the same survey but did so online (the first 500 were on paper). I want to know if there is a difference between those who complete online and those who were encouraged to fill in a sheet while they were in a lecture. They might take more time to think, might be more engaged, have stronger views - whatever. So here's what I want to do:
1. create a score for each of the 100 new students based on the variables from the previous factor analysis (where the new 100 students are not part of the analysis, they're just given a score from the 'formula' that is effectively my new variable).
2. perform a second factor analysis/PCA with all 600 students, creating new variable scores for each student on the new factors which are closest in interpretation to the old ones.
3. Compare the scores using Pearson correlation (or maybe chi-sq would be better?) to see how much difference it makes adding the extra 100 students.
I hope this makes sense. Please let me know if you know a way to do this, or a better way to achieve a similar outcome. All I can think of at the moment is to write an expression using the factor loadings as multipliers, but I have up to 35 contributing variables so it would be a bit of a pain.
Thank you
Mark
I now have an extra 100 students who responded to the same survey but did so online (the first 500 were on paper). I want to know if there is a difference between those who complete online and those who were encouraged to fill in a sheet while they were in a lecture. They might take more time to think, might be more engaged, have stronger views - whatever. So here's what I want to do:
1. create a score for each of the 100 new students based on the variables from the previous factor analysis (where the new 100 students are not part of the analysis, they're just given a score from the 'formula' that is effectively my new variable).
2. perform a second factor analysis/PCA with all 600 students, creating new variable scores for each student on the new factors which are closest in interpretation to the old ones.
3. Compare the scores using Pearson correlation (or maybe chi-sq would be better?) to see how much difference it makes adding the extra 100 students.
I hope this makes sense. Please let me know if you know a way to do this, or a better way to achieve a similar outcome. All I can think of at the moment is to write an expression using the factor loadings as multipliers, but I have up to 35 contributing variables so it would be a bit of a pain.
Thank you
Mark