Problems with the significance of a subset


New Member
I have performed a hedonic regression with a database where all the variables are very significant and according to the calculations made in R with a squared R = 0.61.

the same regression for a subset of this database when they met a certain condition, and the result has been that many of the variables that were previously significant are no longer and the squared R = 0.30

The problem is that these variables under my point of view should not be so affected.

Does anyone find any explanation to this? Does it only mean that these variables in this case are no longer significant, or could it show a deeper problem?

Thank you.
Last edited:


New Member
Because the variables that are no longer significant, from a logical point of view, should remain significant given the subset data