# Predicting Rsquared out-of-sample

#### Suarez Jochelges

##### New Member
Hi everyone.

Im currently writing my thesis and got advised to compare In-sample vs Out-of-sample prediction. Where R2 out of sample is defined as: My data set is for 1764 observations (months):
i created sample of my data set:

Code:
* Example generated by -dataex-. To install: ssc install dataex
dataex return t, count(30)
clear
input double return float t
.01242389  1
.02019115  2
.02964644  3
.03405454  4
.02848361  5
-.01542547  6
-.00863808  7
.01915629  8
.01136751  9
-.05886838 10
.03742423 11
.0257288 12
.02951672 13
.00101633 14
.04072984 15
.03142682 16
-.00213872 17
-.00795212 18
-.00034236 19
-.00761387 20
-.01335902 21
.01832136 22
-.00212152 23
.03346823 24
.01082923 25
.01575765 26
-.01307993 27
-.00878622 28
.01496919 29
-.01535528 30
end
I tried to run this codes:
Code:
gen t=_n
tsset t
gen segment = 1
replace segment = 0 if t<883
reg return L.infl L.badtimes if segment ==0
predict forecast if segment == 1
gen returns = return if segment == 1
reg return forecast
And got advised to try this:
Code:
gen t=_n
tsset t
gen segment = (t >883 & t<.)
su return if segment==0
local r_bar=r(mean)
reg return L.lty if segment ==0
predict forecast if segment == 1
predict res if segment == 1,res
g r_rhat_sq=res * res if segment == 1
g r_rbar_sq=(forecast -r_bar')^2 if segment == 1
su r_rhat_sq
local sumtop=r(sum)
su r_rbar_sq
local sumbot=r(sum)
local rsq=1 - (sumtop'/sumbot')
di "rsq'"
However, i'm convinced that there are fundamental issues in coding or statistics, since i get wrong results.
I would really appreciate any help i could receive in solving this issue.