So first let me give some background information on my study.

Research question: Are there differences on helpfulness between online vinyl record reviews with the indication of a verified purchase and online vinyl record reviews without this indication?

This indication of a verified purchase is a thing on Amazon that can be added to a review when the reviewer bought the exact same product as is presented on Amazon.

Anyways, I gathered 80 reviews (I know, this are too few reviews, but it was part of the assignment); 40 reviews with the indication and 40 reviews without.

I had some big outliers, but my teacher told me that I only should remove these if they influence my outcome.

I checked normality and homogeneity of variance for both (with and without the outliers) and they both gave negative results. I learned in class that I may solve this by doing a bootstrapped non-parametric test, or a normal parametric test. To be sure, I did both, again for both data files (with and without outliers). This gave me the following results

**With the outliers**

Boostrapped independent t-test

Boostrapped independent t-test

*Mdif*= -2.33,

*t*(78) =-2.02,

*p*= .054

**----> not significant, but a tendency to significance**

95% CI [-4.759, -0.159],

*d*= 0.04

**----> and an extremely small effect size**(I know, calculating an effect size makes no sense if your difference is non-significant, but mine is close to significance, so I figured I calculate it as a bonus )

**Chi-Square test**

X2 = 16.608,

*p*= .343

**---->**

**not significant**

**Without the outliers**

Bootstrapped independent t-test

Bootstrapped independent t-test

*Mdif*= -1.97,

*t*(71) =-3.69,

*p*= .002

**----> significant**

95% CI [-3.056, -0.938],

*d*= 0.09

**---->**

**but a**

**very small effect size**(I have never seen an effect size this small, so can I even use this?)

**Chi-Square test**

X2 = 14.264,

*p*= .161

**-----> not significant**

OK, now here is my question. What test should I report in my paper? Because some give significant differences, some don't?? I don't want to do any p-hacking, so what is the

**real result/good result**I have to report here?

Thank you so much in advance!! Please let me know if something is unclear.