I am writing with regard to a data set of wine characteristics for 1600 different wines, each with a pH value, an acetic acid concentration (mg/L) and a quality score on an ordinal 1-10 Likert-type scale from a taste test. The acetic acid data is heavily negatively skewed and the pH data is less skewed. Neither looks normally distributed, so I believe a non-parametric test is needed. The quality ranking data is also discernibly non-normal.
I am looking for a statistical test to determine whether there is a significant relationship between quality rank and the acetic acid content in each quality group, and likewise between quality rank and the pH in each group.
I know from box plots that there is a clear pattern: as the quality rank increases, the mean and median acetic acid content decrease, the IQR narrows and there are fewer outliers. So I am expecting a significant relationship. In other words, moving up through the quality ranks 1-10 (into which the 1600 wines are categorised), acetic acid concentration decreases, as does pH.
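To make the question concrete, here is a minimal sketch of one candidate check: per-group medians and IQRs (the numbers behind the box plots), followed by Spearman's rank correlation with a permutation p-value. It is only an assumption that this kind of rank-based test fits the problem. The data below is synthetic and the names (`quality`, `acid`, `group_summary`, `spearman_permutation`) are hypothetical, not taken from the real data set.

```python
import random
import statistics
from collections import defaultdict

def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # sorted positions i..j get one shared rank
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson(x, y):
    """Pearson correlation; applied to ranks it gives Spearman's rho."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

def spearman_permutation(x, y, n_perm=1000, seed=0):
    """Spearman's rho with a two-sided permutation p-value."""
    rx, ry = average_ranks(x), average_ranks(y)
    rho = pearson(rx, ry)
    rng = random.Random(seed)
    shuffled = ry[:]
    extreme = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # break any real link between x and y
        if abs(pearson(rx, shuffled)) >= abs(rho):
            extreme += 1
    return rho, (extreme + 1) / (n_perm + 1)

def group_summary(quality, measure):
    """Map each quality rank to (count, median, IQR) of the measure."""
    groups = defaultdict(list)
    for q, m in zip(quality, measure):
        groups[q].append(m)
    summary = {}
    for q in sorted(groups):
        q1, _, q3 = statistics.quantiles(groups[q], n=4)
        summary[q] = (len(groups[q]), statistics.median(groups[q]), q3 - q1)
    return summary

# Synthetic stand-in data (NOT the real wines): acid falls as quality rises.
rng = random.Random(42)
quality = [rng.randint(3, 8) for _ in range(300)]
acid = [max(0.05, 1.0 - 0.08 * q + rng.gauss(0, 0.15)) for q in quality]

summary = group_summary(quality, acid)
for q, (n, med, iqr) in summary.items():
    print(f"quality {q}: n={n}, median={med:.3f}, IQR={iqr:.3f}")

rho, p_value = spearman_permutation(quality, acid)
print(f"Spearman rho={rho:.3f}, permutation p={p_value:.4f}")
```

The same call with pH in place of `acid` would cover the second variable. If SciPy is available, `scipy.stats.spearmanr` gives the same correlation directly, and `scipy.stats.kruskal` would instead test whether the groups differ at all without using their order.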
Any advice appreciated! Searching for this test has made me and my partner question every facet of our statistics understanding!