Recent content by Furn

1. Comparing Distributions of Datasets

That is though what I'd like to reflect. A new category suddenly becoming relevant, due to a child choosing a new colour sweet, would then change the picture of how evenly/unevenly spread their preferences are. I think I would like to compare across relative frequencies, and use that to...
2. Comparing Distributions of Datasets

I would indeed like that to be the case, ie we ignore any 0 values and only look at the observed categories. So even if, say, there were 10 possible sweet colours and a child took different amounts of sweets that were of 3 different colours, then I would ignore the fact that there are 7...
3. Comparing Distributions of Datasets

There is no 'hard-coded' absolute limit, although if I look through the data in practice the maximum number of categories appears to be 12.
4. Comparing Distributions of Datasets

There is no fixed number of choices. Sometimes there will be 3 colours available, sometimes more or less. The example I just gave was for colour, but the same question applies for a variety of different observations - none of which have a consistent number of possible choices. The data does...
5. Comparing Distributions of Datasets

Hi @GretaGarbo and @Dason. Apologies for the confusion - I think my terminology and use of words have been quite poor! I am indeed referring to spread rather than distribution (in statistical terms). Let me give you an example that I hope makes it clearer. Two children, Alice and Betty, are...
6. Comparing Distributions of Datasets

Thanks @GretaGarbo. Let me be a bit clearer about the examples. Dataset A contains the values of 10 and 1 Dataset B contains the values of 10, 1, and 1 Dataset C contains the values of 4, 4, 3, 2, and 2 What I'm looking for is some way to depict the distribution of each dataset in a number...
7. Comparing Distributions of Datasets

I've been asked to find a method to best compare the distribution of a number datasets that have small sample sizes. Bonus points for a solution/result that is in a scale of 0-1, i.e. a distribution approaching 1 is bordering on perfectly unequal and a distribution approaching 0 is bordering on...