Search results

  1. F

    Comparing Distributions of Datasets

    That is though what I'd like to reflect. A new category suddenly becoming relevant, due to a child choosing a new colour sweet, would then change the picture of how evenly/unevenly spread their preferences are. I think I would like to compare across relative frequencies, and use that to...
  2. F

    Comparing Distributions of Datasets

    I would indeed like that to be the case, ie we ignore any 0 values and only look at the observed categories. So even if, say, there were 10 possible sweet colours and a child took different amounts of sweets that were of 3 different colours, then I would ignore the fact that there are 7...
  3. F

    Comparing Distributions of Datasets

    There is no 'hard-coded' absolute limit, although if I look through the data in practice the maximum number of categories appears to be 12.
  4. F

    Comparing Distributions of Datasets

    There is no fixed number of choices. Sometimes there will be 3 colours available, sometimes more or less. The example I just gave was for colour, but the same question applies for a variety of different observations - none of which have a consistent number of possible choices. The data does...
  5. F

    Comparing Distributions of Datasets

    Hi @GretaGarbo and @Dason. Apologies for the confusion - I think my terminology and use of words have been quite poor! I am indeed referring to spread rather than distribution (in statistical terms). Let me give you an example that I hope makes it clearer. Two children, Alice and Betty, are...
  6. F

    Comparing Distributions of Datasets

    Thanks @GretaGarbo. Let me be a bit clearer about the examples. Dataset A contains the values of 10 and 1 Dataset B contains the values of 10, 1, and 1 Dataset C contains the values of 4, 4, 3, 2, and 2 What I'm looking for is some way to depict the distribution of each dataset in a number...
  7. F

    Comparing Distributions of Datasets

    I've been asked to find a method to best compare the distribution of a number datasets that have small sample sizes. Bonus points for a solution/result that is in a scale of 0-1, i.e. a distribution approaching 1 is bordering on perfectly unequal and a distribution approaching 0 is bordering on...