Recent content by Mr. Smith

  1. Delete Excess Data and Not Lose Stat Significance?

    That game theory re-framing is helpful and pulls in solid concepts. I haven't had much luck yet finding related models. I've been working with some simpler examples to see what other strategies may work. The scaling solution I proposed, when looking at the data at any given moment, may...
  2. Delete Excess Data and Not Lose Stat Significance?

    The cap can be based on either date or data limits; either unit may be used. Once there's a solution, I can just adapt it to the data model. The reason capping data was emphasized (aside from it being what will preferably be used) is that some date-based tips didn't account for the "all time" question. That's...
  3. Delete Excess Data and Not Lose Stat Significance?

    The current prototype does track unique songs and how many votes each has. The goal, however, is not to cut data based on date. The dilemma is how to cut data to stay within a data cap. The key question is how to cut songs without giving the remaining songs an unfair advantage. Would scaling be...
  4. Delete Excess Data and Not Lose Stat Significance?

    Thanks for all the responses. The only analysis needed is a relatively accurate top 100 (or whatever number is ultimately decided). Selection bias is a concern. If I understood the question, the list is to inform decisions about allocating resources. There is only one input. The complicating...
  5. Delete Excess Data and Not Lose Stat Significance?

    Hi. I'm working with data that may get too large. I'm wondering if there is a way to keep the data relatively accurate and manageable by deleting "insignificant" data. For simplicity, let's say it's an all-time top 100 list (from when our list was started). Users enter their favorite song...