data analysis

  1. I

    Case studies for loyalty management (insight extraction)

    Hi all, I have just been called for an interview with a loyalty management company. I have been told that my data mining skills will be assessed using case studies. Given that this will be my first job if I succeed and so I am not entirely familiar with all the problems affecting real...
  2. J

    dendrogram graph - how many clusters would you pick in this with reasons why?

    Hi having a little problem with this question which i need to learn for my test coming up.. i have produced a dendrogram (on sas enterprise) which consists of 77 countrys with various information. I need to discuss the graph and explain how many clusters i would pick and why? (its more...
  3. K

    Scatterplot relationship issue.

    Hi, the data below is just a sample of the data I am trying to enter into a scatterplot. The issue is that a previous correlation test shows a p value of 0.011, a clear relationship. However, when I took the averages of 196 entries of the data below, the scatterplot shows little...
  4. A

    Question about what test to run when comparing groups.

    I am doing a research study on the effects of study methods on exam performance. I have already administered a survey and collected data from students asking them what study method they plan to use on an upcoming exam (highlighting, re-reading, or self-testing). All students were required to...
  5. A

    Where can I find real data for ANOVA for a project?

    Where can I find real data for ANOVA for a project? I have my semester project due soon and I need to get real life data to use for the analysis. I searched but didn't get the best results. any one know any website?
  6. D

    Question about model validation.

    I'm working on a problem on my own (not for class or work) and I just wanted to get some outside opinions before I progress further. As far as my background, I've taken some introductory statistics courses for psychology majors, and an intro probability and statistics course for engineers...
  7. R

    SPSS Question Regarding entering data and calculation

    Hi, i am doing a research on stress level and i have 2 questionnaire and the total no. of respondents is 60. First questionnaire has been measured on a 6 point likert scale and it has total 16 questions. Second questionnaire has been measured on 5 point likert scale and it has total 46...
  8. H

    Re: Help needed for forecasting model

    Hi, I have a dataset of some monthly usage values. I have to forecast the for the next 12 months. I have taken log of data for ease of use. It seems to me that ARMA(2,0,2) model fits best for it. Can you confirm if I am right or wrong? If it is, please tell me which one fits the best. Also...
  9. A

    What analysis should I run?

    Hi, I am not sure which analysis to run based on what I am trying to figure out, any guidance is appreciated. The research question is- are certain ethnicities more likely to report their ethnicity in the online classroom? Data- I have a variable of ethnicities (1= Caucasian, 2= African...
  10. W

    Minitab/Excel to Analyze Data - What do I use?

    Hi, I've been collecting data from a couple different experiments I've been running the lab. I run a mixing system through runs with solid material. The material is the same but some of the properties keep changing when I put it in (pH, %-solids) and are recorded through tests. I take T=0...
  11. K

    crazy p-value

    I am taking data analysis, and am having a bit of trouble. It has been a couple years since i took intro to stats, so i am very rusty. I performed a t-test (two sided, and both alternates) on Sleuth 2 ex0222, and I get a p value with an negative exponent. this seems ridiculous to me, and I...
  12. C

    Whether / how to combine 2 linear equations for different variables, and weighting

    I'm analysing results from measurements of 2 variables (a, b). The goal is to use them singly or together to determine a 3rd variable, c. All 3 variables are instrumented, measured data (not social science). We made over 5000 simultaneous measurements of a & b. Our equipment software gave us...
  13. W

    Finding specific terms in several files.

    I have several documents, and would like to see which files contain a given term. How to do that in R?
  14. A

    Help With The Preffered Package Type Assessment

    Hello there to everyone! I have the question concerning the underlying data analysis methodology. Here is the small introduction. There is a data from the e-commerce site about its customers' purchases and there are 875 observations total. Each observation consists of 5 values. Scales of value...
  15. T

    [RapidMiner] Decision Tree Parameters in RapidMiner

    Hello I was wondering if somebody would kindly explain to me the different parameters I can use on a standard decision tree. By parameters I mean the following: Criterion, minimal size for split, minimal leaf size, minimal gain, maximal depth, confidence. How would I determine what those...
  16. M

    Finding a value to show there's significant differnece between two population means

    I'm working with two data sets and want to compare the difference of means between two particular rankings between two different years. My boss wants me to find a number to prove there is significant difference between these means. I am wondering if there is a statistics equation to find this...
  17. M

    We have found statistical differences with t-tests, but there must be more!

    Hello TS users, The attached data set refers to patients that had an operation, a vast number of measurements were taken before and after. We have found some evidence of improvement in questionaires for pain scale (VAS), ability of living (RMS & ODS) and walking distance, all tested using...
  18. Z

    Survey Analysis

    Hey Guy's I really need your help. I am not a statistician. I have to analyze surveys that have 6 observations (people) and 20 factors (food preference with answers; like, dislike, neither or no) What method should i use to analyses this on SAS or Excel
  19. N

    Matched vs. Unmatched t-tests

    I am conducting a t-test and have to choose between an unmatched and matched test. I am looking at the vaccination percentage rate between men and women in a given year. Would this be a matched test? My understanding is that in a matched test, the two variables need to be independent, but I am...
  20. C

    Regularized Canonical Correlation Analysis

    Hi, I have a question about sample size and regularized canonical correlation analysis. Is a sample size of 15 unreasonable to use in a regularized canonical correlation analysis? If one has lots and lots of potential explanatory variables (multicollinearity expected) and multiple response...