data analysis

  1. W

    Finding specific terms in several files.

    I have several documents, and would like to see which files contain a given term. How to do that in R?
  2. A

    Help With The Preferred Package Type Assessment

    Hello there, everyone! I have a question concerning the underlying data analysis methodology. Here is a short introduction. There is data from an e-commerce site about its customers' purchases, with 875 observations in total. Each observation consists of 5 values. Scales of value...
  3. T

    [RapidMiner] Decision Tree Parameters in RapidMiner

    Hello, I was wondering if somebody would kindly explain to me the different parameters I can use on a standard decision tree. By parameters I mean the following: criterion, minimal size for split, minimal leaf size, minimal gain, maximal depth, confidence. How would I determine what those...
  4. M

    Finding a value to show there's a significant difference between two population means

    I'm working with two data sets and want to compare the difference of means between two particular rankings between two different years. My boss wants me to find a number to prove there is significant difference between these means. I am wondering if there is a statistics equation to find this...
  5. M

    We have found statistical differences with t-tests, but there must be more!

    Hello TS users, The attached data set refers to patients who had an operation; a vast number of measurements were taken before and after. We have found some evidence of improvement in questionnaires for pain scale (VAS), ability of living (RMS & ODS) and walking distance, all tested using...
  6. Z

    Survey Analysis

    Hey guys, I really need your help. I am not a statistician. I have to analyze surveys that have 6 observations (people) and 20 factors (food preferences, with answers: like, dislike, neither, or no). What method should I use to analyze this in SAS or Excel?
  7. N

    Matched vs. Unmatched t-tests

    I am conducting a t-test and have to choose between an unmatched and matched test. I am looking at the vaccination percentage rate between men and women in a given year. Would this be a matched test? My understanding is that in a matched test, the two variables need to be independent, but I am...
  8. C

    Regularized Canonical Correlation Analysis

    Hi, I have a question about sample size and regularized canonical correlation analysis. Is a sample size of 15 unreasonable to use in a regularized canonical correlation analysis? If one has lots and lots of potential explanatory variables (multicollinearity expected) and multiple response...
  9. C

    Canonical Correlation (Package CCA)

    Hi there, I'm using R (and package CCA) trying to perform a regularized canonical correlation analysis (There are 12-13 "response" variables in one dataset, another dataset of a whole lot (i.e. >400) of potential "explanatory" variables, but only a sample size of 15). Gonzalez et al. (2008...
  10. A

    Joint and Marginal Densities.

    I am having some problems grasping the concept of these joint and marginal densities. It would really help if someone could provide me with an answer for the following question: Find the joint and marginal densities corresponding to the cdf F(X, Y) = (1 - e^αx)(1 - e^βy), x > 0, y > 0, α >...