cluster

1. Latent Class Analysis - How to identify the main drivers?

Hi All! I ran a LCA to identify the best possible segmentation of classes in a population. I used Latent Gold for this and imported the clusters later in to SPSS for some comparisons and calculation. I am a bit stuck with the following questions: Are there any metrics which indicate which of...

Hello! Can somebody please help me figure where should I draw the line to determine the number of clusters? I'm thinking 3 clusters but i'm not 100%.
3. proc fastclus output interpretation

Hello, I am using proc fastclus to perform k-means clustering. In Figure 42.2: Cluster Summary Table from the FASTCLUS Procedure, an R-square value is reported (please see link below)...
4. Whats is the method to analyze characteristics from two groups?

Hello there, Im new in the statistics world and i hope that you can help me. i have two groups of sample that are divided by one y (depended) variable y = binary, is a good employee or no then i have ten variables (x) divided into demografic variables and profile test. Example: age, emotion...
5. Cluster analysis with conjoint analysis data

Hello everybody, My team and I have an assignment to run a conjoint analysis with SPSS. So far, we were able to make the orthogonal design to generate 16 cards and run a survey with respondents. Then, we were able to run the conjoint analysis in SPSS to get the Utility for each attribute...
6. Principal component and clustering

Hi! I have a question concerning principal component analysis and how to pursue. First of all, this is my case: I have a lot of data points containing a calculated Q and a measured T. Q is calculated by the sum of different q's. I would like to make the following diagonal clusters...
7. What method for this problem? [Correlation / Correspondence]

Hi, I have a data set like the following with some categorical/nominal values (like Gender and Age Group) and a ratio scaled value (DropOut, which is the week after registration in which a person dropped out of a program [so it's never zero or less, and it's not the calender week]): I want...
8. K-means cluster for skewed dataset

Hi, I've 7 columns(variables) and their percentiles are shown as below for 14K rows. I've tried to create k-means clusters for 14k observations of 7 variables. A-G are products the numerics are turnover for A-G If you look at the table, massive dataset is having no turnover for all...
9. Which cluster analysis methods to use to analyze purchase behavior with time?

Hello everyone, I quite new to cluster analysis, but read through a fair amount of material, still I'm puzzled with the following task. Performing cluster analysis on my customers, using the follwing data set SubjectN; Week1 Purchase; Week2 Purchase;.....; Week12 Purchase These...
10. Combining two samples to then do Cluster Analysis/multivariate analysis

I was wondering if someone can help. For a university assignment I have been asked to propose a marketing research method & survey etc for a company. I have been set an online takeaway website to do. We are asked in the survey to find out about food habits and the takeaway market. Survey has...
11. Hierarchical Cluster Analyis: Working with Groups

Hello everybody! I am not sure if you provide me an answer to my questions, but I am pretty desperate and would be glad of any help I can get! I am conducting a study in which I applied first Factor Analysis (for a question that was asked with Likert Scale, regarding characteristics of a...
12. Analysing ranked data

I had following question in my questionnaire: Rank following factors: price, quality, advertisement, brand, reference from 1 (very important) to 5 (least important) that influenced on your buying decision process. Which approaches may be used in order to measure most important factors? I...
13. What technique to use?

I am studying modeling, and I am looking for a good technique to optimize an insurance process, but I am not quite sure how to accomplish what I want to do. I have a dataset with several categorical variables (policy characteristics) and one numerical variable (historical losses). What I...
14. panel data - within-group estimate - individual fixed effects retrieved

Hi TalkStats people! I am analyzing panel data. First, I have to decide whether to use a random or fixed effect estimator. The Hausman test suggests to use the fixed effect estimator (also named within group estimator). Thus, this is what I am using. However, this method eliminates the...
15. Indicate significant differences between two clustered bars

Hello, I'm trying to add line and star annotation in a clustered bar chart, in order to indicate significant difference between two groups, similar to this: Example in MATLAB Can anyone help me, how can it be done in SPSS?
16. Generalized Linear Model and fixed effects

I want to estimate a model with a fractional outcome variable following Papke & Wooldridge (2002). I run glm dep_var independent_var, family(binomial) link(logit) robust, right? How can I add a three level fixed effects? My approach until now was to use egen new_var = group(var1 var2 var3)...
17. Clustering a pair of 3 points for a given x position?

Hi all, If i have a pair of points that have different y positions (A,B,C) but with the same x coordinate, is it possible to cluster this pair of 3 points together and not individually? I'd like to see the occurrence of this pair of 3 points together in a given sample and see what pair...
18. SPSS Linear regression with data clusters

Hello, I'm trying to do a linear regression in SPSS where the data is clustered. I have n = 58 with 7 different measurement variables. One of those variables is a measurement, the other six are an observation (yes/no). I can run the linear regression just fine, however, I need to account for...
19. Performing comparisons after a cluster analysis

I have a data set of N subjects, and P variables, with some vars measuring performance on a task, and some measuring subjects' intrinsic characteristic. Each of the performance variables consists of the proportion of correct responses (range 0 to 1.0) to some set of questions. The characteristic...
20. PhD - which statistical analysis to use? MLM, t tests, ANOVA?

Hi, my research question is "do large cognitive changes precede a period of mood stability?" (N = 30), I have 20 therapy sessions data for each participant, which includes scores on a cognitive style questionnaire and scores on a mood questionnaire of each session. My supervisor has suggested...