I would like to assess the difference in survival of the "same" cohort classified by two different disease classifications. Most patients with the disease would be the same in the compared groups, but some will be included or excluded due to differences in the classification criteria. Would a standard log-rank test be appropriate? Is there a method that would account for the shared patients?

My question relates to a health psychology study. I’m measuring self-assessed wellness at two timepoints (6 months apart). At the first timepoint, I’m also measuring some ‘personal factors’ (6 self-assessment questionnaires, e.g., what they think about their health). I’m looking to see whether these personal factors at time 1 predict their wellness at time 2. The data are questionnaire scores, all interval data.

My IV is the score on a right-wing authoritarianism scale. Possible scores for a person range from 0 to 180. No decimals (does it matter?).

My DV is where the person states s/he resides on a political spectrum ranging from -4 (left) to +4 (right). I guess I can code it as 1-9 or 0-8, or categorically as ultra left, very left, left, somewhat left, center, somewhat right, right, very right, ultra right.

What's your family type? Authoritarian, democratic, undefinable, inconsistent.

My DV is a score on a scale, so it is obviously continuous.

I am going to try to explain this as well as I can, but if anything is unclear, just let me know. I am currently doing research on personality using the Big Five Trait Inventory. The point of the research is to compare my own scores on the traits to the population's average score on each of these traits. I am also supposed to compare my own scores to the average of the scores given to me by four class members.

I also have three (affective, problem-solving and depressive

I ran a propensity score matching in order to evaluate a rehabilitation program.

We published this paper because of a significant finding showing that the program is effective in reducing recidivism rates.

Now, I am writing a new paper and I am using the matched samples from that study. I am checking whether the interaction between ethnicity and program participation is significant in reducing recidivism.

log(log(1-F(t)))=-Beta(log(lambda))-Beta(log(t)) to get the starting values.

The only problem is that F(t) is greater than 1, which leads to taking the log of a negative value (not possible). Any other suggestions for getting these starting values?
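One possible way around this, offered as a sketch rather than a definitive answer: if F(t) is replaced by an empirical estimate that stays strictly between 0 and 1, the logs are always defined. The example below assumes a two-parameter Weibull with F(t) = 1 - exp(-(t/lambda)^beta), complete (uncensored) positive times, and Bernard's median-rank approximation (i - 0.3)/(n + 0.4) for F. The parametrization and the function name `weibull_start_values` are assumptions, not taken from the post. Under that parametrization, log(-log(1 - F(t))) = beta*log(t) - beta*log(lambda), so a straight-line fit gives both starting values.

```python
import math


def weibull_start_values(times):
    """Rough starting values (beta, lam) from a Weibull probability-plot fit.

    Assumes F(t) = 1 - exp(-(t / lam) ** beta) and uncensored, positive times.
    """
    t = sorted(times)
    n = len(t)
    # x = log(t); y = log(-log(1 - F_hat)) with median-rank F_hat in (0, 1)
    xs = [math.log(v) for v in t]
    ys = [
        math.log(-math.log(1.0 - (i - 0.3) / (n + 0.4)))
        for i in range(1, n + 1)
    ]
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    beta = sxy / sxx                    # slope of the fitted line
    intercept = y_bar - beta * x_bar    # equals -beta * log(lam)
    lam = math.exp(-intercept / beta)
    return beta, lam
```

The returned pair is only meant to seed an optimizer; with censored data a different estimate of F (for example Kaplan-Meier) would be needed.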

I don't know what's wrong but I've been trying to compare the means of two independent groups using the independent t test in SPSS but I fail each time I try.

I am pretty new to statistics and I ran tests for autocorrelation in R (Durbin-Watson and Breusch-Godfrey).

As far as I have understood:

- the DW test should be between 1.5 and 2.5 (my results

The Harriet Hotel in downtown Boston has 100 rooms that rent for $150 per night. It costs the hotel $30 per room in variable costs (cleaning, bathroom items, etc.) each night a room is occupied. For each reservation accepted, there is a 5% chance that the guest will not arrive. If the hotel overbooks, it costs $200 to compensate guests whose reservations cannot be honored. How many reservations should the hotel accept if it wants to...
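The objective is cut off in the problem statement, so the following is only a sketch under stated assumptions: the goal is taken to be maximizing expected nightly profit, guests show up independently with probability 0.95, a no-show pays nothing, and each occupied room earns $150 minus $30. The function names and the search range of 20 extra reservations are hypothetical choices for illustration.

```python
from math import comb

ROOMS = 100
RATE = 150.0       # revenue per occupied room per night
VAR_COST = 30.0    # variable cost per occupied room per night
BUMP_COST = 200.0  # compensation per guest who cannot be housed
P_SHOW = 0.95      # 1 minus the 5% no-show chance


def expected_profit(reservations: int) -> float:
    """Expected nightly profit when `reservations` bookings are accepted."""
    total = 0.0
    for arrivals in range(reservations + 1):
        # Binomial probability that exactly `arrivals` guests show up
        prob = (
            comb(reservations, arrivals)
            * P_SHOW ** arrivals
            * (1.0 - P_SHOW) ** (reservations - arrivals)
        )
        occupied = min(arrivals, ROOMS)
        bumped = max(arrivals - ROOMS, 0)
        total += prob * (occupied * (RATE - VAR_COST) - bumped * BUMP_COST)
    return total


def best_reservation_count(max_extra: int = 20) -> int:
    """Search ROOMS .. ROOMS + max_extra for the highest expected profit."""
    candidates = range(ROOMS, ROOMS + max_extra + 1)
    return max(candidates, key=expected_profit)
```

Calling `best_reservation_count()` returns the reservation level with the highest expected profit under these assumptions; a different objective (for example, capping the chance of turning anyone away) would need a different criterion.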

I'm a French psychology student. My research is about anxiety levels, self-esteem levels and defense mechanisms in a population of young adolescents with conduct disorders.

- Independent

I'm currently in France for my Master's degree in psychology. I am working on research into the relationship between symptoms of conduct disorder and childhood depression through the representation of child attachment.

I have three variables:

- Dependent variable: depression rate for children, measured with the MDI-C test and the CBCL (Child Behavior Checklist)

- Independent variable: attachment type measured with the

I'm attempting to write an R package based on some functions I wrote. I think I have everything down, but I've run into a roadblock I'm hoping you can help with.

I'm using RStudio, R 3.4.4 and roxygen to do my bidding. I'm trying to run a check (--as-cran) and both of my functions (dsr.R, dsrr.R) return the same error:

I'm looking for a bit of feedback on a research project I'm designing. It is outside my usual scope, and I don't have current access to a statistician; thought I'd reach out here.

I apologize that I can't be completely forthcoming with details about the project; it involves PHI and a sensitive subject. I can answer questions to try and clear things up in terms of my description.

For my experiment, I have wheat straw to feed dairy cows. Before feeding, I need to process it in two ways to improve its nutritional value. First I need to sterilize it (I want to apply 4 sterilization methods) and then add good bacteria (I want to inoculate the sterilized straw with 3 types of bacteria). So, in my understanding, my study has two factors: 1) sterilization method (with 4 levels) and 2) type of bacteria (with...

I’m writing my thesis on the question: is ruminative thinking an independent risk factor for predicting marihuana use (controlling for sex and depression)? In this study there are 300 participants, of whom 60 used marihuana. To investigate this relationship I wanted to do a multiple regression analysis. However, after analyzing the data I found that the following assumptions of multiple regression are violated:

Linear relationship between (a) the dependent variable and each of

dominance y [aw=weight], sortvar (x) rule both

but it says the dominance command is unrecognised. After reading a couple of forums, I learned that I have to download its ado file and then install the command. Despite multiple attempts, I'm unable to run it. Kindly help me with the following queries:

1. How and from where can I download the ado file to run the dominance command in Stata 15?

2. How do I go about installing it?

Any prompt reply would

I have a problem where I want to prove that the data belonging to individuals of one group have a higher variance than those of the other.

The experiment involves a parameter of movement that changed in one group of organisms, but without a clear trend. The parameter remains stable in individuals belonging to the control group, whereas in the test group it fluctuated both up and down, depending on the individual. I want to show that it changes, but it is hard to test as I am measuring the...

What are the pros and cons to using a permutation test looking at mean differences? Should I do this with medians instead? Also, what are alternatives beyond Wilcoxon-style tests?
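To make the comparison concrete, here is a minimal sketch of a two-sided permutation test in which the statistic can be swapped between the mean and the median. The function name `permutation_test`, the number of permutations, and the fixed seed are illustrative assumptions. The test assumes the two groups are exchangeable under the null hypothesis.

```python
import random
from statistics import mean, median


def permutation_test(a, b, stat=mean, n_perm=10_000, seed=0):
    """Two-sided permutation test for stat(a) - stat(b).

    Returns the observed difference and a Monte Carlo p-value.
    """
    rng = random.Random(seed)
    observed = stat(a) - stat(b)
    pooled = list(a) + list(b)
    n_a = len(a)
    hits = 0
    for _ in range(n_perm):
        # Reassign group labels at random by shuffling the pooled data
        rng.shuffle(pooled)
        diff = stat(pooled[:n_a]) - stat(pooled[n_a:])
        # Small tolerance so exact ties are not lost to rounding
        if abs(diff) >= abs(observed) - 1e-12:
            hits += 1
    # Add-one correction keeps the p-value strictly above zero
    p_value = (hits + 1) / (n_perm + 1)
    return observed, p_value
```

Passing `stat=median` runs the same procedure on medians, which is less sensitive to outliers but usually gives a coarser permutation distribution in small samples.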

Here is the

But for this question, I believe we can just call this a logistic regression question. So I standardize all candidate variables entered into the model (e.g., 3 are continuous and the rest ~ 10 are

I am conducting a one-way ANCOVA with a fixed factor (group) and a continuous covariate (height). I compared the estimated marginal means with the means of the saved predicted values, and I found that the two sets of means are not equal. The IBM support note at http://www-01.ibm.com/support/docview.wss?uid=swg21477021 explains that this occurs because covariate-adjusted means for all groups are calculated using the grand mean value(s) of the covariate(s). However, predicted values for individual...

I am currently working on my master's thesis where I have to conduct a meta-analysis on the issue of what determines the cost of debt capital for private firms.

For that purpose I have already collected a number of relevant papers that address this research question using OLS regression.

Now the issue:

How can I convert the p-values, which go along with the estimated coefficients for the independent variables, to a z-score (which is required to apply
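A common conversion, sketched here under assumptions: the reported p-values are treated as two-sided, the z-score is the standard normal quantile with the same tail area, and its sign is taken from the sign of the estimated coefficient. The function name `p_to_z` and its parameters are hypothetical. Papers that report one-sided p-values would need `two_sided=False`.

```python
from statistics import NormalDist


def p_to_z(p_value, coefficient_sign=1, two_sided=True):
    """Convert a p-value to a signed standard normal z-score."""
    if not 0.0 < p_value < 1.0:
        raise ValueError("p_value must lie strictly between 0 and 1")
    # Area in one tail of the standard normal distribution
    tail = p_value / 2.0 if two_sided else p_value
    # Use the lower-tail quantile and flip the sign: this avoids
    # computing 1 - tail, which loses precision for tiny p-values
    z = -NormalDist().inv_cdf(tail)
    return z if coefficient_sign >= 0 else -z
```

For example, `p_to_z(0.05)` gives roughly 1.96, and `p_to_z(0.05, coefficient_sign=-1)` gives roughly -1.96. P-values reported only as thresholds (such as "p < 0.01") cannot be converted exactly this way.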

I have a data set looking at only those who have died/been removed from a much larger group and have a lot of data about them (i.e. age, BMI, sex, 4 symptom types, distance from hospital, smoker/ex/never smoker- probably have around 80-90% of this data for the 120 or so in this group).

I do not have the data on those who went on to have curative treatment other than total number which is perhaps around 800...

I have a model, which trades forex, and the model has a lot of parameters. I run the model with a lot of parameter combinations (test combinations), and try to choose the best ones, where the output is ordered by the profit or the percent of winning trades, or any other qualifier.

My test involves a sample of credit card customers that have defaulted, and I'm using a test vs. control treatment to see if my treatment shows any statistically significant improvement in...

I am working on a replication of a research paper where I have to compile my own data to copy the regression done in the research paper.

I have never done a regression like this before and I am not sure about the data I chose - I would appreciate any help and input.

All the variables I need:

I want to analyse the impact of turning a production line on and off (variable X, binary) on the environmental quality of wastewater being discharged; more specifically, the suspended solids content (variable Y, continuous). I have a set of daily data for a whole month, which shows if the production line was running or not, and the respective suspended solids content for those days. In other words, I have a data set similar to below:

15th January X=0 (line was not running)

df0$new <- ifelse(df0$old=="yes",1,0)

In this code I am creating a new variable called "new" that is equal to 1 if the variable "old" is equal to yes or is otherwise equal to 0. But in the variable "old" I have missing data (represented as -99, -98, NAN). So how can I account for there being missing values?

The second question is about using an "OR" statement.

df0$z

