Cross sectional


New Member
I am running an analysis on secondary data that I was able to get from a health clinic on all women who gave birth in the clinic in the last two years. The dataset contains 3,600 women and includes only 5 variables (age, residence status (rural/urban), birth outcomes (stillbirth, live, premature), type of delivery and the number of antenatal visits).

I am planning to run a multiple logistic regression to estimate the association between mother's age and birth outcome while accounting for other factors in the dataset.
My questions are:
1- What is the best way to handle missing values? (there are only a few missing values in the number of antenatal visits)
2- I am worried that I only have a few variables in the dataset. Do you have any suggestions for strategies that I can employ to make the most out of my data?