I am trying to do a linear regression with walking speed of pedestrians as the dependent and population size as the independent variable. I have collected the same amount of samples (200) for each city. Other studies work with the average walking speed per city. My guess is because they have an unequal sample size. My question is, should I regress with all the samples or should I use the aggregated values (average walking speed) for my regressison?