# Search results

1. ### [Python] regex fixed-width positive lookbehinds

In R (via stringi) I could write (?<=a|bc) but doing this with the re module results in this error: raise error("look-behind requires fixed-width pattern") error: look-behind requires fixed-width pattern Is it true that if I have the regex (?<=a|bc) in Python I need to write it as...
2. ### R to Python List manipulation

Taking the plunge and learning Python at a deeper level to compliment my R skills. I am coming up against a wall with manipulating named lists in the way I can in R. R/Python folks can you tell me the equivalent ways to do this common R tasks? R Way of working with list: x <- list(...
3. ### quantile-quantile plot (qqplot) in R by hand-ish

My understanding of a qqplot was it was the sorted values for a variable on the y against the theoretical values from a normal (or whatever distribution) distribution on the x. We can get the y values from our sample and the x values from looking up the p value in a given distribution and...
4. ### Monte Carlo Simulation for Predicting Agile Stories Completed

A team at work saw this post on using MC for predicting a forcast of stories completed: http://scrumage.com/blog/2015/09/agile-project-forecasting-the-monte-carlo-method/ I have a series of questions as I know of Monte Carlo Simulation but have not used them. 1. Is the basic gist: Get mean and...
5. ### How do I read github csv into R?

Let's say I got a csv on GitHub how do I read it into R?
6. ### Math notation break set into n length groups

I have a vector of length 26 and want to split it into groups of length 5. I can write it in code but what's the math notation for breaking a set up into n length groups keeping the elements consecutively ordered? Using what Bryan Goodrich wrote here...
7. ### Confidence intervals for proportions: approximating a discrete distribution with a co

I saw on this website http://onlinestatbook.com/2/estimation/proportion_ci.html The following quote about calculating a CI for a proportion: Giving: p \pm Z_{.95}\sqrt{\frac{p(1-p)}{N}} \pm \frac{.5}{N} Where (it appears) N is the sample size. Another website...
8. ### Python nested list compared to R's

Are these two things equivalent in R and Python respectively? r <- list(list(c("a", "b"), c("d", "e")), list(c("f", "g"), c("h", "i"))) python = [[("a", "b"), ("d", "e")], [("f", "g"), ("h", "i")]]
9. ### Determine distribution and parameters

I have a question but maybe it's the wrong question so I'll state the task first... I want to make data that looks like the data I'm working with without actually being the data itself. So I want to maintain structure as much as possible and generate an n row data set with similar...
10. ### Math notations length of

I know we use n for length of a vector in many cases such as in the mean formula: Well let's say I have a list of vectors maybe: x = \{a, b, c\} and a = \{1, 2, 3 \}, b = \{4, 2, 3, 6 \}, c = \{14\} I want to talk about the length of x but also about the lengths of (a, b, and...
11. ### Confidence Interval Wording Write Up

When we get a 95% confidence interval for a mean is it appropriate to say:
12. ### Show distributions, same height different variances

I want show two distributions on the same pot that have identical height but different variances. I understand logically why my approach below doesn't result in equal heights. Now how can I force it to have equal heights. if (!require("pacman")) install.packages("pacman")...
13. ### Weaknesses in the assumptions misuses of HLM/multilevel modeling

Hopefully the title says it. What do people see as potential/perceived weaknesses in the assumptions and/or misuses of HLM/multilevel modeling?
14. ### Assumptions of statistic models

I was watching [Hadley's dplyr video](https://www.youtube.com/watch?v=8SGif63VW6E) and he said a statement that made me think: 8SGif63VW6E#t=134 I asked for clarification via Twitter and he said: I'm curious what others think about these comments. Do people agree? Is he saying the model...
15. ### Proper name for scores calculated via formula

Often in my field folks will take various count data and apply some formula to input different information and output some sort of summary. An example is readability scores: What is this act of data manipulation called? I'b played with ... data transformation data summarizing...
16. ### Weights for weighted mean: Do these weights make sense?

I'm attempting to weight a mean to make outliers less impactful. I know people use a median here often but am looking to try to make a mean that's rbust to outliers. That is the further an observation is from the mean the less weight it has. I'm sure this has been done before. In...
17. ### Ecologists & Sociologist views wanted: Species found together

I am curious about some ecological statistics/scores. I believe that across times species of animals may migrate to different locations. This movement is likely due to resources (or in the human population interests as well). I posit that we can measure the migration (amount of...
18. ### Moving average: Smoothing until stable

I am looking at a score in discourse over equally spaced intervals. Basically this is a time series. I want to view this as a line graph using the cumulative average. HEre's the ugly code that I used in R and the plot: set.seed(10) dat <- data.frame(value=rnorm(1000, sd=10), time=1:1000)...
19. ### Theoretical Dissertation Direction/Resources (need structure)

For many stats based people a theoretical dissertation (e.g., proposing a new methodology) is common. In my institution's department (and field for that matter) theoretical dissertations are not known. I am aiming to undertake this sort of dissertation (specifically a piece outlining and...
20. ### Figring out where missing parenthesis goes in Trigonometric functions formula

I trying to figure out a bounding box (locations within a square mile of a center point) on a globe. This is more complicated than the traditional Cartesian 2-D system. I found a site that claims to have it figured out and they're smarter than me so we'll trust it (be test the solution when I...