Search results

  1. trinker

    [Python] regex fixed-width positive lookbehinds

    In R (via stringi) I could write (?<=a|bc), but doing this with the re module results in this error: raise error("look-behind requires fixed-width pattern") error: look-behind requires fixed-width pattern. Is it true that if I have the regex (?<=a|bc) in Python I need to write it as...
  2. trinker

    R to Python List manipulation

    Taking the plunge and learning Python at a deeper level to complement my R skills. I am coming up against a wall with manipulating named lists in the way I can in R. R/Python folks, can you tell me the equivalent ways to do these common R tasks? R way of working with a list: x <- list(...
  3. trinker

    quantile-quantile plot (qqplot) in R by hand-ish

    My understanding of a qqplot was that it plots the sorted values for a variable on the y against the theoretical values from a normal (or whatever) distribution on the x. We can get the y values from our sample and the x values from looking up the p value in a given distribution and...
  4. trinker

    Monte Carlo Simulation for Predicting Agile Stories Completed

    A team at work saw this post on using MC for predicting a forecast of stories completed: http://scrumage.com/blog/2015/09/agile-project-forecasting-the-monte-carlo-method/ I have a series of questions, as I know of Monte Carlo simulations but have not used them. 1. Is the basic gist: Get mean and...
  5. trinker

    How do I read github csv into R?

    Let's say I have a CSV on GitHub. How do I read it into R?
  6. trinker

    Math notation break set into n length groups

    I have a vector of length 26 and want to split it into groups of length 5. I can write it in code but what's the math notation for breaking a set up into n length groups keeping the elements consecutively ordered? Using what Bryan Goodrich wrote here...
  7. trinker

    Confidence intervals for proportions: approximating a discrete distribution with a co

    I saw the following quote about calculating a CI for a proportion on this website: http://onlinestatbook.com/2/estimation/proportion_ci.html Giving: p \pm Z_{.95}\sqrt{\frac{p(1-p)}{N}} \pm \frac{.5}{N} Where (it appears) N is the sample size. Another website...
  8. trinker

    Python nested list compared to R's

    Are these two things equivalent in R and Python respectively? r <- list(list(c("a", "b"), c("d", "e")), list(c("f", "g"), c("h", "i"))) python = [[("a", "b"), ("d", "e")], [("f", "g"), ("h", "i")]]
  9. trinker

    Determine distribution and parameters

    I have a question but maybe it's the wrong question so I'll state the task first... I want to make data that looks like the data I'm working with without actually being the data itself. So I want to maintain structure as much as possible and generate an n row data set with similar...
  10. trinker

    Math notations length of

    I know we use n for length of a vector in many cases such as in the mean formula: Well let's say I have a list of vectors maybe: x = \{a, b, c\} and a = \{1, 2, 3 \}, b = \{4, 2, 3, 6 \}, c = \{14\} I want to talk about the length of x but also about the lengths of (a, b, and...
  11. trinker

    Confidence Interval Wording Write Up

    When we get a 95% confidence interval for a mean is it appropriate to say:
  12. trinker

    Show distributions, same height different variances

    I want to show two distributions on the same plot that have identical height but different variances. I understand logically why my approach below doesn't result in equal heights. Now how can I force it to have equal heights? if (!require("pacman")) install.packages("pacman")...
  13. trinker

    Weaknesses in the assumptions/misuses of HLM/multilevel modeling

    Hopefully the title says it. What do people see as potential/perceived weaknesses in the assumptions and/or misuses of HLM/multilevel modeling?
  14. trinker

    Assumptions of statistic models

    I was watching [Hadley's dplyr video](https://www.youtube.com/watch?v=8SGif63VW6E) and he made a statement that made me think: 8SGif63VW6E#t=134 I asked for clarification via Twitter and he said: I'm curious what others think about these comments. Do people agree? Is he saying the model...
  15. trinker

    Proper name for scores calculated via formula

    Often in my field folks will take various count data and apply some formula to input different information and output some sort of summary. An example is readability scores: What is this act of data manipulation called? I've played with ... data transformation data summarizing...
  16. trinker

    Weights for weighted mean: Do these weights make sense?

    I'm attempting to weight a mean to make outliers less impactful. I know people often use a median here, but I am looking to make a mean that's robust to outliers. That is, the further an observation is from the mean, the less weight it has. I'm sure this has been done before. In...
  17. trinker

    Ecologists & Sociologist views wanted: Species found together

    I am curious about some ecological statistics/scores. I believe that across time, species of animals may migrate to different locations. This movement is likely due to resources (or, in the human population, interests as well). I posit that we can measure the migration (amount of...
  18. trinker

    Moving average: Smoothing until stable

    I am looking at a score in discourse over equally spaced intervals. Basically this is a time series. I want to view this as a line graph using the cumulative average. Here's the ugly code that I used in R and the plot: set.seed(10) dat <- data.frame(value=rnorm(1000, sd=10), time=1:1000)...
  19. trinker

    Theoretical Dissertation Direction/Resources (need structure)

    For many stats based people a theoretical dissertation (e.g., proposing a new methodology) is common. In my institution's department (and field for that matter) theoretical dissertations are not known. I am aiming to undertake this sort of dissertation (specifically a piece outlining and...
  20. trinker

    Figuring out where missing parenthesis goes in Trigonometric functions formula

    I'm trying to figure out a bounding box (locations within a square mile of a center point) on a globe. This is more complicated than the traditional Cartesian 2-D system. I found a site that claims to have it figured out and they're smarter than me so we'll trust it (be test the solution when I...