Recent content by Jake

  1. Jake

    Fleeting/Random Thoughts

    Sounds good, gotta try that
  2. Jake

    Comments on residuals

    In my opinion these residual plots look fine enough, I wouldn't be too worried.
  3. Jake

    Factor analysis

    Another, similar way to approach this would be partial least squares regression.
  4. Jake

    R to Python List manipulation

    If (and only if) you're working with a dictionary where you know that every value is a list, then you could cast the dict values (discarding the keys) into a list of lists using `list(x.values())` and then apply one of the recipes here for flattening a list...
  5. Jake

    R to Python List manipulation

    First of all, that's a dictionary, not a list. You construct dictionaries with curly braces and lists with square brackets. A dictionary maps keys to values. A dictionary's .keys() method will return, well, its keys: in this case the list names ['Titles', 'Entities']. The .values() method will...
  6. Jake

    Multilevel regression with two clusters

    Like others already mentioned, this is a crossed random effects model, which can easily be fit in most (but not all) stats packages, including lme4 in R, SAS PROC MIXED/GLIMMIX, and others. The syntax is package-specific of course but usually it's as simple as just add separate random effect...
  7. Jake

    Overdispersion/ unobserved heterogenity in logistic regression.

    "Unobserved heterogeneity" in logistic regression is nothing to be afraid of. I address this here, arguing directly against Allison and Mood: http://jakewestfall.org/blog/index.php/2018/03/12/logistic-regression-is-not-fucked/ Overdispersion is a completely different issue. In logistic...
  8. Jake

    What's the difference between running an ANCOVA and running an ANOVA with the residuals (of the covariate) as the response?

    Yes, I agree. Mainly I think the alternative method is interesting as a way of understanding what ANCOVA is doing "under the hood." But in practice you wouldn't normally literally do it that way.
  9. Jake

    What's the difference between running an ANCOVA and running an ANOVA with the residuals (of the covariate) as the response?

    This is almost true, but not quite. You're missing one step here: you also need to regress the independent variable (IV) on the covariate and save those residuals too. Then if you regress the DV residuals (which you already mentioned) on the IV residuals (which I just mentioned), the resulting...
  10. Jake

    R squared and correlation in R

    In your own example, r = 0.53 and R^2 = 0.28....so clearly they're not the same.
  11. Jake

    SAS v R

    Feel free to explain the methodological problems you spotted.
  12. Jake

    SAS v R

    There is data. There is data. There is data. There is data. There is data. There is data. http://r4stats.com/articles/popularity/ You cannot keep ignoring this and spewing this unfounded **** about SAS being more popular than R in general. Are there specific pockets of industry where SAS is more...
  13. Jake

    What podcasts do you enjoy or listen to?

    I enjoy Very Bad Wizards
  14. Jake

    Unobserved heterogeneity in logistic regression

    Yes, I agree. We discussed that in the chatbox starting with the following two chats:
  15. Jake

    Unobserved heterogeneity in logistic regression

    Thanks, yes, this is an excerpt from Pearl's book, which I own and have read. I've definitely re-read that section several times while working on my blog post.