1. T

    Should I include year effects in my logistic regression?

    I built the following logistic regression model for my master's thesis: FAIL = LATE + SIZE + AGE + EQUITY + PROF + SOLV + LIQ + IND. I want to see whether late filing of financial statements (independent variable) is an indicator of failure of small companies (dependent variable). FAIL is a dummy...
  2. A

    Regression modeling

    Hello everyone, I am a student in agricultural genetic engineering, and I am working on presenting a research paper about the genetic relationship between efficiency of feed intake and diseases. This paper presents a lot of statistical information which is quite hard for me to process quickly...
  3. S

    Is it logical to perform PCA and reduction on observations instead of features?

    I am currently working with a set of code called GPMSA that was published by LANL. The code serves to create a Gaussian process model of some simulator and perform regression with experimental data. I am working to understand everything but there is something that is confusing me. PCA is...
  4. A

    Linear regression - how to include curve functions in my scatterplot

    For my task I have to: Draw a scatter plot of 'price' (y-axis) vs. 'distance center' (x-axis). Then estimate the relationship between these 2 variables for the values 1, 2, 3, 4, 5 and 6 of the variable 'number of people' (so that you end up with six regression functions in your scatterplot)...
  5. C

    Comparing the relative contribution of 3 factors to a composite score per group

    Hi everyone, I would like to ask for help with the following question: I have the results of three different groups on a math test. The test score is actually a composite score calculated as the total sum of correctly solved problems * problem difficulty. There are 5 levels of difficulty...
  6. Z

    Multicategorical logistic regression in LPA/LCA analysis?

    Hi all, I am doing an LPA analysis, and I have a question about the multicategorical logistic analysis that follows once I have completed the category classification. Which reference group should I set? I have seen some papers that set a baseline group, after which all other groups are...
  7. J

    Polynomial regression coefficient SE results are different from examples

    Hi, I am running a polynomial regression on 5 sample dilutions, and when I calculate it in my software the results are different. The raw data is: For example, my results for the cubic are as follows: However, in the example given, the standard errors of the coefficients are different as...
  8. S

    Comparing outcomes of two treatment groups: t-test/Mann-Whitney U versus regression

    Hi all - I'm a student and had a question about statistical analysis. I'd like to compare post-treatment outcomes between two groups: group A who received a traditional drug (n = 200), and group B who received a newer drug (n = 100). Some post-treatment outcomes are continuous, whereas others...
  9. M

    Need Help Finding What Stats To Perform

    Hi there, I've been given a data set involving a food-training study. In week 1, baseline weight and BMI were recorded for each participant. In week 2, participants underwent four sessions of an online go/no-go response inhibition task. The task involved two treatments: food inhibition and...
  10. L

    R-squared is too high

    I just want to ask how to make the R-squared decrease without ruining the model. My research is in economics, and the R-squared is said to be too high.
  11. B

    Which analysis is adequate: ANCOVA, multilevel...?

    In the experiment, after a baseline assessment, participants will be assigned to one of two conditions (between-subjects factor). In condition A, I expect their response time to be higher in large stimuli than in small stimuli, but slower in colored stimuli than in b/w stimuli compared to their...
  12. O

    Why is an interaction term better than two regressions?

    I have struggled with the "because it is" and vague answers on this topic for a while and I was hoping someone could actually give a real reason why using an interaction term in regression is better than doing two regressions. For example: While doing a GLM to see how environmental factors...
  13. N

    Help with creating a model!?

    I want to create a model that assesses the effect of Drone usage (binary) in transporting medical samples to laboratories on Treatment Success (also binary). I was thinking: Dependent: TS = treatment success, so either successful or unsuccessful. Independent: TAT = turnaround time = the...
  14. U

    Help determining start values of coefficients for a nonlinear model

    Hi everyone. For a dataset consisting of three quantitative variables, H, M and W, I have to build a nonlinear model of this form: E(H)=b0+b1*M+(W/(b3+b4*M)). I tried using the "nls()" function in R, but I don't know how to determine the start values of the coefficients b0, b1, b3 and b4. Can...
  15. M

    Spurious Regression with non-stationary time series

    Hi everyone, I'd like confirmation that the following interpretation is correct: Let's say that we want to run a very simple regression like the following one: We are regressing two I(1) series, since x and y are both assumed to be described by a random walk process. The errors of...
  16. U

    Doubt on a model

    Hi everyone, for a university assignment I have to model one dependent variable, H, based on two independent variables, M and W. The model I have to fit to the data is this: E(H)=b0+b1*M+(W/(b3+b4*M)). Do you have any idea what kind of model this is and how it can be fitted in R?
  17. S

    Multiple Regression

    I have been given a college assignment and need to interpret these results (see attached). From what I can understand, the Annual Personal Outcome is the DV and there are several IVs (e.g., gender, sexual orientation, etc.). As such, has a multiple linear regression been conducted here? I'm...
  18. K

    Difference between simulating the dependent variable and simulating the error terms and adding them to the fitted values, assuming normality?

    What's the statistical difference between simulating the dependent variable and simulating the error terms and adding them to the fitted values, assuming normality (Gaussian GLM)? Say I'm doing a simple multiple regression on the following data (R): n <- 40; x1 <- rnorm(n, mean=3, sd=1)...
  19. S

    Treatment of Coefficients from Regression Using Lagged Independent Variables

    I'm running a regression on two time series of financial returns, one dependent and one explanatory/independent. For the explanatory time series, I'm creating several lagged versions and using all of them as independent variables in a multiple linear regression. My question is, how do I...
  20. L

    What statistical test should I use?

    Hi there, I'm currently carrying out a systematic review and I've gathered cost estimates from studies for a specific type of treatment (with two different approaches). I am interested in finding out whether these costs have decreased over time. I have 40 cost estimates for the...