Hi All,
When I use a histogram to view my data, I get the following... (see age.png)
I'm trying to figure out the distribution...
From visual inspection, I assumed it would be lognormal. The data is left balanced and there's no "peaks" otherwise.
However, in R I try the following...
> s <- sum( (log(age+.1) - u )^2 ) / length(age)
> u <- sum(log(age+.1)) / length(age)
> s <- sum( (log(age+.1) - u )^2 ) / length(age)
> my_lnorm<-rlnorm(length(age), u, s)
> qqplot( my_lnorm, age )
(I add by .1 because some of the ages == 0. log(0) returns -Inf)
See qqplot.png for the result.
So according to the qq-plot, the data clearly does not match...
Is there a better way to determine distributions?





Reply With Quote



