<?xml version="1.0" encoding="ISO-8859-1"?>

<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/">
	<channel>
		<title>Statistics Help @ Talk Stats Forum - Applied Statistics</title>
		<link>http://www.talkstats.com/</link>
		<description>Statistics in finance, economics, engineering.  Actuarial science. Econometrics. Operations research.</description>
		<language>en</language>
		<lastBuildDate>Sun, 19 May 2013 04:06:11 GMT</lastBuildDate>
		<generator>vBulletin</generator>
		<ttl>60</ttl>
		<image>
			<url>http://www.talkstats.com/images/misc/rss.png</url>
			<title>Statistics Help @ Talk Stats Forum - Applied Statistics</title>
			<link>http://www.talkstats.com/</link>
		</image>
		<item>
			<title><![CDATA[Newbie to stats - comparing pre & post test results with missing data]]></title>
			<link>http://www.talkstats.com/showthread.php/43671-Newbie-to-stats-comparing-pre-amp-post-test-results-with-missing-data?goto=newpost</link>
			<pubDate>Fri, 17 May 2013 23:15:48 GMT</pubDate>
			<description><![CDATA[Hi 
 
I'm a complete beginner to stats and I can't work out how I can analyse some data with SPSS despite it being very simple on the surface.  
 
I...]]></description>
			<content:encoded><![CDATA[<div>Hi<br />
<br />
I'm a complete beginner to stats and I can't work out how I can analyse some data with SPSS despite it being very simple on the surface. <br />
<br />
I have 2 lists of quiz test scores. <br />
<br />
List 1 is scores for a group prior to receiving training.<br />
List 2 is scores for the same group after receiving training.<br />
<br />
None of the scores have any identifying participant information so I cannot match them against each other.<br />
<br />
Also not everyone took part in the post-training test.<br />
<br />
So for example: - <br />
<br />
Pre-training scores = 14, 1, 13, 10, 3, 13, 13, 11, 14, 5<br />
Post-training scores = 13, 15, 9, 15, 14, 15, 15<br />
<br />
That's it, that's all the data I have. I know how to work out the mean but is it possible to analyse this kind of data any further?<br />
<br />
I'm sorry for posting such a basic question but I'm hoping that a generous stats-wizard can help out a complete amateur!<br />
<br />
Thanks very much!</div>

 ]]></content:encoded>
			<category domain="http://www.talkstats.com/forumdisplay.php/19-Applied-Statistics">Applied Statistics</category>
			<dc:creator>miksima</dc:creator>
			<guid isPermaLink="true">http://www.talkstats.com/showthread.php/43671-Newbie-to-stats-comparing-pre-amp-post-test-results-with-missing-data</guid>
		</item>
		<item>
			<title>How would I determine significant factors for a conditional drop?</title>
			<link>http://www.talkstats.com/showthread.php/43639-How-would-I-determine-significant-factors-for-a-conditional-drop?goto=newpost</link>
			<pubDate>Thu, 16 May 2013 23:09:28 GMT</pubDate>
			<description>Hi, 
 
I have a financial data set of company-specific financial returns post a recurring event. I am interested in determining which factors would...</description>
			<content:encoded><![CDATA[<div>Hi,<br />
<br />
I have a financial data set of company-specific financial returns post a recurring event. I am interested in determining which factors would differentiate large negative returns from everything else. <br />
<br />
I first thought of LDA but the predictors and responses are very far from normal so I didn't attempt it. I tried using logistic regression. I set my response variable equal to one if the returns were less than -5% and 0 otherwise (arbitrary cut-off, I thought I would iterate through several choices and look for consistency). This model worked terribly. The only time it worked (~25% accurate predictions in-sample) was when I used a huge number of predictors, about 270. The AIC was higher than sparser models but the predictions were much better. Because of this I had no faith in it and I dropped it.<br />
<br />
My next idea was to use a much sparser linear regression model, about 20 predictors. First I fit the model and then I added interaction terms for all the coefficients one-by-one (add1 in R). I then made a list of of the coefficients that were statistically significant and, using a different sample, I classified the companies by deciles on each event date according to their standardized values of the significant predictors. I looked at the distribution of returns above and below a certain threshold in the deciles. An ideal outcome is when A) the predictors are statistically significant in the first sample, B) either decile 1 or 10 has more returns below the thresholds than anticipated (ie more than 10%) and C) the deciles that look good in B) also have less returns above a given threshold. <br />
<br />
Essentially I am looking for right skew in the tail deciles. <br />
<br />
I have tried switching from linear regression to quantile regression, using low values of tau. Both methods seem to work (for now, still working on it) in that the statistically significant predictors translate well into the out of sample decile tests. <br />
<br />
I would appreciate comments and suggestions about my method. I can simply skip the whole regression part and apply the decile test to everything but I feel this way I have more confidence in my model, do you agree? Do you have any improvements you can think of?<br />
<br />
Thanks a lot.</div>

 ]]></content:encoded>
			<category domain="http://www.talkstats.com/forumdisplay.php/19-Applied-Statistics">Applied Statistics</category>
			<dc:creator>badmax</dc:creator>
			<guid isPermaLink="true">http://www.talkstats.com/showthread.php/43639-How-would-I-determine-significant-factors-for-a-conditional-drop</guid>
		</item>
		<item>
			<title>Econometrics question</title>
			<link>http://www.talkstats.com/showthread.php/43616-Econometrics-question?goto=newpost</link>
			<pubDate>Thu, 16 May 2013 10:02:12 GMT</pubDate>
			<description><![CDATA[Econometrics question: http://i.imgur.com/dCHdGnm.png [1] 
I would like to check that my answers are correct: for a. I wrote that it would be a DF...]]></description>
			<content:encoded><![CDATA[<div>Econometrics question: <a href="http://i.imgur.com/dCHdGnm.png" target="_blank">http://i.imgur.com/dCHdGnm.png</a> [1]<br />
I would like to check that my answers are correct: for a. I wrote that it would be a DF test (rho - 1) / standard deviation (rho). H0= rho=1 , H1: |rho|&lt;1. The ciritical values would be drawn from a nonstand distribution under a null hypthesis of nonstationarity which is skewed to the right and so the critical values are smaller than that of a standard t test. And the test would be a one sided test.<br />
For part b, would it be a Phillips-Perron test?<br />
For part c I said it would be a standard T-test. (rho-1) / sd(rho). CV from t distribution. H0: rho = 1. H1: rho =/= 1.<br />
Is there anything else I should include in the answer?<br />
<br />
<br />
One further question: <a href="http://i.imgur.com/CeotSNN.png" target="_blank">http://i.imgur.com/CeotSNN.png</a> [2] Would I do IV and then do Random effects / GLS?</div>

 ]]></content:encoded>
			<category domain="http://www.talkstats.com/forumdisplay.php/19-Applied-Statistics">Applied Statistics</category>
			<dc:creator>Falc</dc:creator>
			<guid isPermaLink="true">http://www.talkstats.com/showthread.php/43616-Econometrics-question</guid>
		</item>
	</channel>
</rss>
