In Example 5.1, we compared PE ratios on the NYSE and NASDAQ exchanges (see Table 5.1). Use a simple linear regression to model this data, where y is the
PE ratio, and x is a 0 if the stock is from the NYSE exchange and a 1 if from the NASDAQ exchange.
(a) In words, how would you interpret β0
and β1
for this definition of x?
(b) Compare the results of the parameter estimates from your regression to the sample means within the groups. Are these consistent with your interpretations in (a)?
(c) How does the t test for nonzero β1
from your regression compare to the pooled t test value? How does it compare to the unequal variance independent samples t test?
(d) What assumptions, if any, are violated by treating this data as a regression?
Example 5.1
The price-to-earnings ratio (PE) is a measure that is commonly used to evaluate whether a stock may be overpriced. The data in Table 5.1 represent small random samples of stocks having reported PE on the NASDAQ and NYSE stock exchanges (as of 1/30/2020). Do the two stock exchanges differ with respect to the PE of their stocks? The left panel in Figure 5.1 shows that the PE is very positively skewed but suggests that the values may tend to be higher on the NASDAQ exchange. The right panel is the box plot for the transformed data (LNPE 5 ln(10 1 PE), where adding 10 is necessary because one value is negative). These data are still skewed, but much less so than for the raw data. While this panel also suggests that the typical values are higher on the NASDAQ, we will see that the actual evidence for this is surprisingly weak.