Learning Module 10 Simple Linear Regression
50 questions available
Key Points
- OLS estimates b_hat1 and b_hat0 minimize sum of squared residuals.
- SST = SSR + SSE; R^2 = SSR / SST = r^2 in SLR.
- t-tests for slope and intercept use n - 2 df; F-test compares MSR to MSE.
- Prediction interval uses se and increases with distance of Xf from Xbar.
- Transformations (log/lin) change interpretation: log-log gives elasticity.
Key Points
- Examine residual plots to detect violations.
- Heteroskedasticity and autocorrelation affect standard errors and tests.
- Remedies: transform variables, weighted/GMM/GLS, add dummies, correct data errors.
- Outliers can distort slope and R^2; always inspect raw data and corrected values.
- Normality matters more in small samples; large samples may rely on CLT for inference.
Key Points
- t-statistic for slope uses SE that depends on se and X variance.
- F-statistic for model fit compares explained to unexplained variance; in SLR F = t^2.
- Prediction intervals incorporate model uncertainty and distance of Xf from Xbar.
- p-values quantify smallest alpha at which H0 would be rejected.
- Testing nonzero constants (e.g., slope = 1) uses the same t formula with hypothesized value.
Key Points
- Select functional form based on residual patterns and fit metrics.
- Log transformations change coefficient interpretation (elasticities and percent changes).
- Indicator variables let SLR compare group means and capture regime shifts.
- Always check and clean data; outliers can materially change results.
- Balance statistical significance with economic/practical significance.
Questions
In a simple linear regression of Y on X using OLS, which expression gives the estimated slope coefficient b_hat1?
View answer and explanationWhich equality holds in a correctly estimated simple linear regression with an intercept?
View answer and explanationIf the sample correlation between X and Y in SLR is r = 0.8 and SD(X)=2 and SD(Y)=5, what is the estimated slope b_hat1 (approx)?
View answer and explanationWhich statement best describes R-squared in simple linear regression?
View answer and explanationYou estimate SLR with n = 30 and find SSE = 180. What is the standard error of the estimate se?
View answer and explanationWhich assumption is violated if residuals plotted versus X show a clear U-shaped pattern?
View answer and explanationIn testing H0: b1 = 0 versus Ha: b1 ≠ 0 in SLR with n observations, what is the degrees of freedom for the t-statistic?
View answer and explanationIf sample size n increases while sample correlation r remains fixed, what happens to the t-statistic for testing r = 0?
View answer and explanationWhich of the following changes would reduce the standard error of the slope estimate SE(b_hat1) in SLR?
View answer and explanationIn SLR, the F-statistic for testing whether the model explains variance equals:
View answer and explanationWhich diagnostic plot would best help detect heteroskedasticity in a regression model?
View answer and explanationWhen residuals in a time-series regression show seasonally higher positive values every fourth quarter, which assumption is violated?
View answer and explanationYou estimate Y on X and obtain b_hat1 = 1.25, SE(b_hat1) = 0.3124, and df = 4. For a two-sided 5% test of H0: b1 = 0, the critical t is ±2.776. Which conclusion is correct?
View answer and explanationWhich change will widen a 95% prediction interval for Yhat at a specific Xf?
View answer and explanationWhat is the proper interpretation of the intercept b_hat0 in SLR?
View answer and explanationIf you regress monthly returns on an indicator variable EARN that equals 1 for months with earnings announcements and 0 otherwise, what does the slope coefficient represent?
View answer and explanationWhich functional form lets you interpret the slope b1 directly as the elasticity of Y with respect to X?
View answer and explanationYou fit SLR and find one observation has unusually large X and large residual; this point is best described as:
View answer and explanationWhich remedy is appropriate if residuals show increasing spread as X increases (heteroskedasticity)?
View answer and explanationIn SLR, you observe R^2 = 0.80 and se = 3.46. Which statement is most accurate?
View answer and explanationYou estimate ln(Y) = b0 + b1 X. A one-unit increase in X leads to what approximate change in Y?
View answer and explanationWhich is true regarding p-values reported for regression coefficients?
View answer and explanationYou have SLR with estimated b_hat1=0.98 and SE(b_hat1)=0.052. Test H0: b1 = 1.0 at 5% level (two-sided). Which result is correct? (t = (0.98-1)/0.052 = -0.385).
View answer and explanationWhich statement about the ANOVA decomposition SST = SSR + SSE is correct?
View answer and explanationYou forecast Y at Xf=6 given b_hat0=4.875, b_hat1=1.25. What is Yhat?
View answer and explanationWhich of the following increases the power of the t-test for a slope coefficient in SLR?
View answer and explanationIf residuals are not normally distributed in a small sample regression, which consequence is most direct?
View answer and explanationWhich of the following is an effect of an outlier caused by data entry error far from the bulk of observations?
View answer and explanationWhich statement about prediction intervals vs. confidence intervals for mean response is true?
View answer and explanationWhen comparing two nested models (Model A with only intercept, Model B with intercept and one X), which test evaluates whether X adds explanatory power?
View answer and explanationIf you estimate ln Y = 0.6 + 0.2951 FATO and SE of estimate se = 0.2631, what is interpretation of coefficient 0.2951?
View answer and explanationAn analyst finds slope p-value = 0.044 in a regression with n=6. What is correct inference at 5% level?
View answer and explanationWhich phrase best describes heteroskedasticity?
View answer and explanationYou estimate SLR for CPI forecasts: intercept 0.0001 (SE 0.0002), slope 0.9830 (SE 0.0155), n=60. Test H0: slope = 1.0 at 5% two-sided. t = (0.9830 - 1)/0.0155 ≈ -1.097. What conclusion?
View answer and explanationWhich data situation favors use of Spearman rank correlation over Pearson correlation?
View answer and explanationYou have SLR and want a prediction interval for Y at Xf. Which of these reduces width of that interval?
View answer and explanationWhich functional form would you try if scatter of Y vs X shows curvature with increasing slope (convex)?
View answer and explanationIn SLR, what does the standardized residual equal?
View answer and explanationWhich test statistic equals the t-statistic squared in simple linear regression?
View answer and explanationWhen should you prefer a lin-log model (Y = b0 + b1 ln X) over lin-lin?
View answer and explanationWhich of these is a direct consequence of autocorrelated residuals in a time-series regression?
View answer and explanationWhich statistic would you compute to examine whether residuals follow a normal distribution in a small-sample regression?
View answer and explanationYou fit SLR and obtain residuals with markedly fatter tails than normal in a small sample. Best action?
View answer and explanationWhich of these is a valid form for a log-lin regression where percent-change interpretation applies?
View answer and explanationIn a time-series SLR of revenue on time, residuals show upward jump each fourth quarter. A suitable regression modification is:
View answer and explanationWhich of the following best describes the standard error of the forecast sf used in prediction intervals?
View answer and explanationWhich of these indicates a good reason to use weighted least squares (WLS)?
View answer and explanationYou test H0: b0 ≤ 3 vs Ha: b0 > 3 and calculate t_intercept = 0.79 with critical one-sided t=2.132. Which decision?
View answer and explanationWhich of these best justifies transforming variables before regression (e.g., log transform)?
View answer and explanationWhich of the following is the most important first step before trusting regression outputs?
View answer and explanation