0% found this document useful (0 votes)
105 views59 pages

Question #1 of 139

Uploaded by

hahaha hahaha
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
105 views59 pages

Question #1 of 139

Uploaded by

hahaha hahaha
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 59

2/2/23, 12:15 PM Kaplanlearn - Quiz

Question #1 of 139 Question ID: 1472238

When evaluating the fit of a machine learning algorithm, it is most accurate to state that:

A) precision is the percentage of correctly predicted classes out of total predictions.


accuracy is the ratio of correctly predicted positive classes to all predicted positive
B)
classes.
C) recall is the ratio of correctly predicted positive classes to all actual positive classes.
A is accuracy.
B is precision.
C is answer

Question #2 of 139 Question ID: 1508649

The unsupervised machine learning algorithm that reduces highly correlated features into
fewer uncorrelated composite variables by transforming the feature covariance matrix best
describes:

A) k-means clustering.
B) principal components analysis.
C) hierarchical clustering.
B. . K-means partitions observations into a fixed number (k) of non-overlapping clusters. Hierarchical
clustering is an unsupervised iterative algorithm used to build a hierarchy of clusters

Question #3 of 139 Question ID: 1472093

David Wellington, CFA, has estimated the following log-linear trend model: LN(xt) = b0 + b1t +
εt. Using six years of quarterly observations, 2001:I to 2006:IV, Wellington gets the following
estimated equation: LN(xt) = 1.4 + 0.02t. The first out-of-sample forecast of xt for 2007:I is
closest to:

A) 1.88.
B) 4.14.
C) 6.69.

C. Wellington's out-of-sample forecast of LN(xt) is 1.9 = 1.4 + 0.02 × 25, and e 1.9 = 6.69. (Six years of
quarterly observations, at 4 per year, takes us up to t = 24. The first time period after that is t = 25.)

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 1/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Question #4 of 139 Question ID: 1508647

Considering the various supervised machine learning algorithms, a penalized regression


where the penalty term is the sum of the absolute values of the regression coefficients best
describes:

A) support vector machine (SVM).


C
B) k-nearest neighbor (KNN).
C) least absolute shrinkage and selection operator (LASSO).

Question #5 of 139 Question ID: 1472193

Suppose you estimate the following model of residuals from an autoregressive model:

εt2 = 0.4 + 0.80εt-12 + µt, where ε = ε^

If the residual at time t is 2.0, the forecasted variance for time t+1 is:

A) 3.2.
B) 3.6.
C) 2.0.
B. The variance at t=t+1 is 0.4 + [0.80 (4.0)] = 0.4 + 3.2. = 3.6.

Question #6 of 139 Question ID: 1472210

Which supervised learning model is most appropriate (1) when the Y-variable is continuous
and (2) when the Y-variable is categorical

Continuous Y-variable Categorical Y-variable

A) Classification Neural Networks

B) Regression Classification
B
C) Decision trees Regression

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 2/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Question #7 of 139 Question ID: 1472142

Frank Batchelder and Miriam Yenkin are analysts for Bishop Econometrics. Batchelder and
Yenkin are discussing the models they use to forecast changes in China's GDP and how they
can compare the forecasting accuracy of each model. Batchelder states, "The root mean
squared error (RMSE) criterion is typically used to evaluate the in-sample forecast accuracy
of autoregressive models." Yenkin replies, "If we use the RMSE criterion, the model with the
largest RMSE is the one we should judge as the most accurate."

With regard to their statements about using the RMSE criterion:

A) Batchelder is incorrect; Yenkin is correct.


B) Batchelder is incorrect; Yenkin is incorrect. B

C) Batchelder is correct; Yenkin is incorrect.

Question #8 of 139 Question ID: 1472154

Barry Phillips, CFA, has estimated an AR(1) relationship (xt = b0 + b1 × xt-1 + et) and got the
following result: xt+1 = 0.5 + 1.0xt + et. Phillips should:

A) first difference the data because b1 = 1.

B) not first difference the data because b0 = 0.5 < 1.

C) not first difference the data because b1 − b0 = 1.0 − 0.5 = 0.5 < 1.
A. The condition b1 = 1 means that the series has a unit root and is not stationary. The
correct way to transform the data in such an instance is to first difference the data.

Question #9 of 139 Question ID: 1472185

Which of the following is a seasonally adjusted model?

A) (Salest - Sales t-1)= b0 + b1 (Sales t-1 - Sales t-2) + b2 (Sales t-4 - Sales t-5) + εt.

B) Salest = b0 + b1 Sales t-1 + b2 Sales t-2 + εt.

C) Salest = b1 Sales t-1+ εt.


A. This model is a seasonal AR with first differencing.

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 3/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Question #10 of 139 Question ID: 1472092

In the time series model: yt=b0 + b1 t + εt, t=1,2,...,T, the:

A) disturbance terms are autocorrelated.

B) change in the dependent variable per time period is b1.

C) disturbance term is mean-reverting.


B. The disturbance term should be independent and identically distributed. There is no reason to
expect the disturbance term to be mean-reverting, and if the residuals are autocorrelated,
the research should correct for that problem.

Question #11 of 139 Question ID: 1472155

Barry Phillips, CFA, has the following time series observations from earliest to latest: (5, 6, 5,
7, 6, 6, 8, 8, 9, 11). Phillips transforms the series so that he will estimate an autoregressive
process on the following data (1, -1, 2, -1, 0, 2, 0, 1, 2). The transformation Phillips employed
is called:

A) beta drift.
B) moving average.
C) first differencing.

C. Phillips obviously first differenced the data because the 1=6-5, -1=5-6, .... 1 = 9 - 8, 2 = 11 - 9.

Question #12 of 139 Question ID: 1472236

In big data projects, data exploration is least likely to encompass:

A) feature design.
B) feature engineering. A

C) feature selection.

Question #13 of 139 Question ID: 1472111

Trend models can be useful tools in the evaluation of a time series of data. However, there
are limitations to their usage. Trend models are not appropriate when which of the following
violations of the linear regression assumptions is present?

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 4/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

A) Model misspecification.
B) Heteroskedasticity.
C) Serial correlation.
C. One of the primary assumptions of linear regression is that the residual terms are not correlated
with each other. If serial correlation, also called autocorrelation, is present, then trend models are not
an appropriate analysis tool.

Question #14 of 139 Question ID: 1472140

David Brice, CFA, has used an AR(1) model to forecast the next period's interest rate to be
0.08. The AR(1) has a positive slope coefficient. If the interest rate is a mean reverting
process with an unconditional mean, a.k.a., mean reverting level, equal to 0.09, then which
of the following could be his forecast for two periods ahead?

A) 0.072.
B) 0.081.
C) 0.113.
B. As Brice makes more distant forecasts, each forecast will be closer to the unconditional mean. So, the two
period forecast would be between 0.08 and 0.09, and 0.081 is the only possible answer.

Question #15 of 139 Question ID: 1472209

Which of the following about unsupervised learning is most accurate?

Unsupervised learning has lower forecasting accuracy as compared to supervised


A)
learning.
B) Classification is an example of unsupervised learning algorithm.
C) There is no labeled data. C

Question #16 of 139 Question ID: 1472112

Rhonda Wilson, CFA, is analyzing sales data for the TUV Corp, a current equity holding in her
portfolio. She observes that sales for TUV Corp. have grown at a steadily increasing rate over
the past ten years due to the successful introduction of some new products. Wilson
anticipates that TUV will continue this pattern of success. Which of the following models is
most appropriate in her analysis of sales for TUV Corp?

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 5/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

A log-linear trend model, because the data series can be graphed using a straight,
A)
upward-sloping line.
A linear trend model, because the data series is equally distributed above and below
B)
the line and the mean is constant.
A log-linear trend model, because the data series exhibits a predictable, exponential
C)
growth trend.

Question #17 of 139 Question ID: 1472233

Which of the following uses of data is most accurately described as curation?

An investor creates a word cloud from financial analysts’ recent research reports
A)
about a company.
An analyst adjusts daily stock index data from two countries for their different
B)
market holidays.
A data technician accesses an offsite archive to retrieve data that has been stored
C)
there.
B. Curation is ensuring the quality of data, for example by adjusting for bad or missing data. Word clouds are
a visualization technique. Moving data from a storage medium to where they are needed is referred to as
transfer.
Bill Johnson, CFA, has prepared data concerning revenues from sales of winter clothing
made by Polar Corporation. This data is presented (in $ millions) in the following table:

Lagged Change Seasonal Lagged


Change In Sales
In Sales Change In Sales

Quarter Sales Y Y + (−1) Y + (−4)

2013.1 182

2013.2 74 −108

2013.3 78 4 −108

2013.4 242 164 4

2014.1 194 −48 164

2014.2 79 −115 −48 −108

2014.3 90 11 −115 4

2014.4 260 170 11 w

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 6/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Question #18 - 23 of 139 Question ID: 1472165

The preceding table will be used by Johnson to forecast values using:

A) an autoregressive model with a seasonal lag.


B) a log-linear trend model with a seasonal lag.
C) a serially correlated model with a seasonal lag.

A. AR model is a model that regressed on its past values.

Question #19 - 23 of 139 Question ID: 1472166

The value that Johnson should enter in the table in place of "w" is:

A) 164.
B) −48.
C) −115. A. seasonal lag.

Question #20 - 23 of 139 Question ID: 1472167

Imagine that Johnson prepares a change-in-sales regression analysis model with seasonality,
which includes the following:

Coefficients

Intercept −6.032

Lag 1 0.017

Lag 4 0.983

Based on the model, expected sales in the first quarter of 2015 will be closest to:

A) 210.
B) 155.
C) 190.
A. Substituting the 1-period lagged data from 2014.4 and the 4-period lagged data from
2014.1 into the model formula, change in sales is predicted to be 6.032 + (0.017 × 170) +
(0.983 × 48) = 50.326. Expected sales are 260 + (50.326) = 209.674.
https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 7/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Question #21 - 23 of 139 Question ID: 1472168

Johnson's model was most likely designed to incorporates correction for:

A) heteroskedasticity of model residuals.


B) cointegration in the time series.
C) nonstationarity in time series data.
C. Johnson's model transforms raw sales data by first differencing it and then modeling change in sales.
This is most likely an adjustment to make the data stationary for use in an AR model.

Question #22 - 23 of 139 Question ID: 1472169

To test for covariance-stationarity in the data, Johnson would most likely use a:

A) Dickey-Fuller test.
B) t-test.
C) Durbin-Watson test.
A. The Dickey-Fuller test for unit roots could be used to test whether the data is covariance
non-stationarity. The Durbin-Watson test is used for detecting serial correlation in the residuals of trend
models but cannot be used in AR models. A t-test is used to test for residual autocorrelation in AR
models.

Question #23 - 23 of 139 Question ID: 1472170

The presence of conditional heteroskedasticity of residuals in Johnson's model is would


most likely to lead to:

A) invalid standard errors of regression coefficients and invalid statistical tests.


B) invalid estimates of regression coefficients, but the standard errors will still be valid.
invalid standard errors of regression coefficients, but statistical tests will still be
C)
valid.
A

Question #24 of 139 Question ID: 1508640

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 8/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

The data below yields the following AR(1) specification: xt = 0.9 – 0.55xt-1 + Et , and the
indicated fitted values and residuals.

Time xt fitted values residuals

1 1 - -

2 -1 0.35 -1.35

3 2 1.45 0.55

4 -1 -0.2 -0.8

5 0 1.45 -1.45

6 2 0.9 1.1

7 0 -0.2 0.2

8 1 0.9 0.1

9 2 0.35 1.65

The following sets of data are ordered from earliest to latest. To test for ARCH, the
researcher should regress:

A) (1, 4, 1, 0, 4, 0, 1, 4) on (1, 1, 4, 1, 0, 4, 0, 1).


B) (-1.35, 0.55, -0.8, -1.45, 1.1, 0.2, 0.1, 1.65) on (0.35, 1.45, -0.2, 1.45, 0.9, -0.2, 0.9, 0.35).
(1.8225, 0.3025, 0.64, 2.1025, 1.21, 0.04, 0.01) on (0.3025, 0.64, 2.1025, 1.21, 0.04,
C)
0.01, 2.7225).
C. The test for ARCH is based on a regression of the squared residuals on their lagged values.
The squared residuals are (1.8225, 0.3025, 0.64, 2.1025, 1.21, 0.04, 0.01, 2.7225). So,
(1.8225, 0.3025, 0.64, 2.1025, 1.21, 0.04, 0.01) is regressed on (0.3025, 0.64, 2.1025, 1.21,
0.04, 0.01, 2.7225).

Question #25 of 139 Question ID: 1472144

William Zox, an analyst for Opal Mountain Capital Management, uses two different models
to forecast changes in the inflation rate in the United Kingdom. Both models were
constructed using U.K. inflation data from 1988-2002. In order to compare the forecasting
accuracy of the models, Zox collected actual U.K. inflation data from 2004-2005, and
compared the actual data to what each model predicted. The first model is an AR(1) model
that was found to have an average squared error of 10.429 over the 12 month period. The
second model is an AR(2) model that was found to have an average squared error of 11.642
over the 12 month period. Zox then computed the root mean squared error for each model
to use as a basis of comparison. Based on the results of his analysis, which model should
Zox conclude is the most accurate?

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 9/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

A) Model 1 because it has an RMSE of 5.21.


B) Model 1 because it has an RMSE of 3.23.
B
C) Model 2 because it has an RMSE of 3.41.

Yolanda Seerveld is an analyst studying the growth of sales of a new restaurant chain called
Very Vegan. The increase in the public's awareness of healthful eating habits has had a very
positive effect on Very Vegan's business. Seerveld has gathered quarterly data for the
restaurant's sales for the past three years. Over the twelve periods, sales grew from $17.2
million in the first quarter to $106.3 million in the last quarter. Because Very Vegan has
experienced growth of more than 500% over the three years, the Seerveld suspects an
exponential growth model may be more appropriate than a simple linear trend model.
However, she begins by estimating the simple linear trend model:

(sales)t = α + β × (Trend)t + εt

Where the Trend is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12.

Regression Statistics

Multiple R 0.952640

R2 0.907523

Adjusted R2 0.898275

Standard Error 8.135514

Observations 12

1st order autocorrelation coefficient of the


residuals: −0.075

ANOVA

df SS

Regression 1 6495.203

Residual 10 661.8659

Total 11 7157.069

Coefficients Standard Error

Intercept 10.0015 5.0071

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 10/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Trend 6.7400 0.6803

The analyst then estimates the following model:

(natural logarithm of sales)t = α + β × (Trend)t + εt

Regression Statistics

Multiple R 0.952028

R2 0.906357

Adjusted R2 0.896992

Standard Error 0.166686

Observations 12

1st order autocorrelation coefficient of the


residuals: −0.348

ANOVA

df SS

Regression 1 2.6892

Residual 10 0.2778

Total 11 2.9670

Coefficients Standard Error

Intercept 2.9803 0.1026

Trend 0.1371 0.0140

Seerveld compares the results based upon the output statistics and conducts two-tailed
tests at a 5% level of significance. One concern is the possible problem of autocorrelation,
and Seerveld makes an assessment based upon the first-order autocorrelation coefficient of
the residuals that is listed in each set of output. Another concern is the stationarity of the
data. Finally, the analyst composes a forecast based on each equation for the quarter
following the end of the sample.

Question #26 - 29 of 139 Question ID: 1472106

Are either of the slope coefficients statistically significant?


https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 11/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

A) The simple trend regression is not, but the log-linear trend regression is.
B) Yes, both are significant. B

C) The simple trend regression is, but not the log-linear trend regression.

Question #27 - 29 of 139 Question ID: 1472107

With respect to the possible problems of autocorrelation and nonstationarity, using the log-
linear transformation appears to have:

A) not improved the results for either possible problems.


B) improved the results for nonstationarity but not autocorrelation.
C) improved the results for autocorrelation but not nonstationarity.

Question #28 - 29 of 139 Question ID: 1472108

Using the simple linear trend model, the forecast of sales for Very Vegan for the first out-of-
sample period is:

A) $97.6 million.
A
B) $113.0 million.
C) $123.0 million.

Question #29 - 29 of 139 Question ID: 1472109

Using the log-linear trend model, the forecast of sales for Very Vegan for the first out-of-
sample period is:

A) $109.4 million.
B) $117.0 million. B

C) $121.2 million.

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 12/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Question #30 of 139 Question ID: 1472206

Alexis Popov, CFA, has estimated the following specification: xt = b0 + b1 × xt-1 + et. Which of
the following would most likely lead Popov to want to change the model's specification?

A) b0 < 0.

B) Correlation(et, et-2) is significantly different from zero.

C) Correlation(et, et-1) is not significantly different from zero.


B. If correlation(et, et-2) is not zero, then the model suffers from 2nd order serial correlation. Popov may
wish to try an AR(2) model. Both of the other conditions are acceptable in an AR(1) model.

Question #31 of 139 Question ID: 1472201

One choice a researcher can use to test for nonstationarity is to use a:

A) Breusch-Pagan test, which uses a modified t-statistic.


B) Dickey-Fuller test, which uses a modified t-statistic.

C) Dickey-Fuller test, which uses a modified χ2 statistic.


B. The Dickey-Fuller test estimates the equation (xt – xt-1) = b0 + (b1 - 1) * xt-1 + et and tests if H0 : (b1 –
1) = 0. Using a modified t-test, if it is found that (b1– 1) is not significantly different from zero, then it is
concluded that b1 must be equal to 1.0 and the series has a unit root.

Question #32 of 139 Question ID: 1472204

Alexis Popov, CFA, is analyzing monthly data. Popov has estimated the model xt = b0 + b1 ×
xt-1 + b2 × xt-2 + et. The researcher finds that the residuals have a significant ARCH process.
The best solution to this is to:

A) re-estimate the model using a seasonal lag.


B) re-estimate the model with generalized least squares. B
C) re-estimate the model using only an AR(1) specification.

Question #33 of 139 Question ID: 1472119

Which of the following statements regarding covariance stationarity is CORRECT?

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 13/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

The estimation results of an AR model involving a time series that is not covariance
A)
stationary are meaningless.
A time series that is covariance stationary may have residuals whose mean changes
B)
over time.
C) A time series may be both covariance stationary and heteroskedastic. A

Question #34 of 139 Question ID: 1472189

The table below shows the autocorrelations of the lagged residuals for the first differences
of the natural logarithm of quarterly motorcycle sales that were fit to the AR(1) model: (ln
salest − ln salest − 1) = b0 + b1(ln salest − 1 − ln salest − 2) + εt. The critical t-statistic at 5%
significance is 2.0, which means that there is significant autocorrelation for the lag-4
residual, indicating the presence of seasonality. Assuming the time series is covariance
stationary, which of the following models is most likely to CORRECT for this apparent
seasonality?

Lagged Autocorrelations of First Differences in the Log of Motorcycle Sales

Lag Autocorrelation Standard Error t-Statistic

1 −0.0738 0.1667 −0.44271

2 −0.1047 0.1667 −0.62807

3 −0.0252 0.1667 −0.15117

4 0.5528 0.1667 3.31614

A) ln salest = b0 + b1(ln salest − 1) − b2(ln salest − 4) + εt.


C
B) (ln salest − ln salest − 4) = b0 + b1(ln salest − 1 − ln salest − 2) + εt.

(ln salest − ln salest − 1) = b0 + b1(ln salest − 1 − ln salest − 2) + b2(ln salest − 4 − ln salest


C)
− 5 ) + ε t.

Jason Cranwell, CFA, has hypothesized that sales of luxury cars have grown at a constant
rate over the past 15 years.

Question #35 - 38 of 139 Question ID: 1472114


https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 14/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

After discussing the above matter with a colleague, Cranwell finally decides to use an
autoregressive model of order one i.e. AR(1) for the above data. Below is a summary of the
findings of the model:

b0 0.4563

b1 0.6874

Standard error 0.3745

R-squared 0.7548

Durbin Watson 1.23

F 12.63

Observations 180

Calculate the mean reverting level of the series.

A) 1.26.
B) 1.46. B

C) 1.66.

Question #36 - 38 of 139 Question ID: 1472115

Cranwell is aware that the Dickey Fuller test can be used to discover whether a model has a
unit root. He is also aware that the test would use a revised set of critical t-values. What
would it mean to Bert to reject the null of the Dickey Fuller test (Ho: g = 0) ?

A) There is no unit root.


B) There is a unit root and the model cannot be used in its current form.
C) There is a unit root but the model can be used if covariance-stationary.
A. The null hypothesis of g = 0 actually means that b1 – 1 = 0 , meaning that b1 = 1. Since we
have rejected the null, we can conclude that the model has no unit root.

Question #37 - 38 of 139 Question ID: 1472116

Cranwell would also like to test for serial correlation in his AR(1) model. To do this, Cranwell
should:

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 15/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

A) use the provided Durbin Watson statistic and compare it to a critical value.
B) use a t-test on the residual autocorrelations over several lags.
determine if the series has a finite and constant covariance between leading and
C)
lagged terms of itself.
B

Question #38 - 38 of 139 Question ID: 1472117

When using the root mean squared error (RMSE) criterion to evaluate the predictive power
of the model, which of the following is the most appropriate statement?

A) Use the model with the highest RMSE calculated using the in-sample data.
B) Use the model with the lowest RMSE calculated using the out-of-sample data.
C) Use the model with the lowest RMSE calculated using the in-sample data.

Question #39 of 139 Question ID: 1472191

Suppose you estimate the following model of residuals from an autoregressive model:

εt2 = 0.25 + 0.6ε2t-1 + µt, where ε = ε^

If the residual at time t is 0.9, the forecasted variance for time t+1 is:

A) 0.736.
B) 0.850.
A
C) 0.790.

Question #40 of 139 Question ID: 1472139

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 16/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

The regression results from fitting an AR(1) to a monthly time series are presented below.
What is the mean-reverting level for the model?

Model: ΔExpt = b0 + b1ΔExpt–1 + εt

Coefficients Standard Error t-Statistic p-value

Intercept 1.3304 0.0089 112.2849 < 0.0001

Lag-1 0.1817 0.0061 30.0125 < 0.0001

A) 0.6151.
B) 1.6258.
B
C) 7.3220.

Question #41 of 139 Question ID: 1472213

The degree to which a machine learning model retains its explanatory power when
predicting out-of-sample is most commonly described as:

A) predominance.
B) hegemony.
C) generalization.
C. Generalization describes the degree to which, when predicting out-of-sample, a machine
learning model retains its explanatory power.

Question #42 of 139 Question ID: 1472187

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 17/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

The table below shows the autocorrelations of the lagged residuals for quarterly theater
ticket sales that were estimated using the AR(1) model: ln(salest) = b0 + b1(ln salest − 1) + et.
Assuming the critical t-statistic at 5% significance is 2.0, which of the following is the most
likely conclusion about the appropriateness of the model? The time series:

Lagged Autocorrelations of the Log of Quarterly Theater Ticket Sales

Lag Autocorrelation Standard Error t-Statistic

1 −0.0738 0.1667 −0.44271

2 −0.1047 0.1667 −0.62807

3 −0.0252 0.1667 −0.15117

4 0.5528 0.1667 3.31614

A) would be more appropriately described with an MA(4) model.


B) contains seasonality.
C) contains ARCH (1) errors.
B. The time series contains seasonality as indicated by the strong and significant
autocorrelation of the lag-4 residual.

Question #43 of 139 Question ID: 1472235

The process of splitting a given text into separate words is best characterized as:

A) tokenization.
A
B) stemming.
C) bag-of-words.

Question #44 of 139 Question ID: 1508636

Consider the estimated model xt = -6.0 + 1.1 xt-1 + 0.3 xt-2 + εt that is estimated over 50

periods. The value of the time series for the 49th observation is 20 and the value of the time
series for the 50th observation is 22. What is the forecast for the 51st observation?

A) 23.
B) 24.2.

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 18/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

C) 30.2. B

Question #45 of 139 Question ID: 1472208

Which of the following statements about supervised learning is most accurate?

A) Supervised learning does not differentiate between tag and features.


B) Supervised learning requires human intervention in machine learning process.
Typical data analytics tasks for supervised learning include classification and
C)
prediction.

Question #46 of 139 Question ID: 1472094

Modeling the trend in a time series of a variable that grows at a constant rate with
continuous compounding is best done with:

A) a log-linear transformation of the time series.


B) a moving average model.
C) simple linear regression.
A

Question #47 of 139 Question ID: 1472128

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 19/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Troy Dillard, CFA, has estimated the following equation using quarterly data: xt = 93 - 0.5×xt–

1 + 0.1×xt– 4 + et. Given the data in the table below, what is Dillard's best estimate of the first

quarter of 2007?

Time Value

2005: I 62

2005: II 62

2005: III 66

2005: IV 66

2006: I 72

2006: II 70

2006: III 64

2006: IV 66

A) 67.20.
B) 66.40.
A
C) 66.60.

Question #48 of 139 Question ID: 1472234

Big data is most likely to suffer from low:

A) velocity.
B) veracity. B

C) variety.

Winston Collier, CFA, has been asked by his supervisor to develop a model for predicting the
warranty expense incurred by Premier Snowplow Manufacturing Company in servicing its
plows. Three years ago, major design changes were made on newly manufactured plows in
an effort to reduce warranty expense. Premier warrants its snowplows for 4 years or 18,000
miles, whichever comes first. Warranty expense is higher in winter months, but some of
Premier's customers defer maintenance issues that are not essential to keeping the

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 20/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

machines functioning to spring or summer seasons. The data that Collier will analyze is in
the following table (in $ millions):

Seasonal Lagged
Change in Lagged Change in
Warranty Change in
Quarter Warranty Warranty
Expense Warranty
Expense yt Expense yt-1
Expense yt-4

2002.1 103

2002.2 52 –51

2002.3 32 –20 –51

2002.4 68 +36 –20

2003.1 91 +23 +36

2003.2 44 –47 +23 –51

2003.3 30 –14 –47 –20

2003.4 60 +30 –14 +36

2004.1 77 +17 +30 +23

2004.2 38 –39 +17 –47

2004.3 29 –9 –39 –14

2004.4 53 +24 –9 +30

Winston submits the following results to his supervisor. The first is the estimation of a trend
model for the period 2002:1 to 2004:4. The model is below. The standard errors are in
parentheses.

(Warranty expense)t = 74.1 - 2.7* t + et

(14.37) (1.97)

R-squared = 16.2%

Winston also submits the following results for an autoregressive model on the differences in
the expense over the period 2004:to 2004:4. The model is below where "y" represents the
change in expense as defined in the table above. The standard errors are in parentheses.

yt = -0.7 - 0.07* yt-1 + 0.83* yt-4 + et

(0.643) (0.0222) (0.0186)

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 21/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

R-squared = 99.98%

After receiving the output, Collier's supervisor asks him to compute moving averages of the
sales data.

Question #49 - 52 of 139 Question ID: 1472158

Collier's supervisors would probably not want to use the results from the trend model for all
of the following reasons EXCEPT:
A
A) the model is a linear trend model and log-linear models are always superior.
B) the slope coefficient is not significant.
it does not give insights into the underlying dynamics of the movement of the
C)
dependent variable.

Question #50 - 52 of 139 Question ID: 1472159

For this question only, assume that Winston also ran an AR(1) model with the following
results:

yt = −0.9 − 0.23* yt −1 + et

R-squared = 78.3%

(0.823) (0.0222)

The mean reverting level of this model is closest to:

A) 1.16.
B) −0.73.
B
C) 0.77.

Question #51 - 52 of 139 Question ID: 1472160

Based on the autoregressive model, expected warranty expense in the first quarter of 2005
will be closest to:

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 22/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

A) $51 million.
C
B) $60 million.
C) $65 million.

Question #52 - 52 of 139 Question ID: 1472161

Based on the results, is there a seasonality component in the data?


A

A) Yes, because the coefficient on yt–4 is large compared to its standard error.

B) Yes, because the coefficient on yt is small compared to its standard error.

C) No, because the slope coefficients in the autoregressive model have opposite signs.
The coefficient on the 4th lag tests the seasonality component. The t-statistic is equal to 0.83/0.0186 =
44.62, which is greater than the critical t-value (5% LOS, 2-tailed, dof = 4) = 2.78

Question #53 of 139 Question ID: 1472134

The regression results from fitting an AR(1) model to the first-differences in enrollment
growth rates at a large university includes a Durbin-Watson statistic of 1.58. The number of
quarterly observations in the time series is 60. At 5% significance, the critical values for the
Durbin-Watson statistic are dl = 1.55 and du = 1.62. Which of the following is the most
accurate interpretation of the DW statistic for the model?

A) Since DW > dl, the null hypothesis of no serial correlation is rejected.

B) The Durbin-Watson statistic cannot be used with AR(1) models.

C) Since dl < DW < du, the results of the DW test are inconclusive.
B

Clara Holmes, CFA, is attempting to model the importation of an herbal tea into the United
States which last year was $54 million. She gathers 24 years of annual data, which is in
millions of inflation-adjusted dollars.

She computes the following equation:

(Tea Imports)t = 3.8836 + 0.9288 × (Tea Imports)t − 1 + et

t-statistics (0.9328) (9.0025)

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 23/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

R2 = 0.7942

Adj. R2 = 0.7844

SE = 3.0892

N = 23

Holmes and her colleague, John Briars, CFA, discuss the implication of the model and how
they might improve it. Holmes is fairly satisfied with the results because, as she says "the
model explains 78.44 percent of the variation in the dependent variable." Briars says the
model actually explains more than that.

Briars and Holmes decide to ask their company's statistician about the consequences of
serial correlation. Based on what Briars and Holmes tell the statistician, the statistician
informs them that serial correlation will only affect the standard errors and the coefficients
are still unbiased. The statistician suggests that they employ the Hansen method, which
corrects the standard errors for both serial correlation and heteroskedasticity.

Given the information from the statistician, Briars and Holmes decide to use the estimated
coefficients to make some inferences. Holmes says the results do not look good for the
future of tea imports because the coefficient on (Tea Import)t − 1 is less than one. This means
the process is mean reverting. Using the coefficients in the output, says Holmes, "we know
that whenever tea imports are higher than 41.810, the next year they will tend to fall.
Whenever the tea imports are less than 41.810, then they will tend to rise in the following
year." Briars agrees with the general assertion that the results suggest that imports will not
grow in the long run and tend to revert to a long-run mean, but he says the actual long-run
mean is 54.545. Briars then computes the forecast of imports three years into the future.

Question #54 - 57 of 139 Question ID: 1472096

With respect to the statement that the company's statistician made concerning the
consequences of serial correlation, assuming the company's statistician is competent, we
would most likely deduce that Holmes and Briars did not tell the statistician:

A) the sample size.


B) the model’s specification.
C) the value of the Durbin-Watson statistic.
B. Serial correlation will bias the standard errors. It can also bias the coefficient estimates in
an autoregressive model of this type. Thus, Briars and Holmes probably did not tell the
statistician the model is an AR(1) specification

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 24/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Question #55 - 57 of 139 Question ID: 1472097

The statistician's statement concerning the benefits of the Hansen method is:

correct, because the Hansen method adjusts for problems associated with both
A)
serial correlation and heteroskedasticity.
not correct, because the Hansen method only adjusts for problems associated with
B)
serial correlation but not heteroskedasticity.
not correct, because the Hansen method only adjusts for problems associated with
C)
heteroskedasticity but not serial correlation.
A

Question #56 - 57 of 139 Question ID: 1472098

Using the model's results, Briar's forecast for three years into the future is:

A) $47.151 million.
B) $54.543 million. C
C) $54.108 million.

Question #57 - 57 of 139 Question ID: 1472099

With respect to the comments of Holmes and Briars concerning the mean reversion of the
import data, the long-run mean value that:
A
A) Briars computes is correct.
B) Briars computes is not correct, and his conclusion is probably not accurate.
C) Briars computes is not correct, but his conclusion is probably accurate.

Question #58 of 139 Question ID: 1472218

What is the appropriate remedy in the presence of excessive number of features in a data
set?

A) Dimension reduction.
https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 25/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

B) Big data analysis.


C) Unsupervised learning. A

Question #59 of 139 Question ID: 1472162

Which of the following statements regarding unit roots in a time series is least accurate?

Even if a time series has a unit root, the predictions from the estimated model are
A)
valid.
B) A time series that is a random walk has a unit root.
C) A time series with a unit root is not covariance stationary.
A. A time series with a unit root will follow a random walk process. Since a time series that
follows a random walk is not covariance stationary, modeling such a time series in an AR
model can lead to incorrect statistical conclusions, and decisions made on the basis of
these conclusions may be wrong.
Bert Smithers, CFA, is a sell-side analyst who has been asked to look at the luxury car sector.
He has hypothesized that sales of luxury cars have grown at a constant rate over the past 15
years.

Exhibit 1

b0 0.4563

b1 0.6874

Standard error 0.3745

R-squared 0.7548

Durbin-Watson 1.23

F 12.63

Observations 15

20X1 sales ($bn) 1.05

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 26/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Question #60 - 65 of 139 Question ID: 1472172

If his assumption about a constant is correct, which of the following models is most
appropriate for modeling these data?

A) LuxCarSales = b0 + b1(t) + et. C

B) LuxCarSalest = b0 + b1LuxCarSales(t-1) + et.

C) ln(LuxCarSales) = b0 + b1(t) + et.

Question #61 - 65 of 139 Question ID: 1472173

After discussing the above matter with a colleague, Bert finally decides to use an annual
autoregressive model of Order One [i.e., AR(1)]. Using the data in Exhibit 1, calculate the
mean reverting level of the series.

A) 1.26.
B) 1.46. B

C) 1.66.

Question #62 - 65 of 139 Question ID: 1472174

Bert is aware that the Dickey Fuller test can be used to discover whether a model has a unit
root. He is also aware that the test would use a revised set of critical t-values. What would it
mean to Bert to reject the null of the Dickey Fuller test (Ho: g = 0)?

A) There is a unit root but the model can be used if covariance-stationary.


B) There is a unit root and the model cannot be used in its current form.
C) There is no unit root. C

Question #63 - 65 of 139 Question ID: 1472175

Bert would also like to test for serial correlation in his AR(1) model. How could this be done?
https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 27/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

A) use a t-test on the residual autocorrelations over several lags.


B) use the provided Durbin-Watson statistic and compare it to a critical value.
determine if the series has a finite and constant covariance between leading and
C)
lagged terms of itself.
A

Question #64 - 65 of 139 Question ID: 1472176

When using the root mean squared error (RMSE) criterion to evaluate the predictive power
of the model, which of the following is the most appropriate statement?

A) Use the model with the highest RMSE calculated using the in-sample data.
B) Use the model with the lowest RMSE calculated using the out-of-sample data.
C) Use the model with the lowest RMSE calculated using the in-sample data.
B

Question #65 - 65 of 139 Question ID: 1472177

Bert would like to use his AR(1) model to forecast future sales of luxury automobiles. What is
the annualized growth rate between today and 20X3?

A) 10%.
B) 12%.
A. The annualized return between 20X1 and 20X3 is, therefore, ((1.27 / 1.05)^0.5) – 1 =
C) 11%. 9.87%.

Question #66 of 139 Question ID: 1472146

The primary concern when deciding upon a time series sample period is which of the
following factors?

A) The length of the sample time period.


B) Current underlying economic and market conditions.
C) The total number of observations.
B. There will always be a tradeoff between the increase statistical reliability of a longer time
period and the increased stability of estimated regression coefficients with shorter time
periods. Therefore, the underlying economic environment should be the deciding factor
when selecting a time series sample period.
https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 28/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Question #67 of 139 Question ID: 1472135

The table below includes the first eight residual autocorrelations from fitting the first
differenced time series of the absenteeism rates (ABS) at a manufacturing firm with the
model ΔABSt = b0 + b1ΔABSt-1 + εt. Based on the results in the table, which of the following
statements most accurately describes the appropriateness of the specification of the model,
ΔABSt = b0 + b1ΔABSt-1 + εt?

Lagged Autocorrelations of the Residuals of the First Differences in Absenteeism


Rates

Lag Autocorrelation Standard Error t-Statistic

1 −0.0738 0.1667 −0.44271

2 −0.1047 0.1667 −0.62807

3 −0.0252 0.1667 −0.15117

4 −0.0157 0.1667 −0.09418

5 −0.1262 0.1667 −0.75705

6 0.0768 0.1667 0.46071

7 0.0038 0.1667 0.02280

8 −0.0188 0.1667 −0.11278

The Durbin-Watson statistic is needed to determine the presence of significant


A)
correlation of the residuals. C

The negative values for the autocorrelations indicate that the model does not fit the
B)
time series.
C) The low values for the t-statistics indicate that the model fits the time series.

Question #68 of 139 Question ID: 1472214

In machine learning, out-of-sample error equals:

A) Standard error plus data error plus prediction error.


B) forecast error plus expected error plus regression error. C

C) bias error plus variance error plus base error.

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 29/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Question #69 of 139 Question ID: 1472207

The technique in which a machine learns to model a set of output data from a given set of
inputs is best described as:

A) supervised learning. A

B) unsupervised learning.
C) deep learning.

Question #70 of 139 Question ID: 1472211

A rudimentary way to think of machine learning algorithms is that they:

A) “find the pattern, apply the pattern.”


B) “synthesize the pattern, review the pattern.”
C) “develop the pattern, interpret the pattern.”
A. Machine learning attempts to extract knowledge from large amounts of data by learning
from known examples in order to determine an underlying structure in the data. The focus
is on generating structure or predictions without human intervention.

Question #71 of 139 Question ID: 1472163

An AR(1) autoregressive time series model:

A) cannot be used to test for a unit root.


B) can be used to test for a unit root, which exists if the slope coefficient equals one.
can be used to test for a unit root, which exists if the slope coefficient is less than
C)
one.
B

Albert Morris, CFA, is evaluating the results of an estimation of the number of wireless
phone minutes used on a quarterly basis within the territory of Car-tel International, Inc.
Some of the information is presented below (in billions of minutes):

Wireless Phone Minutes (WPM)t = b0 + b1 WPMt–1 + εt

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 30/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

ANOVA Degrees of Freedom Sum of Squares Mean Square

Regression 1 7,212.641 7,212.641

Error 26 3,102.410 119.324

Total 27 10,315.051

Coefficients Coefficient Standard Error of the Coefficient

Intercept –8.0237 2.9023

WPMt–1 1.0926 0.0673

The variance of the residuals from one time period within the time series is not dependent
on the variance of the residuals in another time period.

Morris also models the monthly revenue of Car-tel using data over 96 monthly observations.
The model is shown below:

Sales (CAD$ millions) = b0 + b1 Salest−1 + εt

Coefficients Coefficient Standard Error of the Coefficient

Intercept 43.2 12.32

Salest−1 0.8867 0.4122

Question #72 - 75 of 139 Question ID: 1472101

The value for WPM this period is 544 billion. Using the results of the model, the forecast
Wireless Phone Minutes three periods in the future is:

A) 691.30.
B) 586.35. C

C) 683.18.

Question #73 - 75 of 139 Question ID: 1472102

The WPM model was specified as a(n):

A) Moving Average (MA) Model.


https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 31/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

B) Autoregressive (AR) Model.


C) Autoregressive (AR) Model with a seasonal lag. B

Question #74 - 75 of 139 Question ID: 1472103

The mean reverting level of monthly sales is closest to:

A) 381.29 million.
B) 8.83 million. A

C) 43.2 million.

Question #75 - 75 of 139 Question ID: 1472104

Morris concludes that the current price of Car-tel stock is consistent with single stage
constant growth model (with g=3%). Based on this information, the sales model is most
likely:

Incorrectly specified and first differencing the data would be an appropriate


A)
remedy.
B) Correctly specified.
Incorrectly specified and first differencing the natural log of the data would be an
C)
appropriate remedy.

C. Constant growth

Question #76 of 139 Question ID: 1472126

The model xt = b0 + b1 xt-1 + b2 xt-2 + b3 xt-3 + b4 xt-4 + εt is:

A) an autoregressive model, AR(4).


B) a moving average model, MA(4). A

C) an autoregressive conditional heteroskedastic model, ARCH.

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 32/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Hanna Kowalski is a senior fixed-income portfolio analyst at Czarnaskala BP. Kowalski


supervises Lena Nowak, who is a junior analyst.

Over the past several years, Kowalski has become aware that investment firms are
increasingly using technology to improve their investment decision making. Kowalski has
become particularly interested in machine learning techniques and how they might be
applied to investment management applications.

Kowalski has read a number of articles about machine learning in various journals for
financial analysts. However, she has only a minimal knowledge of how she might source
appropriate model inputs, interpret model outputs, and translate those outputs into
investment actions.

Kowalski and Nowak meet to discuss plans for incorporating machine learning into their
investment model. Kowalski asks Nowak to research machine learning and report back on
the types of investment problems that machine learning can address, how the algorithms
work, and what the various terminology means.

After spending a few hours researching the topic, Nowak makes a number of statements to
Kowalski on the topics of:

Classification and regression trees (CART)


Hierarchical clustering
Neural networks
Reinforcement learning (RL) algorithms

Kowalski is left to work out which of Nowak's statements are fully accurate and which are
not.

Question #77 - 80 of 139 Question ID: 1472228

Nowak first tries to explain classification and regression tree (CART) to Kowalski. CART is
least likely to be applied to predict a:

A) categorical target variable, producing a classification tree.


B) continuous target variable, producing a regression tree.
C
C) discrete target variable, producing a cardinal tree.

Question #78 - 80 of 139 Question ID: 1472229

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 33/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Which of the following statements Nowak makes about hierarchical clustering is most
accurate?

A) In divisive hierarchical clustering, the algorithm seeks out the two closest clusters.
Hierarchical clustering is a supervised iterative algorithm that is used to build a
B)
hierarchy of clusters.
Bottom-up hierarchical clustering begins with each observation being its own
C)
cluster.
C. Clustering is always unsupervised.

Question #79 - 80 of 139 Question ID: 1472230

Which of the following statements Nowak makes about neural networks is most accurate?
Neural networks:

have four types of layers: an input layer, agglomerative layers, regularization layers,
A)
and an output layer.
have an input layer node that consists of a summation operator and an activation
B)
function.
C) are effective in tasks with non-linearities and complex interactions among variables.
C. Neural networks have three types of layers: an input layer, hidden layers, and an output layer.
The hidden layer nodes (not the input layer nodes) each consist of a summation operator
and an activation function; these nodes are where learning takes place.

Question #80 - 80 of 139 Question ID: 1472231

Nowak tries to explain the reinforcement learning (RL) algorithm to Kowalski and makes a
number of statements about it. The reinforcement learning (RL) algorithm involves an agent
that is most likely to:

A) perform actions that will minimize costs over time.


B) make use of direct labeled data and instantaneous feedback.
C) take into consideration the constraints of its environment.
C. The reinforcement learning (RL) algorithm involves an agent that will perform actions that will maximize
its rewards over time, taking into consideration the constraints of the environment. Unlike supervised
learning, reinforcement learning has neither instantaneous feedback nor direct labeled data for each
observation.

Question #81 of 139 Question ID: 1472150

Which of the following statements regarding time series analysis is least accurate?
https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 34/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

An autoregressive model with two lags is equivalent to a moving-average model with


A)
two lags.
If a time series is a random walk, first differencing will result in covariance
B)
stationarity.
C) We cannot use an AR(1) model on a time series that consists of a random walk.
A

Freja Karlsson is a bond analyst with Storbank AB. Over the past several months, Karlsson
has been working to develop her own machine learning (ML) model that she plans to use to
predict default of the various bonds that she covers. The inputs to the model are various
pieces of financial data that Karlsson has compiled from multiple sources.

After Karlsson has constructed the model using her knowledge of appropriate variables,
Karlsson runs the model on the training set. Each firm's bonds are classified as predicted- to-
default or predicted-not-to-default. When Karlsson's model predicts that a bond will default
and the bond actually defaults, Karlsson considers this to be a true positive. Karlsson then
evaluates the performance of her model using error analysis. The confusion matrix that
results is shown in Exhibit 1.

N = 474 Actual Bond Status

Bond Default No Default

Model Prediction Bond Default 307 31

No Default 23 113

Question #82 - 85 of 139 Question ID: 1472240

Based on Exhibit 1, Karlsson's model's precision is closest to:

A) 91%.
B) 81%. A

C) 71%.

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 35/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Question #83 - 85 of 139 Question ID: 1472241

Karlsson is especially concerned about the possibility that her model may indicate that a
bond will not default, but then the bond actually defaults. Karlsson decides to use the
model's recall to evaluate this possibility. Based on the data in Exhibit 1, the model's recall is
closest to:

A) 83%.
B) 73%.
C) 93%. C

Question #84 - 85 of 139 Question ID: 1472242

Karlsson would like to gain a sense of her model's overall performance. In her research,
Karlsson learns about the F1 score, which she hopes will provide a useful measure. Based on
Exhibit 1, Karlsson's model's F1 score is closest to:

A) 72%.
B) 82%. C

C) 92%.

Question #85 - 85 of 139 Question ID: 1472243

Karlsson also learns of the model measure of accuracy. Based on Exhibit 1, Karlsson's
model's accuracy metric is closest to:

A) 79%.
B) 69%.
C) 89%.
C

Question #86 of 139 Question ID: 1472110

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 36/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Dianne Hart, CFA, is considering the purchase of an equity position in Book World, Inc, a
leading seller of books in the United States. Hart has obtained monthly sales data for the
past seven years, and has plotted the data points on a graph. Hart notices that the revenues
are growing at approximately 4.5% per year. Which of the following statements regarding
Hart's analysis of the data time series of Book World's sales is most accurate? Hart should
utilize a:

log-linear model to analyze the data because it is likely to exhibit a compound


A)
growth trend. A

mean-reverting model to analyze the data because the time series pattern is
B)
covariance stationary.
C) linear model to analyze the data because the mean appears to be constant.

Question #87 of 139 Question ID: 1508646

Consider the following estimated model:

(Salest - Sales t-1)= 100 - 1.5 (Sales t-1 - Sales t-2) + 1.2 (Sales t-4 - Sales t-5) t=1,2,.. T

and Sales for the periods 1999.1 through 2000.2:

t Period Sales

T 2000.2 $1,000

T-1 2000.1 $900

T-2 1999.4 $1,200

T-3 1999.3 $1,400

T-4 1999.2 $1,000

T-5 1999.1 $800

The forecasted Sales amount for 2000.3 is closest to:

A) $1,430.00.
B) $1,730.00. A

C) $730.00.

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 37/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Question #88 of 139 Question ID: 1472120

Which of the following is NOT a requirement for a series to be covariance stationary? The:

A) expected value of the time series is constant over time.


B) covariance of the time series with itself (lead or lag) must be constant.
C) time series must have a positive trend.
C

Question #89 of 139 Question ID: 1472212

Overfitting is least likely to result in:

A) inclusion of noise in the model.


B) higher number of features included in the data set.
C) higher forecasting accuracy in out-of-sample data.
C

Question #90 of 139 Question ID: 1472192

Which of the following is least likely a consequence of a model containing ARCH(1) errors?
The:

A) regression parameters will be incorrect.


B) model's specification can be corrected by adding an additional lag variable.
C) variance of the errors can be predicted.
B.The presence of autoregressive conditional heteroskedasticity (ARCH) indicates that the
variance of the error terms is not constant. This is a violation of the regression
assumptions upon which time series models are based. The addition of another lag
variable to a model is not a means for correcting for ARCH (1) error

Question #91 of 139 Question ID: 1472148

Which of the following statements regarding the instability of time-series models is most
accurate? Models estimated with:

a greater number of independent variables are usually more stable than those with
A)
a smaller number.
B) longer time series are usually more stable than those with shorter time series.
https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 38/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

C) shorter time series are usually more stable than those with longer time series.
C. Those models with a shorter time series are usually more stable because there is less opportunity
for variance in the estimated regression coefficients between the different time periods.

Question #92 of 139 Question ID: 1472205

Alexis Popov, CFA, wants to estimate how sales have grown from one quarter to the next on
average. The most direct way for Popov to estimate this would be:

A) an AR(1) model.
B) a linear trend model. B
C) an AR(1) model with a seasonal lag.

Vikas Rathod, an enrolled candidate for the CFA Level II examination, has decided to perform
a calendar test to examine whether there is any abnormal return associated with
investments and disinvestments made in blue-chip stocks on particular days of the week. As
a proxy for blue-chips, he has decided to use the S&P 500 index. The analysis will involve the
use of dummy variables and is based on the past 780 trading days. Here are selected
findings of his study:

RSS 0.0039

SSE 0.9534

SST 0.9573

R-squared 0.004

SEE 0.035

Jessica Jones, CFA, a friend of Rathod, overhears that he is interested in regression analysis
and warns him that whenever heteroskedasticity is present in multiple regression this could
undermine the regression results. She mentions that one easy way to spot conditional
heteroskedasticity is through a scatter plot, but she adds that there is a more formal test.
Unfortunately, she can't quite remember its name. Jessica believes that heteroskedasticity
can be rectified using White-corrected standard errors. Her son Jonathan who has also taken
part in the discussion, hears this comment and argues that White correction would typically
reduce the number of Type I errors in financial data.

Question #93 - 98 of 139 Question ID: 1508642

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 39/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

How many dummy variables should Rathod use?

A) Six.
B) Four. B

C) Five.

Question #94 - 98 of 139 Question ID: 1472196

What is most likely represented by the intercept of the regression?

A) The intercept is not a driver of returns, only the independent variables.


B) The drift of a random walk.
C) The return on a particular trading day. C

Question #95 - 98 of 139 Question ID: 1472197

What can be said of the overall explanatory power of the model at the 5% significance?

The coefficient of determination for the above regression is significantly higher than
A)
the standard error of the estimate, and therefore there is value to calendar trading.
B) There is value to calendar trading.
C) There is no value to calendar trading. C

Question #96 - 98 of 139 Question ID: 1508643

The test mentioned by Jessica is known as the:

A) Durbin-Watson, which is a two-tailed test.


B) Breusch-Pagan, which is a one-tailed test.
B
C) Breusch-Pagan, which is a two-tailed test.

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 40/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Question #97 - 98 of 139 Question ID: 1508644

Are Jessica and her son Jonathan, correct in terms of the method used to correct for
heteroskedasticity and the likely effects?

A) One is correct.
B) Neither is correct.
C
C) Both are correct.

Question #98 - 98 of 139 Question ID: 1472200

Assuming the a1 term of an ARCH(1) model is significant, the following can be forecast:

A) The square of the error term.


B) The variance of the error term.
C) A significant a1 implies that the ARCH framework cannot be used.
B. A Model is ARCH(1) if the coefficient a1 is significant. It will allow for the estimation of the
variance of the error term.

Question #99 of 139 Question ID: 1472219

Dimension reduction is most likely to be an example of:

A) supervised learning.
B) clustering.
C
C) unsupervised learning.

Question #100 of 139 Question ID: 1508648

Considering the various supervised machine learning algorithms, a linear classifier that
seeks the optimal hyperplane and is typically used for classification, best describes:

A) support vector machine (SVM).


B) k-nearest neighbor (KNN). A

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 41/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

C) classification and regression tree (CART).

Question #101 of 139 Question ID: 1472215

A random forest is least likely to:

A) provide a solution to overfitting problem.


B) be a classification tree.
C) reduce signal-to-noise ratio.

C. Since each tree only uses a subset of features, random forests can mitigate the problem of
overfitting. Because errors across different trees tend to cancel each other out, using random
forests can increase the signal-to-noise ratio.

Question #102 of 139 Question ID: 1472149

David Brice, CFA, has tried to use an AR(1) model to predict a given exchange rate. Brice has
concluded the exchange rate follows a random walk without a drift. The current value of the
exchange rate is 2.2. Under these conditions, which of the following would be least likely?

A) The residuals of the forecasting model are autocorrelated.


B) The process is not covariance stationary.
A
C) The forecast for next period is 2.2.

Question #103 of 139 Question ID: 1472138

Which of the following statements regarding a mean reverting time series is least accurate?

A) If the time-series variable is x, then xt = b0 + b1xt-1.

If the current value of the time series is above the mean reverting level, the
B)
prediction is that the time series will increase.
If the current value of the time series is above the mean reverting level, the
C)
prediction is that the time series will decrease. B

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 42/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Diem Le is analyzing the financial statements of McDowell Manufacturing. He has modeled


the time series of McDowell's gross margin over the last 16 years. The output is shown
below. Assume 5% significance level for all statistical tests.

Autoregressive Model

Gross Margin – McDowell Manufacturing

Quarterly Data: 1st Quarter 1985 to 4th Quarter 2000

Regression Statistics

R-squared 0.767

Standard error of forecast 0.049

Observations 64

Durbin-Watson 1.923 (not statistically significant)

Coefficient Standard Error t-statistic

Constant 0.155 0.052 ?????

Lag 1 0.240 0.031 ?????

Lag 4 0.168 0.038 ?????

Autocorrelation of Residuals

Lag Autocorrelation Standard Error t-statistic

1 0.015 0.129 ?????

2 –0.101 0.129 ?????

3 –0.007 0.129 ?????

4 0.095 0.129 ?????

Partial List of Recent Observations

Quarter Observation

4th Quarter 2002 0.250

1st Quarter 2003 0.260

2nd Quarter 2003 0.220

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 43/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

3rd Quarter 2003 0.200

4th Quarter 2003 0.240

Abbreviated Table of the Student's t-distribution (One-Tailed Probabilities)

df p = 0.10 p = 0.05 p = 0.025 p = 0.01 p = 0.005

50 1.299 1.676 2.009 2.403 2.678

60 1.296 1.671 2.000 2.390 2.660

70 1.294 1.667 1.994 2.381 2.648

Question #104 - 107 of 139 Question ID: 1472122

Le can conclude that the model is:

A) properly specified because the Durbin-Watson statistic is not significant.


B) properly specified because there is no evidence of autocorrelation in the residuals.
not properly specified because there is evidence of autocorrelation in the residuals
C)
and the Durbin-Watson statistic is not significant. B

Question #105 - 107 of 139 Question ID: 1472123

What is the forecast for the gross margin in the first quarter of 2004?

A) 0.246.
B) 0.250.
C
C) 0.256.

Question #106 - 107 of 139 Question ID: 1472124

With respect to heteroskedasticity in the model, we can definitively say:

A) nothing.

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 44/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

an ARCH process exists because the autocorrelation coefficients of the residuals


B)
have different signs.
C) heteroskedasticity is not a problem because the DW statistic is not significant.
A

Question #107 - 107 of 139 Question ID: 1472125

Supposing the time series is actually a random walk, which of the following approaches
would be appropriate prior to using an autoregressive model?

A) First differencing the time series.


A
B) ARCH.
C) Convert the time series by taking a natural log of the series.

Question #108 of 139 Question ID: 1472127

Troy Dillard, CFA, has estimated the following equation using semiannual data: xt = 44 +
0.1×xt– 1 – 0.25×xt– 2 - 0.15×xt– 3 + et. Given the data in the table below, what is Dillard's best
forecast of the second half of 2007?

Time Value

2003: I 31

2003: II 31

2004: I 33

2004: II 33

2005: I 36

2005: II 35

2006: I 32

2006: II 33

A) 34.36.
A
B) 34.05.

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 45/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

C) 33.74.

Question #109 of 139 Question ID: 1472141

Suppose that the time series designated as Y is mean reverting. If Yt+1 = 0.2 + 0.6 Yt, the best
prediction of Yt+1 is:

A) 0.3.
B) 0.8. C

C) 0.5.

Question #110 of 139 Question ID: 1472151

A time series x that is a random walk with a drift is best described as:

A) xt = b0 + b1xt − 1 + εt.

B) xt = b0 + b1 xt − 1.

C) xt = xt − 1 + εt. A

Question #111 of 139 Question ID: 1472237

Under which of these conditions is a machine learning model said to be underfit?

A) The input data are not labelled.


B
B) The model treats true parameters as noise.
C) The model identifies spurious relationships.

Question #112 of 139 Question ID: 1472136

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 46/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

An analyst modeled the time series of annual earnings per share in the specialty department
store industry as an AR(3) process. Upon examination of the residuals from this model, she
found that there is a significant autocorrelation for the residuals of this model. This indicates
that she needs to:
B
A) switch models to a moving average model.
B) revise the model to include at least another lag of the dependent variable.
C) alter the model to an ARCH model.

Question #113 of 139 Question ID: 1472152

Given an AR(1) process represented by xt+1 = b0 + b1×xt + et, the process would not be a
random walk if:

A) b1 = 1.
C
B) E(et)=0.

C) the long run mean is b0 / (1-b1).

Question #114 of 139 Question ID: 1472132

The model xt = b0 + b1 xt − 1 + b2 xt − 2 + b3 xt −12 + εt is an autoregressive model of type:

A) AR(12).
B) AR(2).
B
C) AR(1).

Question #115 of 139 Question ID: 1472145

Consider the estimated AR(2) model, xt = 2.5 + 3.0 xt-1 + 1.5 xt-2 + εt t=1,2,...50. Making a
prediction for values of x for 1 ≤ t ≤ 50 is referred to as:

A) an out-of-sample forecast.

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 47/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

B) an in-sample forecast.
C) requires more information to answer the question. B

Question #116 of 139 Question ID: 1472133

The procedure for determining the structure of an autoregressive model is:

estimate an autoregressive model (for example, an AR(1) model), calculate the


A) autocorrelations for the model's residuals, test whether the autocorrelations are
different from zero, and add an AR lag for each significant autocorrelation.
test autocorrelations of the residuals for a simple trend model, and specify the
B)
number of significant lags.
estimate an autoregressive model (e.g., an AR(1) model), calculate the
C) autocorrelations for the model's residuals, test whether the autocorrelations are
different from zero, and revise the model if there are significant autocorrelations.
C. The procedure is iterative: continually test for autocorrelations in the residuals and stop
adding lags when the autocorrelations of the residuals are eliminated. Even if several of
the residuals exhibit autocorrelation, the lags should be added one at a time.

Question #117 of 139 Question ID: 1472153

A time series that has a unit root can be transformed into a time series without a unit root
through:

A) first differencing.
A
B) mean reversion.
C) calculating moving average of the residuals.

Question #118 of 139 Question ID: 1472131

An analyst wants to model quarterly sales data using an autoregressive model. She has
found that an AR(1) model with a seasonal lag has significant slope coefficients. She also
finds that when a second and third seasonal lag are added to the model, all slope
coefficients are significant too. Based on this, the best model to use would most likely be an:

A) AR(1) model with no seasonal lags.

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 48/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

B) ARCH(1).
C
C) AR(1) model with 3 seasonal lags.

Question #119 of 139 Question ID: 1472118

To qualify as a covariance stationary process, which of the following does not have to be
true?

A) Covariance(xt, xt-1) = Covariance(xt, xt-2).

B) E[xt] = E[xt+1]. A

C) Covariance(xt, xt-2) = Covariance(xt, xt+2).

Question #120 of 139 Question ID: 1472143

Which of the following statements regarding an out-of-sample forecast is least accurate?

A) Forecasting is not possible for autoregressive models with more than two lags.
Out-of-sample forecasts are of more importance than in-sample forecasts to the
B)
analyst using an estimated time-series model.
There is more error associated with out-of-sample forecasts, as compared to in-
C)
sample forecasts.
A

Question #121 of 139 Question ID: 1472188

Which of the following statements regarding seasonality is least accurate?

Not correcting for seasonality when, in fact, seasonality exists in the time series
A)
results in a violation of an assumption of linear regression.
The presence of seasonality makes it impossible to forecast using a time-series
B)
model.
B

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 49/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

A time series that is first differenced can be adjusted for seasonality by


C) incorporating the first-differenced value for the previous year's corresponding
period.

Question #122 of 139 Question ID: 1508645

Consider the following estimated model:

(Salest - Sales t-1) = 30 + 1.25 (Sales t-1 - Sales t-2) + 1.1 (Sales t-4 - Sales t-5) t=1,2,.. T

and Sales for the periods 1999.1 through 2000.2:

t Period Sales

T 2000.2 $2,000

T-1 2000.1 $1,800

T-2 1999.4 $1,500

T-3 1999.3 $1,400

T-4 1999.2 $1,900

T-5 1999.1 $1,700

The forecasted Sales amount for 2000.3 is closest to:

A) $1,730.00.
B) $2,270.00.
A
C) $2,625.00.

Question #123 of 139 Question ID: 1508637

Suppose that the following time-series model is found to have a unit root:

Salest = b0 + b1 Sales t-1+ εt

What is the specification of the model if first differences are used?

A) Salest = b0 + b1 Sales t-1 + b2 Sales t-2 + εt.

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 50/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

B) Salest = b1 Sales t-1+ εt.

C) (Salest - Salest-1)= b0 + b1 (Sales t-1 - Sales t-2) + εt. C

Joyce Tan manages a medium-sized investment fund at Marina Bay Advisors that specializes
in international large cap equities. Over the four years that she has been portfolio manager,
Tan has been invested in approximately 40 stocks at a time.

Tan has used a number of methodologies to select investment opportunities from the
universe of investable stocks. In some cases, Tan uses quantitative measures such as
accounting ratios to find the most promising investment candidates. In other cases, her
team of analysts suggest investments based on qualitative factors and various investment
hypotheses.

Tan begins to wonder if her team could leverage financial technology to make better
decisions. Specifically, she has read about various machine learning techniques to extract
useful information from large financial datasets, in order to uncover new sources of alpha.

Question #124 - 127 of 139 Question ID: 1472223

Tan is interested in using a supervised learning algorithm to analyze stocks. This task is least
likely to be a classification problem if the target variable is:

A) categorical.
B) ordinal.
C
C) continuous.

Question #125 - 127 of 139 Question ID: 1472224

After Tan implements a particular new supervised machine learning algorithm, she begins to
suspect that the holdout samples she is using are reducing the training set size too much. As
a result, she begins to make use of K-fold cross-validation. In the K-fold cross-validation
technique, after Tan shuffles the data randomly it is most likely that:

A) k – 1 samples will be used as validation samples.


B) k – 1 samples will be used as training samples.
C) the data will be divided into k – 1 equal sub-samples.
B. In the K-fold cross-validation technique, the data is shuffled randomly and then divided into k equal sub-samples. One
sample is saved to be used as a validation sample, and the other k – 1 samples are used as training samples
https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 51/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Question #126 - 127 of 139 Question ID: 1472225

At first Tan bases her stock picks on the results of a single machine-learning model, but then
begins to wonder if she should instead be using the predictions of a group of models.
Compared to a single machine-learning model, an ensemble machine learning algorithm is
most likely to produce predictions that are:

A) more precise but less dependable.


B) more accurate and more stable. B

C) less reliable but more steady.

Question #127 - 127 of 139 Question ID: 1472226

Tan is interested in applying neural networks, deep learning nets, and reinforcement
learning to her investment process. Regarding these techniques, which of the following
statements is most accurate?

Reinforcement learning algorithms achieve maximum performance when they stay


A)
as far away from their constraints as possible.
Neural networks work well in the presence of non-linearities and complex
B)
interactions among variables.
Neural networks with one or more hidden layers would be considered deep learning
C)
nets (DLNs).
B. Neural networks with at least three hidden layers are known as deep learning nets (DLNs).
Reinforcement learning algorithms use an agent that will maximize its rewards over time,
within the constraints of its environment.
Housing industry analyst Elaine Smith has been assigned the task of forecasting housing
foreclosures. Specifically, Smith is asked to forecast the percentage of outstanding
mortgages that will be foreclosed upon in the coming quarter. Smith decides to employ
multiple linear regression and time series analysis.

Besides constructing a forecast for the foreclosure percentage, Smith wants to address the
following two questions:

Research Question Is the foreclosure percentage significantly affected by short-term


1: interest rates?

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 52/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Research Question Is the foreclosure percentage significantly affected by government


2: intervention policies?

Smith contends that adjustable rate mortgages often are used by higher risk borrowers and
that their homes are at higher risk of foreclosure. Therefore, Smith decides to use short-
term interest rates as one of the independent variables to test Research Question 1.

To measure the effects of government intervention in Research Question 2, Smith uses a


dummy variable that equals 1 whenever the Federal government intervened with a fiscal
policy stimulus package that exceeded 2% of the annual Gross Domestic Product. Smith sets
the dummy variable equal to 1 for four quarters starting with the quarter in which the policy
is enacted and extending through the following 3 quarters. Otherwise, the dummy variable
equals zero.

Smith uses quarterly data over the past 5 years to derive her regression. Smith's regression
equation is provided in Exhibit 1:

Exhibit 1: Foreclosure Share Regression Equation

foreclosure share = b0 + b1(ΔINT) + b2(STIM) + b3(CRISIS) + ε

where:

Foreclosure = the percentage of all outstanding mortgages foreclosed upon during


share the quarter

= the quarterly change in the 1-year Treasury bill rate (e.g., ΔINT = 2 for a
ΔINT
two percentage point increase in interest rates)

STIM = 1 for quarters in which a Federal fiscal stimulus package was in place

= 1 for quarters in which the median house price is one standard


CRISIS
deviation below its 5-year moving average

The results of Smith's regression are provided in Exhibit 2:

Exhibit 2: Foreclosure Share Regression Results

Variable Coefficient t-statistic

Intercept 3.00 2.40

ΔINT 1.00 2.22

STIM -2.50 -2.10

CRISIS 4.00 2.35

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 53/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

The ANOVA results from Smith's regression are provided in Exhibit 3:

Exhibit 3: Foreclosure Share Regression Equation ANOVA Table

Source Degrees of Freedom Sum of Squares Mean Sum of Squares

Regression 3 15 5.0000

Error 16 5 0.3125

Total 19 20

Smith expresses the following concerns about the test statistics derived in her regression:

If my regression errors exhibit conditional heteroskedasticity, my t-


Concern 1:
statistics will be underestimated.

If my independent variables are correlated with each other, my F-statistic


Concern 2:
will be overestimated.

Before completing her analysis, Smith runs a regression of the changes in foreclosure share
on its lagged value. The following regression results and autocorrelations were derived using
quarterly data over the past 5 years ( Exhibit 4 and Exhibit 5, respectively):

Exhibit 4. Lagged Regression Results

Δ foreclosure sharet = 0.05 + 0.25(Δ foreclosure sharet– 1)

Exhibit 5. Autocorrelation Analysis

Lag Autocorrelation t-statistic

1 0.05 0.22

2 -0.35 -1.53

3 0.25 1.09

4 0.10 0.44

Exhibit 6 provides critical values for the Student's t-Distribution

Exhibit 6: Critical Values for Student's t-Distribution

Area in Both Tails Combined

Degrees of Freedom 20% 10% 5% 1%

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 54/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

16 1.337 1.746 2.120 2.921

17 1.333 1.740 2.110 2.898

18 1.330 1.734 2.101 2.878

19 1.328 1.729 2.093 2.861

20 1.325 1.725 2.086 2.845

Question #128 - 133 of 139 Question ID: 1472179

The most appropriate interpretation from the foreclosure share regression equation model
is:

A) Multiple-R of the model is 0.87.


B) Variable STIM explains 37.5% of the variation in foreclosure share.
C) Multiple-R of the model is 0.75.
A. R2 = RSS/SST = 15/20 = 0.75
Multiple-R2 = (0.75)^0.50 = 0.87.

Question #129 - 133 of 139 Question ID: 1472180

Based on her regression results in Exhibit 2, using a 5% level of significance, Smith should
conclude that:

stimulus packages do not have significant effects on foreclosure percentages, but


A)
housing crises do have significant effects on foreclosure percentages.
stimulus packages have significant effects on foreclosure percentages, but housing
B)
crises do not have significant effects on foreclosure percentages.
both stimulus packages and housing crises have significant effects on foreclosure
C)
percentages.
A

Question #130 - 133 of 139 Question ID: 1508639

The standard error of estimate for Smith's regression is closest to:

A) 0.53.

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 55/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

B) 0.16.
C
C) 0.56.

Question #131 - 133 of 139 Question ID: 1472182

Is Smith correct or incorrect regarding Concerns 1 and 2?

A) Incorrect on both Concerns.


B) Correct on both Concerns.
C) Only correct on one concern and incorrect on the other.
A. Multicollinearity causes
standard errors for the regression coefficients to be too high, which, in turn, causes the tstatistics to be too
low. However, contrary to Smith's concern, multicollinearity has no effect on the F-statistic.

Question #132 - 133 of 139 Question ID: 1472183

The most recent change in foreclosure share was +1 percent. Smith decides to base her
analysis on the data and methods provided in Exhibit 4 and Exhibit 5, and determines that
the two-step ahead forecast for the change in foreclosure share (in percent) is 0.125, and
that the mean reverting value for the change in foreclosure share (in percent) is 0.071. Is
Smith correct?

Smith is correct on the mean-reverting level for forecast of change in foreclosure


A)
share only.
Smith is correct on the two-step ahead forecast for change in foreclosure share
B)
only.
B
C) Smith is correct on both the forecast and the mean reverting level.

Question #133 - 133 of 139 Question ID: 1472184

Assume for this question that Smith finds that the foreclosure share series has a unit root.
Under these conditions, she can most reliably regress foreclosure share against the change
in interest rates (ΔINT) if:
C
A) ΔINT does not have unit root.
B) ΔINT has unit root and is not cointegrated with foreclosure share.
https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 56/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

C) ΔINT has unit root and is cointegrated with foreclosure share.

Question #134 of 139 Question ID: 1472147

The main reason why financial and time series intrinsically exhibit some form of
nonstationarity is that:

most financial and economic relationships are dynamic and the estimated
A)
regression coefficients can vary greatly between periods.
most financial and time series have a natural tendency to revert toward their
B)
means.
serial correlation, a contributing factor to nonstationarity, is always present to a
C)
certain degree in most financial and time series.
A

Question #135 of 139 Question ID: 1472137

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 57/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

A monthly time series of changes in maintenance expenses (ΔExp) for an equipment rental
company was fit to an AR(1) model over 100 months. The results of the regression and the
first twelve lagged residual autocorrelations are shown in the tables below. Based on the
information in these tables, does the model appear to be appropriately specified? (Assume a
5% level of significance.)

Regression Results for Maintenance Expense Changes

Model: DExpt = b0 + b1DExpt–1 + et

Coefficients Standard Error t-Statistic p-value

Intercept 1.3304 0.0089 112.2849 < 0.0001

Lag-1 0.1817 0.0061 30.0125 < 0.0001

Lagged Residual Autocorrelations for Maintenance Expense Changes

Lag Autocorrelation t-Statistic Lag Autocorrelation t-Statistic

1 −0.239 −2.39 7 −0.018 −0.18

2 −0.278 −2.78 8 −0.033 −0.33

3 −0.045 −0.45 9 0.261 2.61

4 −0.033 −0.33 10 −0.060 −0.60

5 −0.180 −1.80 11 0.212 2.12

6 −0.110 −1.10 12 0.022 0.22

A) Yes, because most of the residual autocorrelations are negative.


B) Yes, because the intercept and the lag coefficient are significant. C
C) No, because several of the residual autocorrelations are significant.

Question #136 of 139 Question ID: 1472232

An executive describes her company's "low latency, multiple terabyte" requirements for
managing Big Data. To which characteristics of Big Data is the executive referring?

A) Velocity and variety.


B) Volume and variety. A

C) Volume and velocity.


https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 58/59
2/2/23, 12:15 PM Kaplanlearn - Quiz

Question #137 of 139 Question ID: 1472186

Barry Phillips, CFA, is analyzing quarterly data. He has estimated an AR(1) relationship (xt =
b0 + b1 × xt-1 + et) and wants to test for seasonality. To do this he would want to see if which
of the following statistics is significantly different from zero?

A) Correlation(et, et-5).

B) Correlation(et, et-1).

C) Correlation(et, et-4).
C. 4th lag is the value that corresponds to the same season
as the predicted variable in the analysis of quarterly data.

Question #138 of 139 Question ID: 1508635

Consider the estimated model xt = −6.0 + 1.1 xt − 1 + 0.3 xt − 2 + εt that is estimated over 50

periods. The value of the time series for the 49th observation is 20 and the value of the time
series for the 50th observation is 22. What is the forecast for the 52nd observation?

A) 27.22.
B) 42. A

C) 24.2.

Question #139 of 139 Question ID: 1472221

An algorithm that involves an agent that performs actions that will maximize its rewards
over time, taking into consideration the constraints of its environment, best describes:

A) deep learning nets.


B) reinforcement learning. B

C) neural networks.

https://www.kaplanlearn.com/education/dashboard/index/4c2245b62bb3fe16ad078887209a46c8/qbank/74340059/quiz/238857825/print 59/59

You might also like