Methodology & Analysis
For this project, we want to determine the best exponential smoothing
model for this data set. To do so, we compare the accuracy of eight
models: simple exponential smoothing (SES), Holt's linear trend, damped
additive and damped exponential variations of Holt, and Holt-Winters
with additive or multiplicative seasonality, each with and without
damping.
test.homes = data.home.sales$Value[166:177]   ## hold-out (testing) observations
train.homes = data.home.sales$Value[1:165]    ## training observations
## note: the series fitted below uses rows 1:168, so it overlaps rows 166:168 of
## the testing window; passing train.homes instead would keep the hold-out separate
home=ts(data.home.sales$Value[1:168], start=100, frequency = 12)
fit1 = ses(home, h=12)
fit2 = holt(home, initial="optimal", h=12) ## initial states optimised with alpha and beta
fit3 = holt(home,damped=TRUE, h=12 ) ## additive damping
fit4 = holt(home,exponential=TRUE, damped=TRUE, h =12) ## multiplicative damp
fit5 = hw(home,h=12, seasonal="additive") ## hw() default is h = 2*frequency(y)
fit6 = hw(home,h=12, seasonal="multiplicative")
fit7 = hw(home,h=12, seasonal="additive",damped=TRUE)
fit8 = hw(home,h=12, seasonal="multiplicative",damped=TRUE)
accuracy.table = round(rbind(accuracy(fit1), accuracy(fit2), accuracy(fit3), accuracy(fit4),
accuracy(fit5), accuracy(fit6), accuracy(fit7), accuracy(fit8)),4)
row.names(accuracy.table)=c("SES","Holt Linear","Holt Add. Damped", "Holt Exp. Damped",
"HW Add.","HW Exp.","HW Add. Damp", "HW Exp. Damp")
kable(accuracy.table, caption = "The accuracy measures of various exponential smoothing models
based on the training data")
The accuracy measures of various exponential smoothing models based on the training data
Model            | ME      | RMSE    | MAE     | MPE     | MAPE   | MASE   | ACF1
SES              |  2.2136 | 48.4549 | 35.2626 |  0.0553 | 6.2339 | 0.4093 |  0.0004
Holt Linear      | -0.1598 | 48.4565 | 35.3792 | -0.4352 | 6.2890 | 0.4106 |  0.0035
Holt Add. Damped |  2.2835 | 48.4843 | 35.4168 |  0.0809 | 6.2728 | 0.4111 |  0.0011
Holt Exp. Damped |  2.6742 | 48.4464 | 35.3365 |  0.2005 | 6.2397 | 0.4101 | -0.0002
HW Add.          | -0.5234 | 48.3285 | 34.9578 | -0.4813 | 6.1911 | 0.4057 |  0.0029
HW Exp.          |  1.1751 | 53.2488 | 37.2653 | -0.2176 | 6.6320 | 0.4325 |  0.2248
HW Add. Damp     |  1.4851 | 48.3832 | 35.1746 | -0.1231 | 6.2356 | 0.4082 |  0.0022
HW Exp. Damp     |  1.0625 | 48.5948 | 35.5074 | -0.1541 | 6.2970 | 0.4121 |  0.0241
Looking at the results from the table above, the Holt-Winters additive
model appears to be the most appropriate exponential smoothing model. It
has the lowest RMSE, MAE, MAPE and MASE of the eight models. It does not
lead on every measure, however: the Holt exponential damped model has
the ACF1 value closest to zero, and other models have a smaller ME or
MPE in absolute value.
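As a quick cross-check of this reading, the table values can be placed in a matrix and the best model per measure looked up programmatically. This is only a minimal sketch in base R: the numbers are copied from the table, the column order is assumed to follow the standard output of `accuracy()` (ME, RMSE, MAE, MPE, MAPE, MASE, ACF1), and the names `acc` and `best` are illustrative. "Best" is taken to mean smallest absolute value, which is an assumption of this sketch.

```r
# Training-set accuracy values copied from the table (column order assumed).
acc <- rbind(
  "SES"              = c( 2.2136, 48.4549, 35.2626,  0.0553, 6.2339, 0.4093,  0.0004),
  "Holt Linear"      = c(-0.1598, 48.4565, 35.3792, -0.4352, 6.2890, 0.4106,  0.0035),
  "Holt Add. Damped" = c( 2.2835, 48.4843, 35.4168,  0.0809, 6.2728, 0.4111,  0.0011),
  "Holt Exp. Damped" = c( 2.6742, 48.4464, 35.3365,  0.2005, 6.2397, 0.4101, -0.0002),
  "HW Add."          = c(-0.5234, 48.3285, 34.9578, -0.4813, 6.1911, 0.4057,  0.0029),
  "HW Exp."          = c( 1.1751, 53.2488, 37.2653, -0.2176, 6.6320, 0.4325,  0.2248),
  "HW Add. Damp"     = c( 1.4851, 48.3832, 35.1746, -0.1231, 6.2356, 0.4082,  0.0022),
  "HW Exp. Damp"     = c( 1.0625, 48.5948, 35.5074, -0.1541, 6.2970, 0.4121,  0.0241)
)
colnames(acc) <- c("ME", "RMSE", "MAE", "MPE", "MAPE", "MASE", "ACF1")

# Assumption: for every measure, a smaller absolute value is better.
best <- rownames(acc)[apply(abs(acc), 2, which.min)]
names(best) <- colnames(acc)
print(best)
```

Printing `best` lists the winning model for each column, which makes it easy to see on which measures a single model leads.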
In addition to the accuracy table above, it is also helpful to see a
visual representation of the different exponential smoothing models and
to compare their forecasts with the original time series plot.
par(mfrow=c(2,1), mar=c(3,4,3,1))
###### plot the original data
pred.id = 166:177
plot(1:165, train.homes, lwd=2,type="o", ylab="Home Sales", xlab="",
xlim=c(1,177), ylim=c(200, 1200), cex=0.3,
main="Non-Seasonal Smoothing Models")
lines(pred.id, fit1$mean, col="red")
lines(pred.id, fit2$mean, col="blue")
lines(pred.id, fit3$mean, col="purple")
lines(pred.id, fit4$mean, col="navy")
##
points(pred.id, fit1$mean, pch=16, col="red", cex = 0.5)
points(pred.id, fit2$mean, pch=17, col="blue", cex = 0.5)
points(pred.id, fit3$mean, pch=19, col="purple", cex = 0.5)
points(pred.id, fit4$mean, pch=21, col="navy", cex = 0.5)
#points(fit0, col="black", pch=1)
legend("bottomright", lty=1, col=c("red","blue","purple", "navy"),pch=c(16,17,19,21),
c("SES","Holt Linear","Holt Linear Damped", "Holt Multiplicative Damped"),
cex = 0.7, bty="n")
###########
plot(1:165, train.homes, lwd=2,type="o", ylab="Home Sales", xlab="",
xlim=c(1,177), ylim=c(200, 1200), cex=0.3,
main="Holt-Winters Trend and Seasonal Smoothing Models")
lines(pred.id, fit5$mean, col="red")
lines(pred.id, fit6$mean, col="blue")
lines(pred.id, fit7$mean, col="purple")
lines(pred.id, fit8$mean, col="navy")
##
points(pred.id, fit5$mean, pch=16, col="red", cex = 0.5)
points(pred.id, fit6$mean, pch=17, col="blue", cex = 0.5)
points(pred.id, fit7$mean, pch=19, col="purple", cex = 0.5)
points(pred.id, fit8$mean, pch=21, col="navy", cex = 0.5)
###
legend("bottomright", lty=1, col=c("red","blue","purple", "navy"),pch=c(16,17,19,21),
c("HW Additive","HW Multiplicative","HW Additive Damped", "HW Multiplicative Damped"),
cex = 0.7, bty="n")
In the first graph above, the forecasts from the non-seasonal models are
flat or nearly flat, with minimal slope. In the second graph, the
Holt-Winters forecasts have a seasonal shape and variation that is
consistent with the pattern seen in the original time series plot.
Because the Holt-Winters additive model is among the models that
reproduce this pattern, the plots support it as the most appropriate
model.
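The shape of the non-seasonal forecasts follows directly from the smoothing recursions: SES carries only a level forward, and Holt's method carries a level plus a constant trend. The sketch below illustrates this with hand-rolled recursions in base R. It is not the `forecast` package's implementation: the function names, the simple initial values, the fixed smoothing parameters and the toy series are all assumptions made for illustration.

```r
# Hand-rolled point forecasts, for illustration only.
ses_forecast <- function(y, alpha, h) {
  level <- y[1]                        # assumed initial level: first observation
  for (t in 2:length(y)) {
    level <- alpha * y[t] + (1 - alpha) * level
  }
  rep(level, h)                        # the same value at every horizon: a flat line
}

holt_forecast <- function(y, alpha, beta, h) {
  level <- y[1]                        # assumed initial level
  trend <- y[2] - y[1]                 # assumed initial trend: first difference
  for (t in 2:length(y)) {
    prev_level <- level
    level <- alpha * y[t] + (1 - alpha) * (prev_level + trend)
    trend <- beta * (level - prev_level) + (1 - beta) * trend
  }
  level + (1:h) * trend                # a straight line with no seasonal term
}

y <- c(10, 12, 14, 16, 18)             # made-up toy series
ses_fc  <- ses_forecast(y, alpha = 0.5, h = 3)
holt_fc <- holt_forecast(y, alpha = 0.5, beta = 0.5, h = 3)
print(ses_fc)
print(holt_fc)
```

Neither recursion contains a seasonal component, so neither can reproduce a repeating within-year pattern. Holt-Winters adds a seasonal term for exactly that purpose.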
This analysis also splits the data into a training set, used to fit the
candidate models, and a testing set, used to check how well each model
forecasts observations it has not seen. Before the chosen model is used
for real forecasting, it needs to be refit on the entire data set so
that the smoothing parameters of the final working model are updated.
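The hold-out comparison relies on error measures computed from the testing observations and the forecasts. As a small worked illustration, the sketch below computes MSE and MAPE for made-up numbers. The function name `holdout_accuracy` and the toy vectors are hypothetical, and the percentage error here is divided by the actual values (the textbook MAPE convention), which is an assumption of this sketch.

```r
# Hold-out accuracy on toy numbers; all values below are made up.
holdout_accuracy <- function(actual, forecast) {
  err <- actual - forecast                 # forecast errors
  c(MSE  = mean(err^2),                    # mean squared error
    MAPE = mean(abs(100 * err / actual)))  # mean absolute percentage error
}

actual   <- c(100, 200, 400)
forecast <- c(110, 180, 400)
res <- holdout_accuracy(actual, forecast)
print(res)
```

MSE penalises large misses more heavily because errors are squared, while MAPE expresses the average miss relative to the size of the series, so the two measures can rank models differently.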
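acc.fun = function(test.data, mod.obj){
## percentage error; note it is scaled by the forecast (mod.obj$mean), whereas
## the conventional MAPE divides by the actual value (test.data)
PE=100*(test.data-mod.obj$mean)/mod.obj$mean
MAPE = mean(abs(PE))
## squared-error loss over the hold-out period
E=test.data-mod.obj$mean
MSE=mean(E^2)
## return both measures as a named vector
accuracy.metric=c(MSE=MSE, MAPE=MAPE)
accuracy.metric
}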
pred.accuracy = rbind(SES =acc.fun(test.data=test.homes, mod.obj=fit1),
Holt.Add =acc.fun(test.data=test.homes, mod.obj=fit2),
Holt.Add.Damp =acc.fun(test.data=test.homes, mod.obj=fit3),
Holt.Exp =acc.fun(test.data=test.homes, mod.obj=fit4),
HW.Add =acc.fun(test.data=test.homes, mod.obj=fit5),
HW.Exp =acc.fun(test.data=test.homes, mod.obj=fit6),
HW.Add.Damp =acc.fun(test.data=test.homes, mod.obj=fit7),
HW.Exp.Damp =acc.fun(test.data=test.homes, mod.obj=fit8))
kable(pred.accuracy, caption="The accuracy measures of various exponential smoothing models
based on the testing data")
The accuracy measures of various exponential smoothing models based on the testing data
Model         | MSE      | MAPE
SES           | 2510.071 | 6.258430
Holt.Add      | 1554.106 | 4.657823
Holt.Add.Damp | 2512.571 | 6.262516
Holt.Exp      | 2511.084 | 6.260095
HW.Add        | 1737.009 | 5.053927
HW.Exp        | 2306.645 | 6.038416
HW.Add.Damp   | 3082.966 | 7.147686
HW.Exp.Damp   | 3664.936 | 8.133387
According to the accuracy table above, the Holt additive (linear trend)
model is actually the best of the eight smoothing models on the testing
data, with the lowest MSE (1554.106) and MAPE (4.66). The Holt-Winters
additive model is a close second (MSE 1737.009, MAPE 5.05). This comes
as a surprise, considering that the Holt-Winters additive model
performed better in the earlier comparisons. Since the Holt-Winters
additive model was more often preferred over Holt additive, and since it
is a close second here, we are going to move forward with the
Holt-Winters additive model. However, it is still important to
acknowledge that the testing-data accuracy table does not rank it as the
number one model choice.
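Once a model has been chosen, it is refit on the full series before producing real forecasts. The sketch below outlines that workflow under clearly stated assumptions: it uses base R's `HoltWinters()` as a stand-in for the `hw()` function used in this analysis (the two estimate their parameters differently), and the built-in `AirPassengers` series stands in for the home sales data. All object names are illustrative.

```r
# AirPassengers (a monthly series shipped with R) stands in for the home sales data.
y     <- AirPassengers
train <- window(y, end = c(1959, 12))      # everything except the last 12 months
test  <- window(y, start = c(1960, 1))     # last 12 months held out

# Step 1: fit on the training window only and score on the hold-out.
fit_train   <- HoltWinters(train, seasonal = "additive")
fc_test     <- predict(fit_train, n.ahead = length(test))
holdout_mse <- mean((as.numeric(test) - as.numeric(fc_test))^2)

# Step 2: refit on the full series so the smoothing parameters use all the data,
# then forecast the next 12 months beyond the end of the sample.
fit_full  <- HoltWinters(y, seasonal = "additive")
fc_future <- predict(fit_full, n.ahead = 12)

print(holdout_mse)
print(c(alpha = unname(fit_full$alpha),
        beta  = unname(fit_full$beta),
        gamma = unname(fit_full$gamma)))
```

The hold-out fit is used only to judge the model. The forecasts that would actually be reported come from the refit in step 2, whose smoothing parameters may differ slightly from those estimated on the training window.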
