Holt’s Linear Trend

Basic Concepts

The data in Figure 3 of Simple Exponential Smoothing (as well as previous figures on that webpage) shows a distinct upward trend. The Moving Average and Simple Exponential Smoothing methods don’t adequately model this, but Holt’s Linear Trend Method (aka Double Exponential Smoothing) does. This is accomplished by adding a second single exponential smoothing model to capture the trend (either upwards or downwards). The model takes the following form for all i > 1.

image002z

image003z

image004z

image005z

where 0 ≤ α ≤ 1 and 0 < β ≤ 1.

An alternative form of these equations is

image002z

image006z

image007z

image005z

whereimage008z

Note that if β = 0, then the Holt model is equivalent to the Single Exponential Smoothing model.

Example specifying parameter values

Example 1: Redo Example 1 of Simple Exponential Smoothing using Holt’s Linear Trend Method where α = .4 and β = .7.

The result is shown in Figure 1. Here the cell C4 contains the formula =B4, cell D4 contains the value 0, cell C5 contains the formula =B$21*B5+(1-B$21)*(C4+D4), cell D5 contains the formula =C$21*(C5-C4)+(1-C$21)*D4 and cell E5 contains the formula =C4+D4.

Holt's linear trend Excel

Figure 1 – Holt’s Linear Trend

Optimizing parameter values

Example 2: Find the best-fit Holt’s approximation to the data in Example 1, using the MAE measure of accuracy.

Using Solver, we minimize the value of MAE (cell E21 of Figure 2) by changing the value in range B21:C21 subject to the constraint that B21 <= 1.0 and C21 <= 1.0.

The result shown in Figure 2 is that α = .271817 and β = .598161, with MAE = 6.74.

Optimization Holt's model Solver

Figure 2 – Holt’s Method Optimized

Note that the optimization approach, described above, using Solver is susceptible to finding a local minimum instead of a global minimum. In fact, as we will see in Example 1 of Holt’s Trend Confidence Interval, we can obtain values for alpha and beta that yield a slightly lower MAE value.

Forecasting

For any value of i, the forecast at time i+h is given by the formula

image009z

Example 3: Forecast the next 5 values in the times series from Example 2.

The result is shown in Figure 3.

Holt's linear trend forecast

Figure 3 – Forecast using Holt’s Method

The y and predicted y values shown in Figure 3 for i = 1 to 15 are the same as shown in Figure 2. The next five values are shown in range S19:S23. E.g. to calculate that ŷ16 = 95.16312, we use the formula =C$18+(Q19-Q$18)*D$18. We get the other four values by highlighting the range S19:S23 and pressing Ctrl-D.

The graph on the right side of Figure 3 shows that the forecasted values after i = 15 are linear.

Confidence Intervals and Data Analysis Tool

Click here for an example of how to obtain the standard errors and confidence intervals for the forecast obtained via the Holt’s Linear Trend option of the Basic Forecasting data analysis tool.

Reference

Hyndman, R. J., and Athanasopoulos, G. (2018) Holt’s linear trend method. Forecasting: principals and practice, 2nd Ed.
https://otexts.com/fpp2/holt.html

41 thoughts on “Holt’s Linear Trend”

  1. Prof. Zaiontz, there may be a bug in the latest version of your Resource Pack (or I may be incorrectly using it). I’m trying to apply Example 2, except I want to start forecasting in year 4. However, the output seems to ignore the “# of Lags”.

    In the Basic Forecasting dialog, my parameters are below.
    Input Range: ‘Holt 3’!$B$3:$B$18
    Method: Holt’s Linear Trend
    # of Lags: 3
    # of Seasons: 4
    # of Forecasts: 4
    Alpha: 0.4
    Beta: 0.7
    Optimize: MAE

    Reply
  2. Hi, this might be a stupid question, but we did the eqaution quite different in Uni.
    Can someone help me how to do the Bverton-Hault- Methode in Excel with this equation?
    P(t+1)=(1,4/(1+(0,4/20)*P(t))*P(t))
    with the Populations P1, P2, P3,P4, P5, P6, P7, P8, P9 P10 and inital value P(0)=5, P(0)=15 and P(0)=30
    Is there someone who might can help me?
    Thanks in advance!

    Reply
  3. Hi charles,

    Towards the end, you have stated y_(i+h) hat = u_i + v_i.
    I did not follow this.

    Shouldn’t the value of y_(i+1) continue to be calculated as u_i + v_i, that is y_(i+h) – u_(i+h-1) + v_(i+h-1)

    Regards

    Reply
  4. Hi Charles,

    How did you select the initial values of u and y for the Holt’s Linear Trend, and is it quite important?

    You take u_1 = y_1 and v_1 = 0, but in some books I see it expressed as u_1 = y_1 and v_1 = y_2 – y_1, among other variations.

    Thanks

    Reply
    • Peter,
      Yes, I use u_1 = y_1 and v_1 = 0. As you stated, other approaches are also used. For large time series this probably doesn’t matter much, but for smaller time series it might matter.
      Charles

      Reply
  5. i love this site .i am new in forecasting modelkng and know how to apply those all methods of forecasting but i need to know when to use which model for forecasting how would i know that any model is perfect for data ?
    is there any cheat sheet is available or any link if yes please share
    Thank you

    Reply
    • Hello Muhammad,
      Glad that you love the site.
      Which model to choose depends on the nature of the data. To over-simplify, if your data doesn’t have a seasonal component, then you don’t need to use Holt-Winter. If the data doesn’t have trend (a tendency for the data to increase or decrease over time), then you don’t need to use Holt’s Linear Trend and can probably use Exponential Smoothing.
      Charles

      Reply
  6. Hi Charles!

    I’m trying to graph Error, Trend and Seasonality to decide which model I should use. The Udacity course I’m taking uses Alteryx, and those values are automatically plotted under decomposition plots. I know how to find error, but how would I get the values for trend and seasonality from a dataset? Does column c or d represent trend? Thanks so much for this amazing website, and all your answers!

    Malak

    Reply
  7. Hi Charles,

    I am confusing if we do not have initial trend and initial level. Why should we simply use the first observation and the difference between the first two observations as starting level and trend?

    Reply
  8. Hello
    I have two funny questions , recently I got a little confused
    actually I want to ask :
    1. Holt’s method is the same method as double exponential smoothing method?
    2.Trends model and double exponential smoothing method are same?
    I m doing a assignment so I must name the forecasting methods and explain them by their formula so I need some help 🙂
    thank u

    Reply
    • 1. Yes, Holt’s method is the same method as double exponential smoothing method
      2. Double exponential smoothing is a type of Trend model, but I don’t think they are equivalent. To me, a trend model is a linear regression model with one independent variable, namely time (or something similar), but I am not sure this is the definition used my everyone.
      Charles

      Reply
  9. Hi Charles,

    I am wondering, how do you gain the regression for both holt and winters models? I am currently doing an assignment on forecasting and I am just trying to wrap my head around why you need this regression and how you gain it.

    Reply
    • Hello,
      The predicted value at each time depends on the value at the previous time. The first row corresponds to time 1 (the first time period). Since there is no previous time, you can’t calculate a predicted time for the first row.
      Charles

      Reply
  10. Hello Charles,
    Please pardon my ignorance, am new in the “art” of forecasting. All solutions seem to be forecasting in the same period. What about, if being asked to build a model that will forecast sales for the NEXT two years using Holt winter method. How do I go about it?

    Thanks

    Reply
  11. Great post Charles !

    Could you tell me:

    1.What are the acceptance criteria for the Holt’s model? I’ve read that some people use the RMSE / forecast value and if it’s less than 10%, you can use this model. In your example the model is not good for predictions 95,70649/95,16312=100,6%

    2. what sould we minimize for having best α and β MAE, MSE or RMSE ?

    Thanks !

    Paul

    Reply
    • Paul,
      Glad you like the post.
      1. The only real acceptance criteria for the Holt model (or any other model) is to see how good a job it does in correctly predicting future values. Even if the MAE, MSE or RMSE is very low, this only indicates that the model is a good fit for the existing (training) data. I don’t know of any universally agreed upon acceptable value for MAE, MSE, RMSE, etc.
      2. Minimizing MSE is equivalent to minimizing RMSE. If you want to penalize larger errors then MSE is generally better than MAE, otherwise MAE is the better choice.
      Charles

      Reply
  12. Dr. Zaiontz,
    I am trying to build a similar spreadsheet table to validate Excel’s FORECAST.ETS function because I am a bit wary of the quality of the forecast.
    1) I am wondering how this compares to Holt’s method
    Your work is always exemplary and sets a high mark for other scientists to follow
    2) Do you have either the Holt method or the ETA algorithm wks. in table form that you could email to me?

    Respectfully,
    Stephen Druley, Ph.D Theoretical Spatial Mathematics

    Reply

Leave a Reply to Paul Cancel reply