Back to Influxdb

holtWinters() function

content/flux/v0/stdlib/universe/holtwinters.md

latest9.5 KB
Original Source
<!------------------------------------------------------------------------------ IMPORTANT: This page was generated from comments in the Flux source code. Any edits made directly to this page will be overwritten the next time the documentation is generated. To make updates to this documentation, update the function comments above the function definition in the Flux source code: https://github.com/influxdata/flux/blob/master/stdlib/universe/universe.flux#L984-L996 Contributing to Flux: https://github.com/influxdata/flux#contributing Fluxdoc syntax: https://github.com/influxdata/flux/blob/master/docs/fluxdoc.md ------------------------------------------------------------------------------->

holtWinters() applies the Holt-Winters forecasting method to input tables.

The Holt-Winters method predicts n seasonally-adjusted values for the specified column at the specified interval. For example, if interval is six minutes (6m) and n is 3, results include three predicted values six minutes apart.

Seasonality

seasonality delimits the length of a seasonal pattern according to interval. If the interval is two minutes (2m) and seasonality is 4, then the seasonal pattern occurs every eight minutes or every four data points. If your interval is two months (2mo) and seasonality is 4, then the seasonal pattern occurs every eight months or every four data points. If data doesn’t have a seasonal pattern, set seasonality to 0.

Space values at even time intervals

holtWinters() expects values to be spaced at even time intervales. To ensure values are spaced evenly in time, holtWinters() applies the following rules:

  • Data is grouped into time-based "buckets" determined by the interval.
  • If a bucket includes many values, the first value is used.
  • If a bucket includes no values, a missing value (null) is added for that bucket.

By default, holtWinters() uses the first value in each time bucket to run the Holt-Winters calculation. To specify other values to use in the calculation, use aggregateWindow to normalize irregular times and apply an aggregate or selector transformation.

Fitted model

holtWinters() applies the Nelder-Mead optimization to include "fitted" data points in results when withFit is set to true.

Null timestamps

holtWinters() discards rows with null timestamps before running the Holt-Winters calculation.

Null values

holtWinters() treats null values as missing data points and includes them in the Holt-Winters calculation.

Function type signature
js
(
    <-tables: stream[A],
    interval: duration,
    n: int,
    ?column: string,
    ?seasonality: int,
    ?timeColumn: string,
    ?withFit: bool,
    ?withMinSSE: bool,
) => stream[B] where A: Record, B: Record

{{% caption %}} For more information, see Function type signatures. {{% /caption %}}

Parameters

n

({{< req >}}) Number of values to predict.

interval

({{< req >}}) Interval between two data points.

withFit

Return fitted data in results. Default is false.

column

Column to operate on. Default is _value.

timeColumn

Column containing time values to use in the calculating. Default is _time.

seasonality

Number of points in a season. Default is 0.

withMinSSE

Return minSSE data in results. Default is false.

minSSE is the minimum sum squared error found when optimizing the holt winters fit to the data. A smaller minSSE means a better fit. Examining the minSSE value can help understand when the algorithm is getting a good fit versus not.

tables

Input data. Default is piped-forward data (<-).

Examples

Use holtWinters to predict future values

js
import "sampledata"

sampledata.int()
    |> holtWinters(n: 6, interval: 10s)

{{< expand-wrapper >}} {{% expand "View example input and output" %}}

Input data

_time_value*tag
2021-01-01T00:00:00Z-2t1
2021-01-01T00:00:10Z10t1
2021-01-01T00:00:20Z7t1
2021-01-01T00:00:30Z17t1
2021-01-01T00:00:40Z15t1
2021-01-01T00:00:50Z4t1
_time_value*tag
2021-01-01T00:00:00Z19t2
2021-01-01T00:00:10Z4t2
2021-01-01T00:00:20Z-3t2
2021-01-01T00:00:30Z19t2
2021-01-01T00:00:40Z13t2
2021-01-01T00:00:50Z1t2

Output data

*tag_time_value
t12021-01-01T00:01:00Z10.955056783457671
t12021-01-01T00:01:10Z10.929416756584807
t12021-01-01T00:01:20Z10.913930733843678
t12021-01-01T00:01:30Z10.904983736080599
t12021-01-01T00:01:40Z10.89992727039632
t12021-01-01T00:01:50Z10.897102216255147
*tag_time_value
t22021-01-01T00:01:00Z6.791274364986771
t22021-01-01T00:01:10Z6.791274496071822
t22021-01-01T00:01:20Z6.791274497934493
t22021-01-01T00:01:30Z6.791274497960961
t22021-01-01T00:01:40Z6.791274497961337
t22021-01-01T00:01:50Z6.791274497961342

{{% /expand %}} {{< /expand-wrapper >}}

Use holtWinters with seasonality to predict future values

js
import "sampledata"

sampledata.int()
    |> holtWinters(n: 4, interval: 10s, seasonality: 4)

{{< expand-wrapper >}} {{% expand "View example input and output" %}}

Input data

_time_value*tag
2021-01-01T00:00:00Z-2t1
2021-01-01T00:00:10Z10t1
2021-01-01T00:00:20Z7t1
2021-01-01T00:00:30Z17t1
2021-01-01T00:00:40Z15t1
2021-01-01T00:00:50Z4t1
_time_value*tag
2021-01-01T00:00:00Z19t2
2021-01-01T00:00:10Z4t2
2021-01-01T00:00:20Z-3t2
2021-01-01T00:00:30Z19t2
2021-01-01T00:00:40Z13t2
2021-01-01T00:00:50Z1t2

Output data

*tag_time_value
t12021-01-01T00:01:00Z4.689501667763805
t12021-01-01T00:01:10Z9.89174309681615
t12021-01-01T00:01:20Z3.586846840716385
t12021-01-01T00:01:30Z6.51792718960969
*tag_time_value
t22021-01-01T00:01:00Z0.0003493824987647374
t22021-01-01T00:01:10Z12.80532295591006
t22021-01-01T00:01:20Z8.6277743213372
t22021-01-01T00:01:30Z1.4602203332390207

{{% /expand %}} {{< /expand-wrapper >}}

Use the holtWinters fitted model to predict future values

js
import "sampledata"

sampledata.int()
    |> holtWinters(n: 3, interval: 10s, withFit: true)

{{< expand-wrapper >}} {{% expand "View example input and output" %}}

Input data

_time_value*tag
2021-01-01T00:00:00Z-2t1
2021-01-01T00:00:10Z10t1
2021-01-01T00:00:20Z7t1
2021-01-01T00:00:30Z17t1
2021-01-01T00:00:40Z15t1
2021-01-01T00:00:50Z4t1
_time_value*tag
2021-01-01T00:00:00Z19t2
2021-01-01T00:00:10Z4t2
2021-01-01T00:00:20Z-3t2
2021-01-01T00:00:30Z19t2
2021-01-01T00:00:40Z13t2
2021-01-01T00:00:50Z1t2

Output data

*tag_time_value
t12021-01-01T00:00:00Z-2
t12021-01-01T00:00:10Z9.22029445567278
t12021-01-01T00:00:20Z10.72226472734998
t12021-01-01T00:00:30Z11.027320524107417
t12021-01-01T00:00:40Z11.036623518669842
t12021-01-01T00:00:50Z10.993492350404422
t12021-01-01T00:01:00Z10.955056783457671
t12021-01-01T00:01:10Z10.929416756584807
t12021-01-01T00:01:20Z10.913930733843678
*tag_time_value
t22021-01-01T00:00:00Z19
t22021-01-01T00:00:10Z10.099938333168268
t22021-01-01T00:00:20Z3.576640457126943
t22021-01-01T00:00:30Z6.744937158136209
t22021-01-01T00:00:40Z6.79061592841652
t22021-01-01T00:00:50Z6.791265139903281
t22021-01-01T00:01:00Z6.791274364986771
t22021-01-01T00:01:10Z6.791274496071822
t22021-01-01T00:01:20Z6.791274497934493

{{% /expand %}} {{< /expand-wrapper >}}