
Conversation

shchur
Collaborator

@shchur shchur commented Apr 23, 2025

Issue #, if available:

Description of changes:

  • Second attempt at [timeseries] Add support for horizon_weight in time series forecasting metrics #5058, this time folding horizon_weight and eval_metric_seasonal_period into properties of the TimeSeriesScorer
  • Add an optional argument horizon_weight: list[float] | None to the TimeSeriesPredictor and all forecasting metrics that allows assigning custom weights to each time step in the forecast horizon when computing the metric.
  • When computing any metric, we first build an array of raw error values of shape [num_items, prediction_length]. We then multiply this array by horizon_weight.reshape(1, prediction_length) before applying the final aggregation step.
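The weighting step described above can be sketched as follows (a minimal illustration with a hypothetical `weighted_mean_error` helper, not the actual AutoGluon implementation):

```python
import numpy as np

def weighted_mean_error(errors, horizon_weight=None):
    """errors: array-like of shape [num_items, prediction_length]."""
    errors = np.asarray(errors, dtype=float)
    if horizon_weight is not None:
        prediction_length = errors.shape[1]
        # reshape to [1, prediction_length] so it broadcasts across items
        w = np.asarray(horizon_weight, dtype=float).reshape(1, prediction_length)
        errors = errors * w
    return float(np.nanmean(errors))
```

With uniform weights (all ones) this reduces to a plain mean of the raw errors.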

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@shchur shchur requested a review from canerturkmen April 23, 2025 14:58
Contributor

@canerturkmen canerturkmen left a comment


Thanks a lot for the great PR, I think this is headed in a great direction. Left some comments here and there.

@@ -93,6 +93,14 @@ class TimeSeriesPredictor:
eval_metric_seasonal_period : int, optional
Seasonal period used to compute some evaluation metrics such as mean absolute scaled error (MASE). Defaults to
``None``, in which case the seasonal period is computed based on the data frequency.
horizon_weight : List[float], optional
Contributor


is [1, 1, 1, 1] more intuitive than [0.25, 0.25, 0.25, 0.25]? When one says weight my mind immediately jumps to something that sums to 1. This is especially apparent if we need lead time: [0, 0, 2, 2] vs [0, 0, .5, .5].

Collaborator Author

@shchur shchur Apr 24, 2025


This is mostly about efficiency and numerical stability rather than being intuitive. We always have a matrix errors of shape [num_items, prediction_length]. There are two ways to apply different weights per item / per time step.

Current approach (no weights = all weights are equal to 1)

If we have an array horizon_weight of shape [1, prediction_length] where values sum up to prediction_length, and an array item_weight of shape [num_items, 1], where values sum up to num_items, the averaging logic is quite simple

def score(errors, horizon_weight=None, item_weight=None):
    # errors: np.ndarray of shape [num_items, prediction_length]
    if horizon_weight is not None:
        errors = errors * horizon_weight  # shape [1, prediction_length]
    if item_weight is not None:
        errors = errors * item_weight  # shape [num_items, 1]
    return np.nanmean(errors)

Alternative approach (no weights = all weights are equal to 1/prediction_length or 1/num_items)

If instead horizon_weight and item_weight each summed up to 1, the code would be a lot less elegant. We would no longer be able to just optionally multiply the errors by some weights; we would need to perform a reduction after each multiplication. We can't use np.average for this reduction since it doesn't support NaN values, so we would need to use np.nansum. The code would look something like

def score(errors, horizon_weight=None, item_weight=None):
    # errors: np.ndarray of shape [num_items, prediction_length]
    if horizon_weight is not None:
        errors = np.nansum(errors * horizon_weight, axis=1)
    else:
        errors = np.nanmean(errors, axis=1)
    # now `errors` has shape [num_items]
    if item_weight is not None:
        return np.nansum(errors * item_weight)
    else:
        return np.nanmean(errors)

This looks less elegant to me.

Also, if the weights are uniform, I suspect that np.nansum(errors * horizon_weight, axis=1) is less numerically stable than np.nanmean(errors, axis=1) (especially if prediction_length is very large).
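To make the comparison concrete: when there are no NaN values, the two conventions produce identical scores (with NaNs they can diverge, since np.nanmean also shrinks the denominator while the normalized np.nansum variant does not). A small check, using illustrative numbers:

```python
import numpy as np

errors = np.array([[1.0, 2.0, 3.0, 4.0],
                   [2.0, 0.0, 1.0, 3.0]])

# Convention in this PR: weights sum to prediction_length (uniform = all ones)
w = np.array([0.0, 0.0, 2.0, 2.0])
score = np.nanmean(errors * w.reshape(1, -1))

# Normalized convention: weights sum to 1, requiring a reduction per axis
w_norm = w / w.sum()
score_norm = np.mean(np.sum(errors * w_norm.reshape(1, -1), axis=1))

assert np.isclose(score, score_norm)  # both equal 2.75 here
```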

Contributor


Sorry for being late to the party. I understand it's similar in spirit to how sample_weight works in most stats software. LGTM.

Collaborator Author


Do you mean that sample_weight usually adds up to 1, or that it usually adds up to len(y)? In sklearn it seems to be the latter (https://scikit-learn.org/stable/modules/generated/sklearn.utils.class_weight.compute_sample_weight.html).
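As a quick sanity check on the sklearn convention (assuming scikit-learn is installed): "balanced" sample weights average to 1, i.e. they sum to len(y) rather than to 1.

```python
import numpy as np
from sklearn.utils.class_weight import compute_sample_weight

y = [0, 0, 0, 1]
w = compute_sample_weight("balanced", y)
# each sample's weight is n_samples / (n_classes * class_count)
assert np.isclose(w.sum(), len(y))
```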

Contributor


I meant the latter.

@shchur shchur force-pushed the embedded-horizon-weight branch from fefbc01 to 0ab8fca Compare April 24, 2025 09:12
@@ -66,18 +77,16 @@ def __call__(
self,
data: TimeSeriesDataFrame,
predictions: TimeSeriesDataFrame,
prediction_length: int = 1,
Collaborator Author

@shchur shchur Apr 24, 2025


One breaking change is that the prediction_length must be set as an attribute of the TimeSeriesScorer.

This, for example, means that changes are required to the code in the tutorials

mse = MeanSquaredError()
mse_score = mse(
  data=test_data,
  predictions=predictions,
  prediction_length=predictor.prediction_length,
  target=predictor.target,
)

to

mse = MeanSquaredError(prediction_length=predictor.prediction_length)
mse_score = mse(
  data=test_data,
  predictions=predictions,
  target=predictor.target,
)

We could still allow passing prediction_length via kwargs here for backward compatibility (so both options above would work). Do you think this makes sense?

def __call__(..., **kwargs):
    prediction_length = kwargs.get("prediction_length", self.prediction_length)
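
A minimal sketch of such a backward-compatible shim (hypothetical class and method names; the real TimeSeriesScorer API may differ), which additionally warns when the legacy keyword is used:

```python
import warnings

class TimeSeriesScorer:
    def __init__(self, prediction_length: int = 1):
        self.prediction_length = prediction_length

    def __call__(self, data, predictions, **kwargs):
        # prefer the constructor attribute, but accept the legacy keyword
        prediction_length = self.prediction_length
        if "prediction_length" in kwargs:
            warnings.warn(
                "Passing `prediction_length` to __call__ is deprecated; "
                "set it in the constructor instead.",
                DeprecationWarning,
            )
            prediction_length = kwargs.pop("prediction_length")
        return self._score(data, predictions, prediction_length)

    def _score(self, data, predictions, prediction_length):
        raise NotImplementedError
```

This way both call styles keep working while nudging users toward the new one.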

Collaborator


would be good to have backward compatibility although I am not sure how many users work directly with metrics.

Collaborator Author


I've added a backward-compatible option to pass the prediction_length here.

@@ -699,19 +692,15 @@ def _score_with_predictions(
self,
data: TimeSeriesDataFrame,
predictions: TimeSeriesDataFrame,
metric: Optional[str] = None,
Collaborator Author


I don't see why an AbstractTimeSeriesModel should be able to score with different metrics (other than its own eval_metric), so I removed this functionality. We only use it in unit tests; in normal usage, scoring with custom metrics is handled by the TimeSeriesTrainer.

Contributor


makes sense. in the future: why should AbstractTimeSeriesModel even be aware of its metric :) ?


Job PR-5084-cbd0df2 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-5084/cbd0df2/index.html

@shchur shchur force-pushed the embedded-horizon-weight branch from cbd0df2 to 669146c Compare April 25, 2025 07:35
@shchur shchur requested a review from abdulfatir April 25, 2025 07:44
Collaborator

@abdulfatir abdulfatir left a comment


Thanks @shchur! Overall looks great to me. Left some comments.


Job PR-5084-669146c is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-5084/669146c/index.html

Collaborator

@abdulfatir abdulfatir left a comment


🚀

Contributor

@canerturkmen canerturkmen left a comment


LGTM!

@canerturkmen canerturkmen added module: timeseries related to the timeseries module enhancement New feature or request labels Apr 25, 2025
@canerturkmen canerturkmen added this to the 1.3 Release milestone Apr 25, 2025
@shchur shchur merged commit 32fcbab into autogluon:master Apr 25, 2025
25 checks passed
@shchur shchur deleted the embedded-horizon-weight branch April 25, 2025 13:32

Job PR-5084-1a0d133 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-5084/1a0d133/index.html
