
Conversation


@abdulfatir abdulfatir commented Nov 3, 2025

Issue #, if available: #354

Description of changes: This PR adds Chronos2Pipeline.embed to enable users to easily extract embeddings from the last encoder layer. The API and behavior are similar to what Chronos and Chronos-Bolt provide.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@abdulfatir abdulfatir added the run-eval Run evaluation CI workflow label Nov 3, 2025
Comment on lines +550 to +561
def encode(
self,
context: torch.Tensor,
context_mask: torch.Tensor | None = None,
group_ids: torch.Tensor | None = None,
future_covariates: torch.Tensor | None = None,
future_covariates_mask: torch.Tensor | None = None,
num_output_patches: int = 1,
future_target: torch.Tensor | None = None,
future_target_mask: torch.Tensor | None = None,
output_attentions: bool = False,
):
Contributor

I wish the diff would be more helpful here: is the body of this simply moved from forward?

Contributor Author

Yes, the first (encoding) portion from forward has been factored out into encode.
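The refactor described above can be sketched in plain Python (a toy illustration with made-up names and trivial math, not the actual Chronos-2 code): forward delegates its first step to encode, and an embed-style helper reuses encode without running the rest of the model.

```python
class ToyModel:
    """Toy stand-in for a model whose forward pass is split into
    an encoder step and a decoder step (illustrative only)."""

    def encode(self, context):
        # Stand-in for the encoder: one "hidden state" per input element.
        return [x * 2.0 for x in context]

    def forward(self, context):
        # forward now delegates the encoding step to encode(),
        # then applies the "decoder" (here: a trivial sum).
        hidden = self.encode(context)
        return sum(hidden)

    def embed(self, context):
        # An embed-style helper returns the last encoder-layer
        # output directly, without running the decoder.
        return self.encode(context)


model = ToyModel()
model.embed([1.0, 2.0])    # encoder output only: [2.0, 4.0]
model.forward([1.0, 2.0])  # full pass: 6.0
```

The benefit of the split is that forward and embed share one code path for encoding, so the two cannot drift apart.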

or 2-dimensional of shape (n_variates, history_length). The history_lengths may be different across elements; left-padding
will be applied, if needed.
batch_size
The batch size used for generating embeddings. Note that the batch size here means the total number of time series which are input into the model.
Contributor

I'm not sure this is clear to me: does the batch_size refer to the .shape[0] of the tensors being processed? Or does it span the variates dimension as well? (.shape[1]) I suppose it's the latter, given the docstring for the dataset class:

batch_size
The batch size for training the model. Note that the batch size here means the number of time series, including target(s) and
covariates, that are input into the model. If your data has multiple target and/or covariates, the effective number of time series
tasks in a batch will be lower than this value.

I see this is pretty much the description of batch_size everywhere (here, predict methods, dataset class). Maybe the confusion comes from "total number of time series" instead of "total number of variates", or something like that. But this could also be addressed separately.

Contributor Author

Internally, there's no notion of a variate dimension in the model: only batch and time (patch) axes. The batch_size here refers to the maximum items x (co)-variates per batch. Open to suggestions on a better docstring.
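Under that reading, the effective number of forecasting tasks per batch shrinks with the number of (co)variates. A minimal sketch of the arithmetic (tasks_per_batch is a hypothetical helper for illustration, not part of the library):

```python
def tasks_per_batch(batch_size: int, n_targets: int, n_covariates: int) -> int:
    """How many forecasting tasks fit in one batch when every
    target and covariate series counts toward batch_size."""
    variates_per_task = n_targets + n_covariates
    return batch_size // variates_per_task


# e.g. batch_size=256 with 1 target and 3 covariates per task:
# only 64 tasks actually fit in a batch.
tasks_per_batch(256, 1, 3)
```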

Contributor Author

@abdulfatir abdulfatir left a comment

Thanks @lostella.
