How can we incorporate the choice of which data (test/train/both) of a task to use for calculation of a measure?

So now when we do:

```
re = resample(task, learner, resampling)
re$score(msr("surv.brier"))
```

and the measure takes the arguments `task` and `train_set` as [in here](https://github.com/mlr-org/mlr3proba/blob/main/R/MeasureSurvGraf.R#L88), the `.score` function will have access to the training dataset to perform some estimation, eg in the survival analysis usually we estimate the censoring distribution via Kaplan-Meier, $G(t)$. In the non-resampling case, if the `train` and `train_set` are not used, the test data will be used for such purposes.

We now have evidence that the choice of data (train / test / both) that is used to calculate $G(t)$ [paper link](https://pubmed.ncbi.nlm.nih.gov/39888901/) can influence positively or negatively the score, so it would be nice to have a more general way to say "apply this score and estimate some quantites that are required by the score using only the test set or train set or both" during resampling (and non-resampling) schemes. Note that this is not related to which observations the score is calculated for (use of `predict_sets`).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

How can we incorporate the choice of which data (test/train/both) of a task to use for calculation of a measure? #1333

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

How can we incorporate the choice of which data (test/train/both) of a task to use for calculation of a measure? #1333

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions