We should probably implement a calibration plot: users are asking for it, and it already existed in mlr2. See https://stackoverflow.com/questions/66819169/how-to-draw-a-calibration-plot-of-a-binary-classifier-in-mlr3