Open
Description
i am pretty sure, all of the HB formulas dont hold anymore if the tuned algo does not scale linear in runtime with eta.
this is especially true if we use the subsampling trick. we at least need to document this, but also maybe adapt a bit.
this is a more complicated issue, and needs to be discussed in the team
of course, would be great if people already post observations and thoughts here