Description
It should be possible to optimize predict-phase parameters efficiently, perhaps even simultaneously with (ordinary) train-time parameters, so that the predict-phase parameters are optimized in an inner loop separate from the train-time parameters.
This was our resolution for #50 (see #50 (comment)), but apparently we don't have an issue that tracks this specifically.
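
To make the inner-loop idea above concrete, here is a minimal sketch in plain Python. It is not this project's API: `fit`, `predict`, `nested_search`, the `shrink` train-time parameter and the `threshold` predict-phase parameter are all hypothetical names chosen for illustration. The point it tries to show is that the model is fitted once per train-time setting, and the predict-phase parameter is then swept without refitting.

```python
def fit(xs, ys, shrink):
    """Train-time step (hypothetical): least-squares slope through the
    origin, with `shrink` added to the denominator as a ridge-style penalty."""
    num = sum(x * y for x, y in zip(xs, ys))
    den = sum(x * x for x in xs) + shrink
    return num / den


def predict(slope, xs, threshold):
    """Predict-phase step (hypothetical): threshold the linear score."""
    return [1 if slope * x >= threshold else 0 for x in xs]


def accuracy(pred, ys):
    return sum(p == y for p, y in zip(pred, ys)) / len(ys)


def nested_search(train, valid, shrink_grid, threshold_grid):
    """Outer loop over train-time parameters, inner loop over
    predict-phase parameters. Returns the best (score, shrink, threshold)
    and the number of fits that were needed."""
    xs_tr, ys_tr = train
    xs_va, ys_va = valid
    best = None
    n_fits = 0
    for shrink in shrink_grid:
        slope = fit(xs_tr, ys_tr, shrink)  # expensive step, done once
        n_fits += 1
        for threshold in threshold_grid:  # cheap step, reuses the fit
            score = accuracy(predict(slope, xs_va, threshold), ys_va)
            if best is None or score > best[0]:
                best = (score, shrink, threshold)
    return best, n_fits
```

With this structure the number of fits equals `len(shrink_grid)`, whereas a flat grid over both parameters that refits for every combination would need `len(shrink_grid) * len(threshold_grid)` fits.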