https://github.com/thu-ml/tianshou/blob/c19876179a52acdc4d2e7dcb10dfbb4557adca86/tianshou/policy/modelfree/qrdqn.py#L82 mean(-1).sum(1)