You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Merge pull request #546 from instructlab/mergify/bp/release-v0.10/pr-455
moves deepspeed requirements into their own file; add deepspeed extras (backport #455)
Merge pull request #451 from instructlab/dependabot/github_actions/aw…
…s-actions/configure-aws-credentials-4.1.0
build(deps): Bump aws-actions/configure-aws-credentials from 4.0.2 to 4.1.0
fix: model.forward now accepts return_dict via kwargs
Since huggingface/transformers#36794 it handles
the return_dict with can_return_tuple decorator that expects the
argument to be passed as kwarg.
Without the patch, instructlab functional tests fail with:
AttributeError: 'bool' object has no attribute 'unsqueeze'
This is because the passed return_dict argument is incorrectly
interpreted as cache_position.
Of course, the way the model forward method is overridden here is quite
problematic and would benefit from a refactor. This patch is a stop-gap
to fix the CI with minimal changes.
Signed-off-by: Ihar Hrachyshka <ihar.hrachyshka@gmail.com>