这是indexloc提供的服务,不要输入任何密码
Skip to content

unfreeze_layer_from_past parameter  #25

@JhonDan1999

Description

@JhonDan1999

Nice repo!!!

it seems that the default parameter for the policy will freeze all the layers of the language model we are using and just update the lm_head
I tried the provided example of flan-T5 here: https://colab.research.google.com/drive/1DYHt0mi6cyl8ZTMJEkMNpsSZCCvR4jM1?usp=sharing

when I changed the value unfreeze_layer_from_past to be 1 to update the wights of the final layer of flan-t5 like this:
Screenshot 2023-09-20 at 1 04 45 PM

the behavior change the the actor starts to generate empty text:
Screenshot 2023-09-20 at 1 08 58 PM

Also after training it gave me empty text:

Screenshot 2023-09-20 at 1 09 50 PM

what is the reason of the this behavior?

NOTE: I did not change anything else in the flan-t5 code example.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions