Is the two-stage training in the code implemented by manually training twice, changing `prompt_format` between runs? I don't see the model being called twice in sequence, i.e., a first call in `QCM-R` format whose output rationale is saved and then folded into the `QCMR-A` input for answer inference.
Moreover, both the rationale R and the answer A already exist in the dataset, and both models start from the same initialization. In that case, what is the purpose of the first `QCM-R` training stage?
Or maybe I misunderstood something?
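To make the question concrete, here is a minimal sketch of the two-stage pipeline as I understand it. All names in it (`build_prompt`, `dummy_generate`, the sample fields) are hypothetical placeholders for illustration, not code from this repository:

```python
# Hypothetical sketch of the two-stage flow; every name here is a
# placeholder, not a function from this repo.

def build_prompt(sample, prompt_format, rationale=None):
    """Assemble the input from Question, Context, and Multiple options."""
    parts = [sample["question"], sample["context"], sample["options"]]
    if prompt_format == "QCMR-A" and rationale is not None:
        # Stage 2: fold the stage-1 rationale into the input.
        parts.append(rationale)
    return "\n".join(parts)

def dummy_generate(model_name, prompt):
    # Stand-in for real seq2seq generation, so the sketch runs end to end.
    return f"<{model_name} output for: {prompt[:20]}...>"

def two_stage_inference(sample):
    # Stage 1 (QCM -> R): a model trained to emit a rationale.
    rationale = dummy_generate("rationale_model",
                               build_prompt(sample, "QCM-R"))
    # Stage 2 (QCMR -> A): a model that consumes the *generated* rationale,
    # not the gold rationale from the dataset, and predicts the answer.
    return dummy_generate("answer_model",
                          build_prompt(sample, "QCMR-A", rationale=rationale))

sample = {"question": "Q?", "context": "C.", "options": "(A) ... (B) ..."}
print(two_stage_inference(sample))
```

The question, then, is where in the code this hand-off from the stage-1 output to the stage-2 input happens, if it happens at all.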