
NLPDL_hw4

Task 2: Training Script

In this task, you have to write a train.py that fine-tunes a Transformer on the dataset prepared in the last task. Here is the reference material you can follow:

transformers/run_glue.py at main · huggingface/transformers

  1. Search the relevant document for HfArgumentParser, understand how to use it, and use it in your train.py. (think: What’s the advantage over argparse?)
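
    A minimal sketch of how this might look (ModelArguments is a hypothetical dataclass for illustration; TrainingArguments is the standard transformers one):

    import sys
    from dataclasses import dataclass, field
    from transformers import HfArgumentParser, TrainingArguments

    @dataclass
    class ModelArguments:
        # hypothetical custom arguments; add whatever your script needs
        model_name_or_path: str = field(default="bert-base-uncased")
        dataset_name: str = field(default="restaurant_sup")

    parser = HfArgumentParser((ModelArguments, TrainingArguments))
    if len(sys.argv) == 2 and sys.argv[1].endswith(".json"):
        # supports `python train.py path/to/args.json`
        model_args, training_args = parser.parse_json_file(json_file=sys.argv[1])
    else:
        model_args, training_args = parser.parse_args_into_dataclasses()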

  2. Search the relevant document for logging, understand how to use it, and use it in your train.py. (think: What’s the advantage over print?)
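
    A minimal setup, similar to what run_glue.py does:

    import logging

    logging.basicConfig(
        format="%(asctime)s - %(levelname)s - %(name)s - %(message)s",
        level=logging.INFO,
    )
    logger = logging.getLogger(__name__)
    logger.info("Training/evaluation parameters %s", training_args)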

  3. Use set_seed in your train.py. (think: Why should we set the random seed?)
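
    For example, reusing the seed field of the parsed TrainingArguments:

    from transformers import set_seed

    set_seed(training_args.seed)  # fixes the Python, NumPy and PyTorch RNGs so runs are reproducible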

  4. Use get_dataset defined in dataHelper.py.

  5. Search the relevant document for AutoConfig, AutoTokenizer and AutoModelForSequenceClassification, understand how to use them, and use them in your train.py.
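
    For example (num_labels is assumed to come from your dataset):

    from transformers import AutoConfig, AutoTokenizer, AutoModelForSequenceClassification

    config = AutoConfig.from_pretrained(model_args.model_name_or_path, num_labels=num_labels)
    tokenizer = AutoTokenizer.from_pretrained(model_args.model_name_or_path)
    model = AutoModelForSequenceClassification.from_pretrained(
        model_args.model_name_or_path,
        config=config,
    )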

  6. Use datasets.map and the tokenizer to process the dataset, converting the raw text strings into tokenized ids.
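
    A sketch, assuming raw_datasets is what get_dataset returned and that it has a "text" column:

    def tokenize_fn(examples):
        # truncate to the model's max length; padding is deferred to the data collator (step 8)
        return tokenizer(examples["text"], truncation=True)

    tokenized_datasets = raw_datasets.map(tokenize_fn, batched=True)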

  7. Search the relevant document for evaluate (the Hugging Face library), understand how to use it, and use it in your train.py (you need to compute micro_f1, macro_f1 and accuracy).
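
    One way to wire this up as a compute_metrics callback for the Trainer:

    import evaluate
    import numpy as np

    accuracy_metric = evaluate.load("accuracy")
    f1_metric = evaluate.load("f1")

    def compute_metrics(eval_pred):
        logits, labels = eval_pred
        preds = np.argmax(logits, axis=-1)
        return {
            "accuracy": accuracy_metric.compute(predictions=preds, references=labels)["accuracy"],
            "micro_f1": f1_metric.compute(predictions=preds, references=labels, average="micro")["f1"],
            "macro_f1": f1_metric.compute(predictions=preds, references=labels, average="macro")["f1"],
        }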

  8. Search the relevant docs for DataCollatorWithPadding, understand how to use it and what it does, and use it in your train.py. (think: Are there any other data collators you can use?)
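
    For example:

    from transformers import DataCollatorWithPadding

    # pads each batch to its longest sequence instead of one global maximum length
    data_collator = DataCollatorWithPadding(tokenizer=tokenizer)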

  9. Understand Trainer; you can also look into its source code. Use Trainer in your train.py.
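
    Putting the previous steps together (the "train"/"test" split names are assumptions about your dataset):

    from transformers import Trainer

    trainer = Trainer(
        model=model,
        args=training_args,
        train_dataset=tokenized_datasets["train"],
        eval_dataset=tokenized_datasets["test"],
        tokenizer=tokenizer,
        data_collator=data_collator,
        compute_metrics=compute_metrics,
    )

    if training_args.do_train:
        trainer.train()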

  10. Clean up all the redundant code in your train.py. If you copy code directly from huggingface/transformers/run_glue.py above, there will be loads of redundancy. You must make the code really clean.

  11. Write comments in train.py that mark out each stage. For example,

    '''
    	initialize logging, seed, argparse...
    '''
    # your code for init relevant components...
    
    '''
    	load datasets
    '''
    # your code for loading dataset...
    
    '''
    	load models
    '''
    # your code for loading model...
    
    '''
    	process datasets and build up datacollator
    '''
    # your code for processing datasets
    
    trainer = Trainer(...) # build up trainer
    
    # training!
    # your codes for training...
    
    # ...
  12. Use [wandb](http://wandb.ai) to track your experiments. It's a fantastic tool, and it integrates easily with Hugging Face; see https://docs.wandb.ai/guides/integrations/huggingface.
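
    With the Trainer integration you mostly just opt in, e.g. (the project name below is a hypothetical example):

    import os

    os.environ["WANDB_PROJECT"] = "NLPDL_hw4"  # hypothetical wandb project name
    training_args.report_to = ["wandb"]        # tells the Trainer to log metrics to wandb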

  13. Run your train.py! The settings are as follows.

    1. Use roberta-base, bert-base-uncased and allenai/scibert_scivocab_uncased as your pre-trained models.
    2. Use restaurant_sup, acl_sup and agnews_sup as your datasets.
    3. To make the results reliable, you need to run the same experiments several times (~5 runs) and report the standard deviation.

    Adjust the batch size and number of epochs so that your results converge stably. Write a small report showing the learning curves (metrics and loss during training; you can copy these from wandb), results and configurations. Compare and analyze the results across the different models and datasets in your report.

My Solution

Final report: final report

wandb results: wandb results

To train and evaluate models on the datasets, we can use:

# train
python train.py training_args/train_{modelname}_{datasetname}.json

# eval
python train.py training_args/eval_{modelname}_{datasetname}.json
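
Each JSON file holds the arguments that HfArgumentParser reads via parse_json_file. A hypothetical file for roberta-base on restaurant_sup could look like this (model_name_or_path and dataset_name come from the hypothetical ModelArguments sketched in step 1; the rest are standard TrainingArguments fields):

{
    "model_name_or_path": "roberta-base",
    "dataset_name": "restaurant_sup",
    "output_dir": "./ckpts/roberta-base_restaurant_sup",
    "do_train": true,
    "num_train_epochs": 5,
    "per_device_train_batch_size": 32,
    "seed": 42,
    "report_to": "wandb"
}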
