AdversarialRLHF
/

rloo_pythia410m_tldr6.9b_rm410mdata

Model card Files Files and versions

rloo_pythia410m_tldr6.9b_rm410mdata / checkpoint-219

4.87 GB

1 contributor

History: 1 commit

Muqeeth's picture

Training in progress, step 219, checkpoint

cb52e32 verified 8 months ago

config.json
758 Bytes

Training in progress, step 219, checkpoint 8 months ago
generation_config.json
90 Bytes

Training in progress, step 219, checkpoint 8 months ago
model.safetensors
1.62 GB
xet

Training in progress, step 219, checkpoint 8 months ago
optimizer.pt
3.24 GB
xet

Training in progress, step 219, checkpoint 8 months ago
rng_state.pth
14.2 kB
xet

Training in progress, step 219, checkpoint 8 months ago
scheduler.pt
1.06 kB
xet

Training in progress, step 219, checkpoint 8 months ago
special_tokens_map.json
585 Bytes

Training in progress, step 219, checkpoint 8 months ago
tokenizer.json
3.56 MB

Training in progress, step 219, checkpoint 8 months ago
tokenizer_config.json
4.9 kB

Training in progress, step 219, checkpoint 8 months ago
trainer_state.json
157 kB

Training in progress, step 219, checkpoint 8 months ago
training_args.bin
6.52 kB
xet

Training in progress, step 219, checkpoint 8 months ago