AdversarialRLHF
/

rloo_pythia410m_tldr6.9b_rm410mdata

Model card Files Files and versions

rloo_pythia410m_tldr6.9b_rm410mdata / checkpoint-166

4.87 GB

1 contributor

History: 1 commit

Muqeeth's picture

Training in progress, step 166, checkpoint

96d9ca3 verified 10 months ago

config.json
758 Bytes

Training in progress, step 166, checkpoint 10 months ago
generation_config.json
90 Bytes

Training in progress, step 166, checkpoint 10 months ago
model.safetensors
1.62 GB
xet

Training in progress, step 166, checkpoint 10 months ago
optimizer.pt
3.24 GB
xet

Training in progress, step 166, checkpoint 10 months ago
rng_state.pth
14.2 kB
xet

Training in progress, step 166, checkpoint 10 months ago
scheduler.pt
1.06 kB
xet

Training in progress, step 166, checkpoint 10 months ago
special_tokens_map.json
585 Bytes

Training in progress, step 166, checkpoint 10 months ago
tokenizer.json
3.56 MB

Training in progress, step 166, checkpoint 10 months ago
tokenizer_config.json
4.9 kB

Training in progress, step 166, checkpoint 10 months ago
trainer_state.json
119 kB

Training in progress, step 166, checkpoint 10 months ago
training_args.bin
6.52 kB
xet

Training in progress, step 166, checkpoint 10 months ago