Avvvvva/M3-PairRM

Transformers
Safetensors
Generated from Trainer
trl
dpo

You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Gated model
You can list files but not access them

Preview of files found in this repository (last commit for every entry: "End of training", about 1 year ago)
  • dpo_checkpoint
  • .gitattributes, 1.64 kB
  • README.md, 2.63 kB
  • adapter_config.json, 736 Bytes
  • adapter_model.safetensors, 1.15 GB (xet)
  • special_tokens_map.json, 439 Bytes
  • tokenizer.json, 17.2 MB (xet)
  • tokenizer_config.json, 54.6 kB
  • training_args.bin, 6.07 kB (xet)