lomahony's Collections

pythia-helpful-epoch2

updated Mar 12, 2024

Pythia-2.8b, supervised fine-tuned and DPO fine-tuned for a second epoch on the helpful subset of the Anthropic-hh-rlhf dataset.

  • lomahony/pythia-2.8b-helpful-sft-epoch2

    Text Generation • 3B • Updated Mar 6, 2024 • 7

  • lomahony/pythia-1b-helpful-sft-epoch2

    Text Generation • 1B • Updated Mar 6, 2024 • 1

  • lomahony/pythia-1.4b-helpful-sft-epoch2

    Text Generation • 1B • Updated Mar 6, 2024 • 1

  • lomahony/pythia-410m-helpful-sft-epoch2

    Text Generation • 0.4B • Updated Mar 6, 2024 • 1

  • lomahony/pythia-70m-helpful-sft-epoch2

    Text Generation • 70.4M • Updated Mar 6, 2024 • 1

  • lomahony/pythia-160m-helpful-sft-epoch2

    Text Generation • 0.2B • Updated Mar 6, 2024 • 6
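
The checkpoints listed above share one naming pattern, so a small helper can build the repository ID for a given size and load it. This is a minimal sketch, not the author's documented usage: it assumes the `transformers` library (with PyTorch) is installed, that each repository ships its own tokenizer files, and that the checkpoints load through the generic `AutoModelForCausalLM` class. The helper names `repo_id` and `generate` are hypothetical.

```python
# Sizes that appear in this collection's model list.
SIZES = ["70m", "160m", "410m", "1b", "1.4b", "2.8b"]


def repo_id(size: str) -> str:
    """Build the Hub repository ID for one of the listed model sizes."""
    if size not in SIZES:
        raise ValueError(f"unknown size {size!r}; expected one of {SIZES}")
    return f"lomahony/pythia-{size}-helpful-sft-epoch2"


def generate(size: str, prompt: str, max_new_tokens: int = 64) -> str:
    """Download a checkpoint and generate a continuation of `prompt`.

    Assumption: the repository includes tokenizer files. If it does not,
    a tokenizer from the matching base Pythia model may be needed instead.
    """
    # Imported lazily so the helper above works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    name = repo_id(size)
    tokenizer = AutoTokenizer.from_pretrained(name)
    model = AutoModelForCausalLM.from_pretrained(name)

    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

A call such as `generate("70m", "\n\nHuman: How do I boil an egg?\n\nAssistant:")` would fetch the smallest checkpoint. The `Human:`/`Assistant:` framing mirrors the dialogue format of the Anthropic-hh-rlhf data and is assumed here rather than confirmed by the collection page.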