Post-Trained GPT-2 Code: https://github.com/gumran/post-training Instruction-Tuned GPT-2 Collection Code: https://github.com/gumran/post-training • 3 items • Updated Jun 22, 2025 Preference-Tuned GPT-2 Collection Code: https://github.com/gumran/post-training • 3 items • Updated Jun 23, 2025
Instruction-Tuned GPT-2 Collection Code: https://github.com/gumran/post-training • 3 items • Updated Jun 22, 2025
Preference-Tuned GPT-2 Collection Code: https://github.com/gumran/post-training • 3 items • Updated Jun 23, 2025
Instruction-Tuned GPT-2 Code: https://github.com/gumran/post-training gumran/gpt2-sft Text Generation • 0.1B • Updated Jun 12, 2025 • 7 gumran/gpt2-medium-sft Text Generation • 0.4B • Updated Jun 12, 2025 • 10 gumran/gpt2-large-sft Text Generation • 0.8B • Updated Jun 12, 2025 • 5
Preference-Tuned GPT-2 Code: https://github.com/gumran/post-training gumran/gpt2-dpo Text Generation • 0.1B • Updated Jun 22, 2025 • 7 gumran/gpt2-medium-dpo Text Generation • 0.4B • Updated Jun 12, 2025 • 14 gumran/gpt2-large-dpo Text Generation • 0.8B • Updated Jun 23, 2025 • 1
Post-Trained GPT-2 Code: https://github.com/gumran/post-training Instruction-Tuned GPT-2 Collection Code: https://github.com/gumran/post-training • 3 items • Updated Jun 22, 2025 Preference-Tuned GPT-2 Collection Code: https://github.com/gumran/post-training • 3 items • Updated Jun 23, 2025
Instruction-Tuned GPT-2 Collection Code: https://github.com/gumran/post-training • 3 items • Updated Jun 22, 2025
Preference-Tuned GPT-2 Collection Code: https://github.com/gumran/post-training • 3 items • Updated Jun 23, 2025
Preference-Tuned GPT-2 Code: https://github.com/gumran/post-training gumran/gpt2-dpo Text Generation • 0.1B • Updated Jun 22, 2025 • 7 gumran/gpt2-medium-dpo Text Generation • 0.4B • Updated Jun 12, 2025 • 14 gumran/gpt2-large-dpo Text Generation • 0.8B • Updated Jun 23, 2025 • 1
Instruction-Tuned GPT-2 Code: https://github.com/gumran/post-training gumran/gpt2-sft Text Generation • 0.1B • Updated Jun 12, 2025 • 7 gumran/gpt2-medium-sft Text Generation • 0.4B • Updated Jun 12, 2025 • 10 gumran/gpt2-large-sft Text Generation • 0.8B • Updated Jun 12, 2025 • 5