Code: https://github.com/gumran/post-training
Alim
gumran
AI & ML interests
None yet
Organizations
models
7
gumran/distilbert-diffusion-TinyStories
Text Generation
•
65.8M
•
Updated
•
13
•
1
gumran/gpt2-large-dpo
Text Generation
•
0.8B
•
Updated
•
4
gumran/gpt2-dpo
Text Generation
•
0.1B
•
Updated
•
8
gumran/gpt2-sft
Text Generation
•
0.1B
•
Updated
•
8
gumran/gpt2-medium-dpo
Text Generation
•
0.4B
•
Updated
•
12
gumran/gpt2-large-sft
Text Generation
•
0.8B
•
Updated
•
13
gumran/gpt2-medium-sft
Text Generation
•
0.4B
•
Updated
•
7
datasets
0
None public yet