Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4.6
TFLOPS
Tanay
PRO
Tanaybh
Follow
caelancooper's profile picture
John6666's profile picture
upgraedd's profile picture
10 followers
·
5 following
tanaybhardwaj
AI & ML interests
Exploring RLHF/RLAIF techniques, LoRA adapters, and dialogue optimization. Building models that better understand and respond to human intent
Recent Activity
updated
a model
about 2 months ago
Tanaybh/microllm-v1
published
a model
about 2 months ago
Tanaybh/microllm-v1
updated
a model
about 2 months ago
Tanaybh/gpt-rope-swiglu
View all activity
Organizations
Tanaybh
's models
9
Sort: Recently updated
Tanaybh/microllm-v1
Updated
Oct 20
•
6
Tanaybh/gpt-rope-swiglu
7.88M
•
Updated
Oct 17
•
5
Tanaybh/nano-gpt-from-scratch
Text Generation
•
1.07M
•
Updated
Oct 5
•
10
Tanaybh/gpt2-rlhf-anthropic
Text Generation
•
0.1B
•
Updated
Oct 2
•
11
Tanaybh/gpt2-got-therapy
Text Generation
•
0.1B
•
Updated
Sep 30
•
10
Tanaybh/bipedal-walker-ppo
Reinforcement Learning
•
Updated
Sep 21
•
14
Tanaybh/lunar-lander-ppo
Reinforcement Learning
•
Updated
Sep 21
•
11
Tanaybh/my-first-lora-trash-model
Updated
Sep 3
•
1
Tanaybh/dialogpt-medium-qlora-alpaca
Updated
Sep 3
•
2