Tanay's picture

Tanay PRO

Tanaybh

·

tanaybhardwaj

AI & ML interests

Exploring RLHF/RLAIF techniques, LoRA adapters, and dialogue optimization. Building models that better understand and respond to human intent

Recent Activity

updated a model about 2 months ago

Tanaybh/microllm-v1

published a model about 2 months ago

Tanaybh/microllm-v1

updated a model about 2 months ago

Tanaybh/gpt-rope-swiglu

View all activity

Organizations

Tanaybh 's models 9

Tanaybh/microllm-v1

Updated Oct 20 • 6

Tanaybh/gpt-rope-swiglu

7.88M • Updated Oct 17 • 5

Tanaybh/nano-gpt-from-scratch

Text Generation • 1.07M • Updated Oct 5 • 10

Tanaybh/gpt2-rlhf-anthropic

Text Generation • 0.1B • Updated Oct 2 • 11

Tanaybh/gpt2-got-therapy

Text Generation • 0.1B • Updated Sep 30 • 10

Tanaybh/bipedal-walker-ppo

Reinforcement Learning • Updated Sep 21 • 14

Tanaybh/lunar-lander-ppo

Reinforcement Learning • Updated Sep 21 • 11

Tanaybh/my-first-lora-trash-model

Updated Sep 3 • 1

Tanaybh/dialogpt-medium-qlora-alpaca

Updated Sep 3 • 2