13 7 118

Vineet Sharma

vineetsharma

AI & ML interests

Generative AI, Computer Vision, Natural Language Processing, Reinforcement Learning

Recent Activity

liked a model 19 days ago

nvidia/Alpamayo-R1-10B

liked a model 3 months ago

nvidia/omnivinci

upvoted a collection 5 months ago

FastVLM

View all activity

Organizations

upvoted a collection 5 months ago

FastVLM

Collection

Efficient Vision Encoding for Vision Language Models • 9 items • Updated Sep 2, 2025 • 106

upvoted an article 5 months ago

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

Jun 3, 2025

•

313

upvoted a collection 5 months ago

VibeVoice

Collection

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 9 items • Updated 4 days ago • 199

upvoted an article 8 months ago

Article

Vision Language Models (Better, faster, stronger)

May 12, 2025

•

591

upvoted an article 12 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4, 2025

•

1.32k

upvoted an article over 1 year ago

Article

Introduction to 3D Gaussian Splatting

Sep 18, 2023

•

127

upvoted a paper about 2 years ago

AppAgent: Multimodal Agents as Smartphone Users

Paper • 2312.13771 • Published Dec 21, 2023 • 54

Vineet Sharma

AI & ML interests

Recent Activity

Organizations

vineetsharma's activity

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

Vision Language Models (Better, faster, stronger)

Open-source DeepResearch – Freeing our search agents

Introduction to 3D Gaussian Splatting