Open to Collab

Shyam Sunder Kumar

theainerd

AI & ML interests

Natural Language Processing

Recent Activity

liked a model 16 days ago

google/translategemma-4b-it

upvoted a collection 16 days ago

liked a model 18 days ago

kyutai/pocket-tts

upvoted a collection 16 days ago

TranslateGemma

3 items • Updated 17 days ago • 202

upvoted a collection 20 days ago

Health AI Developer Foundations (HAI-DEF)

Groups models released for use in health AI by Google. Read more about HAI-DEF at http://goo.gle/hai-def • 22 items • Updated 20 days ago • 179

upvoted 2 collections about 2 months ago

VibeVoice

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 9 items • Updated 11 days ago • 206

Bhojpuri and Hindi Rural Women ASR

This dataset includes ASR data from rural women speaking Hindi and Bhojpuri, supporting inclusive voice recognition. • 2 items • Updated Nov 6, 2025 • 1

upvoted 2 collections 2 months ago

Inference Optimized Checkpoints (with Model Optimizer)

A collection of generative models quantized and optimized for inference with Model Optimizer. • 51 items • Updated 1 day ago • 79

Mistral Large 3

A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated Dec 2, 2025 • 87

upvoted an article 2 months ago

Continuous batching from first principles

Article • Published Nov 25, 2025 • 314

upvoted a paper 3 months ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 133

upvoted 2 collections 3 months ago

Kimi-K2

Moonshot's MoE LLMs with 1 trillion parameters, exceptional at agentic intelligence • 5 items • Updated 6 days ago • 166

🎆 October 2025 - China Open Source Highlights

29 items • Updated 23 days ago • 13

upvoted 2 papers 3 months ago

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper • 2510.25602 • Published Oct 29, 2025 • 78

The Principles of Diffusion Models

Paper • 2510.21890 • Published Oct 24, 2025 • 62

upvoted 2 collections 3 months ago

Nemotron RAG

Set of tools to build retrieval-augmented generation (RAG) systems, improve search and ranking accuracy, and extract structured data from complex docs • 11 items • Updated 3 days ago • 66

gpt-oss-safeguard

gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built upon gpt-oss • 2 items • Updated Oct 29, 2025 • 60

upvoted a paper 3 months ago

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published Oct 20, 2025 • 68

upvoted an article 3 months ago

Building the Open Agent Ecosystem Together: Introducing OpenEnv

Article • Published Oct 23, 2025 • 148

upvoted a collection 3 months ago

🎯 Liquid Nanos

Library of task-specific models: https://www.liquid.ai/blog/introducing-liquid-nanos-frontier-grade-performance-on-everyday-devices • 26 items • Updated 5 days ago • 107

upvoted an article 4 months ago

Smol2Operator: Post-Training GUI Agents for Computer Use

Article • Published Sep 23, 2025 • 135

upvoted 2 collections 4 months ago

GLM-4.6

7 items • Updated Nov 5, 2025 • 52

DeepSeek-V3.2

4 items • Updated Dec 1, 2025 • 522