Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 12 days ago • 473
Article I Let a Lobster Run My Jetson: What OpenClaw Taught Me About the Future of Computing 12 days ago • 15
Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling 19 days ago • 47
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 Mar 12, 2025 • 489
Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub +2 Feb 12, 2025 • 80
Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference Jan 16, 2025 • 76
Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15, 2025 • 226
Cosmos Collection ⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/nvidia-cosmos-2 • 14 items • Updated about 3 hours ago • 300
Article Releasing Outlines-core 0.1.0: structured generation in Rust and Python +5 Oct 22, 2024 • 44