LLM-JEPA: Large Language Models Meet Joint Embedding Predictive Architectures Paper • 2509.14252 • Published Sep 11 • 4
Qwen3 Collection Qwen's new Qwen3 models, available in Unsloth Dynamic 2.0, GGUF, 4-bit, and 16-bit Safetensor formats, including 128K context length variants. • 79 items • Updated 8 days ago • 240
Dynamic Chunking for End-to-End Hierarchical Sequence Modeling Paper • 2507.07955 • Published Jul 10 • 25