view post Post 385 Check out your 2025 Hugging Face Wrapped, a small experimental recap hf-wrapped/2025 See translation 1 reply Β· π€ 4 4 + Reply
view post Post 378 PatchDNA, a DNA foundation model based on Meta's BLT tokenization strategy https://www.biorxiv.org/content/10.1101/2025.11.28.691095v1 See translation π 1 1 + Reply
view post Post 2459 MLEB is the largest, most diverse, and most comprehensive benchmark for legal text embedding models. https://huggingface.co/blog/isaacus/introducing-mleb See translation π 5 5 π₯ 4 4 β€οΈ 4 4 β 3 3 π€ 3 3 π 3 3 π§ 3 3 π€― 3 3 + Reply
METAGENE-1: Metagenomic Foundation Model for Pandemic Monitoring Paper β’ 2501.02045 β’ Published Jan 3 β’ 23
view post Post 456 Bio LLMs train on many genomes, but can we encode differences within a species? TomatoTomato adds pangenome tokens to represent a domestic tomato and a wild tomato in one sequence π 𧬠monsoon-nlp/tomatotomato-gLM2-150M-v0.1 See translation π 1 1 + Reply