-
PLaMo-100B: A Ground-Up Language Model Designed for Japanese Proficiency
Paper β’ 2410.07563 β’ Published β’ 2 -
LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs
Paper β’ 2407.03963 β’ Published β’ 19 -
Tagengo: A Multilingual Chat Dataset
Paper β’ 2405.12612 β’ Published β’ 3 -
Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing Japanese Language Capabilities
Paper β’ 2404.17790 β’ Published β’ 5
Kaito Sugimoto
kaisugi
AI & ML interests
Japanese LLMs
Recent Activity
reacted
to
leonardlin's
post
with π₯
about 17 hours ago
We just released our latest Shisa V2.1 Japanese multi-lingual models: https://huggingface.co/collections/shisa-ai/shisa-v21
Besides updates to our 14B, and 70B, we have a new LFM2-based 1.2B, Llama 3.2-based 3B, and Qwen 3-based 8B, all with class-leading Japanese language capabilities.
Per usual, lots of details in the Model Cards for those interested.
liked
a model
12 days ago
YYama0/CT-JMedRoBERTa
liked
a dataset
3 months ago
nvidia/Nemotron-Personas-Japan