Small causal language models trained to evaluate sample efficiency.
Daniel Christoph
J4bb4wukis
models 9
J4bb4wukis/llama_208m_wikipedia_en_shuffeld
0.2B • Updated
J4bb4wukis/llama_360m_wikipedia_en_shuffeld
0.4B • Updated
J4bb4wukis/xlstm_406m_wikipedia_en_shuffeld
0.4B • Updated • 3
J4bb4wukis/mamba2_432m_wikipedia_en_shuffeld
0.4B • Updated
J4bb4wukis/gpt2_355m_wikipedia_en_shuffeld
0.4B • Updated
J4bb4wukis/gpt2_209m_wikipedia_en_shuffeld
0.2B • Updated • 1
J4bb4wukis/gpt2_124m_wikipedia_en_shuffeld
0.1B • Updated
J4bb4wukis/xlstm_247m_wikipedia_en_shuffeld
0.2B • Updated • 2
J4bb4wukis/mamba2_172m_wikipedia_en_shuffeld
0.2B • Updated • 1