GGUF

GGUF quantizations are available here: https://huggingface.co/mradermacher/Llama_L0-Luau-1B-GGUF

Training Data

This model was trained on a dataset derived from TorpedoSoftware/Roblox-Luau-Reasoning-v1.0, which is released under the MIT License.

The original authors are not affiliated with or responsible for this model.

Base Model

Base model: meta-llama/Llama-3.2-1B-Instruct

Fine-tuning Method

  • Adapter: DoRA
  • Method: SFT
  • Precision: trained with 4-bit base weights + BF16 compute, then merged to safetensors

Training Details

  • Training time: ~12 hours
  • Hardware: 1x NVIDIA RTX 5060 Ti
Downloads last month
49
Safetensors
Model size
1B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for luminousresearch/Llama_L0-Luau-1B

Finetuned
(1218)
this model
Quantizations
2 models

Dataset used to train luminousresearch/Llama_L0-Luau-1B