Training Data

This model was trained on a dataset derived from TorpedoSoftware/Roblox-Luau-Reasoning-v1.0, which is released under the MIT License.

The original authors are not affiliated with or responsible for this model.

Base Model

Base model: Qwen/Qwen3-4B-Instruct-2507

Fine-tuning Method

  • Adapter: DoRA
  • Method: SFT
  • Precision: trained with 4-bit base weights + BF16 compute, then merged to safetensors

Training Details

  • Training time: ~12 hours
  • Hardware: 1x NVIDIA RTX 5060 Ti
Downloads last month
20
Safetensors
Model size
4B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for luminousresearch/Qwen_L0-Luau-4B

Finetuned
(321)
this model

Dataset used to train luminousresearch/Qwen_L0-Luau-4B