Training Data

This model was trained on a dataset derived from TorpedoSoftware/Roblox-Luau-Reasoning-v1.0, which is released under the MIT License.

The original authors are not affiliated with or responsible for this model.

Base Model

Adapter: DoRA
Method: SFT
Precision: trained with 4-bit base weights + BF16 compute, then merged to safetensors

Safetensors

Model size

4B params

Tensor type

BF16

Base model

Finetuned

(1050)

this model