Update README.md

README.md

This is a Llama3 8B based model trained using [torchtune](https://pytorch.org/torchtune).
### Training details

The exact training script (`lora_finetune_distributed`) and config (`8B_lora.yaml`) are both included in this repository.
**Training command**: `tune run --nproc_per_node 8 lora_finetune_distributed --config 8B_lora.yaml`

> Yes, I used 8 GPUs :)
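If you don't have 8 GPUs, upstream torchtune also ships a single-device LoRA recipe; something like `tune run lora_finetune_single_device --config llama3/8B_lora_single_device` should be the equivalent starting point (recipe and config names come from upstream torchtune, not from this repository, so treat this as a sketch). Either way, the base Llama3 8B weights need to be downloaded locally first, e.g. with torchtune's `tune download` command.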
In order to add the dataset, I added the following lines to the config:

```
dataset:
  ...
  train_on_input: False
  split: train
```
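For context, a filled-in `dataset:` section in a torchtune config usually looks something like the sketch below. Only `train_on_input: False` and `split: train` are visible in this diff; the `_component_` builder and `source` here are illustrative assumptions, not the repository's actual values:

```
dataset:
  _component_: torchtune.datasets.instruct_dataset  # assumed builder; the actual config may differ
  source: <hf-dataset-id>                           # placeholder for a Hugging Face dataset id
  train_on_input: False
  split: train
```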
**Loss curve**

![loss curve](loss_curve.png)

### Evaluation results