Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,13 @@
|
|
| 1 |
---
|
| 2 |
license: mit
|
| 3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
license: mit
|
| 3 |
---
|
| 4 |
+
|
| 5 |
+
LoRA weights for `LLaMA-7B`
|
| 6 |
+
|
| 7 |
+
Trained on 'taesiri/webnlg-triplets-explanation-v1' for 4 epochs.
|
| 8 |
+
|
| 9 |
+
Command:
|
| 10 |
+
|
| 11 |
+
```
|
| 12 |
+
WORLD_SIZE=2 CUDA_VISIBLE_DEVICES=0,1 torchrun --nproc_per_node=2 --master_port=1234 finetune.py --base_model='decapoda-research/llama-7b-hf' --data_path 'taesiri/webnlg-triplets-explanation-v1' --num_epochs=4 --cutoff_len=512 --group_by_length --lora_target_modules='[q_proj,k_proj,v_proj,o_proj]' --lora_r=8 --micro_batch_size=8 --batch_size=32
|
| 13 |
+
```
|