Commit
·
df6d92a
1
Parent(s):
09b1fd3
Update README.md
Browse files
README.md
CHANGED
|
@@ -20,8 +20,9 @@ Model Card for Loquace-7B
|
|
| 20 |
|
| 21 |
An exclusively Italian speaking, instruction finetuned, Large Language model. 🇮🇹
|
| 22 |
|
| 23 |
-
The Loquace Italian LLM models
|
| 24 |
-
of a specific language.
|
|
|
|
| 25 |
|
| 26 |
The QLoRa (https://github.com/artidoro/qlora) method of fine-tuning significantly lower the resources requirements compared to any other methods available,
|
| 27 |
this allow to easily execute the process on significanly larger dataset while still using consumers GPUs and still achieve high accuracy.
|
|
@@ -29,7 +30,7 @@ this allow to easily execute the process on significanly larger dataset while st
|
|
| 29 |
## Model Description
|
| 30 |
|
| 31 |
Loquace-7B is the first 7B italian Large Language Model trained using QLoRa on a large dataset of 102k question/answer pairs
|
| 32 |
-
exclusively in Italian and that uses Falcon-7B model as base.
|
| 33 |
|
| 34 |
The related code can be found at:
|
| 35 |
https://github.com/cosimoiaia/Loquace
|
|
|
|
| 20 |
|
| 21 |
An exclusively Italian speaking, instruction finetuned, Large Language model. 🇮🇹
|
| 22 |
|
| 23 |
+
The Loquace Italian LLM models are created as a proof-of-concept to evaluate on how language tuning can be achieved using QLoRa by instruct-tunings foundational LLMs
|
| 24 |
+
using dataset of a specific language.
|
| 25 |
+
|
| 26 |
|
| 27 |
The QLoRa (https://github.com/artidoro/qlora) method of fine-tuning significantly lower the resources requirements compared to any other methods available,
|
| 28 |
this allow to easily execute the process on significanly larger dataset while still using consumers GPUs and still achieve high accuracy.
|
|
|
|
| 30 |
## Model Description
|
| 31 |
|
| 32 |
Loquace-7B is the first 7B italian Large Language Model trained using QLoRa on a large dataset of 102k question/answer pairs
|
| 33 |
+
exclusively in Italian and that uses Falcon-7B model as base, the most accurate model of it's class.
|
| 34 |
|
| 35 |
The related code can be found at:
|
| 36 |
https://github.com/cosimoiaia/Loquace
|