add arxiv paper
README.md
CHANGED
@@ -21,7 +21,7 @@ co2_eq_emissions:
 emissions: 23660
 ---

-`dant5-small` is a 60M parameter model with architecture identical to `t5-small`. It was trained for 10 epochs on the Danish GigaWord Corpus ([official website](https://gigaword.dk), [paper](https://aclanthology.org/2021.nodalida-main.46/)).
+`dant5-small` is a 60M parameter model with architecture identical to `t5-small`. Training details are given in the paper [Training a T5 Using Lab-sized Resources](https://arxiv.org/abs/2208.12097). It was trained for 10 epochs on the Danish GigaWord Corpus ([official website](https://gigaword.dk), [paper](https://aclanthology.org/2021.nodalida-main.46/)).

 ## To use the model