Update README.md
Browse files
README.md
CHANGED
|
@@ -8,53 +8,49 @@ tags:
|
|
| 8 |
---
|
| 9 |
# Model Catalogue
|
| 10 |
|
| 11 |
-
|
| 12 |
-
- Contains many boutique AI models
|
| 13 |
-
- Still a work in progress
|
| 14 |
|
| 15 |
-
## Pretrained
|
|
|
|
|
|
|
|
|
|
| 16 |
|
| 17 |
-
English models were finetuned on a subset of [Zyphra/Zyda-2](https://huggingface.co/datasets/Zyphra/Zyda-2).
|
| 18 |
- [snowflake-arctic-embed-xs-zyda-2](https://huggingface.co/agentlans/snowflake-arctic-embed-xs-zyda-2)
|
| 19 |
- [deberta-v3-xsmall-zyda-2](https://huggingface.co/agentlans/deberta-v3-xsmall-zyda-2)
|
| 20 |
- [deberta-v3-base-zyda-2](https://huggingface.co/agentlans/deberta-v3-base-zyda-2)
|
| 21 |
|
| 22 |
-
Multilingual
|
|
|
|
|
|
|
| 23 |
- [multilingual-e5-small-aligned](https://huggingface.co/agentlans/multilingual-e5-small-aligned)
|
| 24 |
- [distilbert-base-multilingual-cased-aligned](https://huggingface.co/agentlans/distilbert-base-multilingual-cased-aligned)
|
| 25 |
|
| 26 |
-
## Text
|
| 27 |
|
| 28 |
-
|
| 29 |
-
- **Output:** number
|
| 30 |
|
| 31 |
-
|
|
| 32 |
-
|
| 33 |
-
| deberta-v3-xsmall-zyda-2
|
| 34 |
-
| deberta-v3-base-zyda-2
|
| 35 |
| multilingual-e5-small-aligned | Multilingual | [Link](https://huggingface.co/agentlans/multilingual-e5-small-aligned-quality) | [Link](https://huggingface.co/agentlans/multilingual-e5-small-aligned-readability) | [Link](https://huggingface.co/agentlans/multilingual-e5-small-aligned-sentiment) |
|
| 36 |
-
| mdeberta-v3-base
|
| 37 |
|
| 38 |
-
Note
|
| 39 |
|
| 40 |
-
## Small
|
| 41 |
|
| 42 |
-
|
| 43 |
-
- **Output:** text
|
| 44 |
|
| 45 |
-
|
| 46 |
-
|
| 47 |
-
| **Task** | **Model** | **Dataset** |
|
| 48 |
-
|:------------------:|:---------------------------------------------------------------------------------:|:------------------------------------------------------------------------------------------------------:|
|
| 49 |
| Keyword extraction | [flan-t5-small-keywords](https://huggingface.co/agentlans/flan-t5-small-keywords) | [wikipedia-paragraph-keywords](https://huggingface.co/datasets/agentlans/wikipedia-paragraph-keywords) |
|
| 50 |
-
| Title generation
|
| 51 |
-
|
| 52 |
-
## Natural language inference (NLI) models
|
| 53 |
|
| 54 |
-
|
| 55 |
-
- **Output:** label (entailment, neutral, or contradiction)
|
| 56 |
|
| 57 |
-
These
|
| 58 |
|
| 59 |
- [all-MiniLM-L6-v2-nli](https://huggingface.co/agentlans/all-MiniLM-L6-v2-nli)
|
| 60 |
- [bge-small-en-v1.5-nli](https://huggingface.co/agentlans/bge-small-en-v1.5-nli)
|
|
@@ -63,4 +59,4 @@ These are English only.
|
|
| 63 |
- [NoInstruct-small-Embedding-v0-nli](https://huggingface.co/agentlans/NoInstruct-small-Embedding-v0-nli)
|
| 64 |
- [snowflake-arctic-embed-s-nli](https://huggingface.co/agentlans/snowflake-arctic-embed-s-nli)
|
| 65 |
- [snowflake-arctic-embed-xs-nli](https://huggingface.co/agentlans/snowflake-arctic-embed-xs-nli)
|
| 66 |
-
- [TinyBERT_General_4L_312D-nli](https://huggingface.co/agentlans/TinyBERT_General_4L_312D-nli)
|
|
|
|
| 8 |
---
|
| 9 |
# Model Catalogue
|
| 10 |
|
| 11 |
+
[This repository](https://huggingface.co/agentlans) contains a collection of boutique AI models and is organized as follows:
|
|
|
|
|
|
|
| 12 |
|
| 13 |
+
## Pretrained Base Models for Text Embedding
|
| 14 |
+
|
| 15 |
+
### English Models
|
| 16 |
+
These models were finetuned on a subset of [Zyphra/Zyda-2](https://huggingface.co/datasets/Zyphra/Zyda-2):
|
| 17 |
|
|
|
|
| 18 |
- [snowflake-arctic-embed-xs-zyda-2](https://huggingface.co/agentlans/snowflake-arctic-embed-xs-zyda-2)
|
| 19 |
- [deberta-v3-xsmall-zyda-2](https://huggingface.co/agentlans/deberta-v3-xsmall-zyda-2)
|
| 20 |
- [deberta-v3-base-zyda-2](https://huggingface.co/agentlans/deberta-v3-base-zyda-2)
|
| 21 |
|
| 22 |
+
### Multilingual Models
|
| 23 |
+
These models were aligned using [agentlans/en-translations](https://huggingface.co/datasets/agentlans/en-translations):
|
| 24 |
+
|
| 25 |
- [multilingual-e5-small-aligned](https://huggingface.co/agentlans/multilingual-e5-small-aligned)
|
| 26 |
- [distilbert-base-multilingual-cased-aligned](https://huggingface.co/agentlans/distilbert-base-multilingual-cased-aligned)
|
| 27 |
|
| 28 |
+
## Text Statistics Models
|
| 29 |
|
| 30 |
+
These models take text as input and output a number.
|
|
|
|
| 31 |
|
| 32 |
+
| Base Model | Language | Quality | Readability | Sentiment |
|
| 33 |
+
|------------|----------|---------|-------------|-----------|
|
| 34 |
+
| deberta-v3-xsmall-zyda-2 | English | [Link](https://huggingface.co/agentlans/deberta-v3-xsmall-zyda-2-quality) | [Link](https://huggingface.co/agentlans/deberta-v3-xsmall-zyda-2-readability) | [Link](https://huggingface.co/agentlans/deberta-v3-xsmall-zyda-2-sentiment) |
|
| 35 |
+
| deberta-v3-base-zyda-2 | English | [Link](https://huggingface.co/agentlans/deberta-v3-base-zyda-2-quality) | [Link](https://huggingface.co/agentlans/deberta-v3-base-zyda-2-readability) | [Link](https://huggingface.co/agentlans/deberta-v3-base-zyda-2-sentiment) |
|
| 36 |
| multilingual-e5-small-aligned | Multilingual | [Link](https://huggingface.co/agentlans/multilingual-e5-small-aligned-quality) | [Link](https://huggingface.co/agentlans/multilingual-e5-small-aligned-readability) | [Link](https://huggingface.co/agentlans/multilingual-e5-small-aligned-sentiment) |
|
| 37 |
+
| mdeberta-v3-base | Multilingual | [Link](https://huggingface.co/agentlans/mdeberta-v3-base-quality) | [Link](https://huggingface.co/agentlans/mdeberta-v3-base-readability) | [Link](https://huggingface.co/agentlans/mdeberta-v3-base-sentiment) |
|
| 38 |
|
| 39 |
+
**Note:** The `mdeberta-v3-base` models were trained on a previous version of the dataset, not the complete dataset.
|
| 40 |
|
| 41 |
+
## Small Text-to-Text Models (English Only)
|
| 42 |
|
| 43 |
+
These models take text as input and produce text as output.
|
|
|
|
| 44 |
|
| 45 |
+
| Task | Model | Dataset |
|
| 46 |
+
|------|-------|---------|
|
|
|
|
|
|
|
| 47 |
| Keyword extraction | [flan-t5-small-keywords](https://huggingface.co/agentlans/flan-t5-small-keywords) | [wikipedia-paragraph-keywords](https://huggingface.co/datasets/agentlans/wikipedia-paragraph-keywords) |
|
| 48 |
+
| Title generation | [flan-t5-small-title](https://huggingface.co/agentlans/flan-t5-small-title) | [wikipedia-paragraph-titles](https://huggingface.co/datasets/agentlans/wikipedia-paragraph-titles) |
|
| 49 |
+
| Summarization | [text-summarization](https://huggingface.co/agentlans/text-summarization) | [wikipedia-paragraph-summaries](https://huggingface.co/datasets/agentlans/wikipedia-paragraph-summaries) |
|
|
|
|
| 50 |
|
| 51 |
+
## Natural Language Inference (NLI) Models (English Only)
|
|
|
|
| 52 |
|
| 53 |
+
These models take text as input and output a label (entailment, neutral, or contradiction).
|
| 54 |
|
| 55 |
- [all-MiniLM-L6-v2-nli](https://huggingface.co/agentlans/all-MiniLM-L6-v2-nli)
|
| 56 |
- [bge-small-en-v1.5-nli](https://huggingface.co/agentlans/bge-small-en-v1.5-nli)
|
|
|
|
| 59 |
- [NoInstruct-small-Embedding-v0-nli](https://huggingface.co/agentlans/NoInstruct-small-Embedding-v0-nli)
|
| 60 |
- [snowflake-arctic-embed-s-nli](https://huggingface.co/agentlans/snowflake-arctic-embed-s-nli)
|
| 61 |
- [snowflake-arctic-embed-xs-nli](https://huggingface.co/agentlans/snowflake-arctic-embed-xs-nli)
|
| 62 |
+
- [TinyBERT_General_4L_312D-nli](https://huggingface.co/agentlans/TinyBERT_General_4L_312D-nli)
|