Update README.md
Browse files
README.md
CHANGED
|
@@ -260,28 +260,6 @@ This model is [Smaug-34b](https://huggingface.co/abacusai/Smaug-34B-v0.1) with L
|
|
| 260 |
[Join our Discord!](https://discord.gg/rJXGjmxqzS)
|
| 261 |
|
| 262 |
|
| 263 |
-
### Evaluation Results
|
| 264 |
-
|
| 265 |
-
Coming Soon
|
| 266 |
-
|
| 267 |
-
### Contamination Results
|
| 268 |
-
|
| 269 |
-
|
| 270 |
-
Coming Soon
|
| 271 |
-
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
|
| 272 |
-
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_ConvexAI__Luminex-34B-v0.1)
|
| 273 |
-
|
| 274 |
-
| Metric |Value|
|
| 275 |
-
|---------------------------------|----:|
|
| 276 |
-
|Avg. |77.06|
|
| 277 |
-
|AI2 Reasoning Challenge (25-Shot)|73.63|
|
| 278 |
-
|HellaSwag (10-Shot) |86.59|
|
| 279 |
-
|MMLU (5-Shot) |76.55|
|
| 280 |
-
|TruthfulQA (0-shot) |69.68|
|
| 281 |
-
|Winogrande (5-shot) |83.43|
|
| 282 |
-
|GSM8k (5-shot) |72.48|
|
| 283 |
-
|
| 284 |
-
|
| 285 |
# Open Portuguese LLM Leaderboard Evaluation Results
|
| 286 |
|
| 287 |
Detailed results can be found [here](https://huggingface.co/datasets/eduagarcia-temp/llm_pt_leaderboard_raw_results/tree/main/ConvexAI/Luminex-34B-v0.1) and on the [🚀 Open Portuguese LLM Leaderboard](https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard)
|
|
|
|
| 260 |
[Join our Discord!](https://discord.gg/rJXGjmxqzS)
|
| 261 |
|
| 262 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 263 |
# Open Portuguese LLM Leaderboard Evaluation Results
|
| 264 |
|
| 265 |
Detailed results can be found [here](https://huggingface.co/datasets/eduagarcia-temp/llm_pt_leaderboard_raw_results/tree/main/ConvexAI/Luminex-34B-v0.1) and on the [🚀 Open Portuguese LLM Leaderboard](https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard)
|