SeasonalFall84 committed
Commit 75d6bc4 · verified · 1 parent: ff39d4a

Add model-index metadata for benchmark results

This PR adds machine-readable evaluation metadata to the model card using the `model-index` format.

## What This Adds

Structured YAML metadata for the 7 benchmarks reported in the README:

- MATH-500: 98.1
- AIME24: 90.8
- AIME25: 88.0
- LCB: 69.3
- GPQA: 74.4
- HLE: 14.6
- MMLU-Pro: 81.9
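
As a rough illustration of how the scores above map onto the `model-index` layout used in this PR, here is a minimal Python sketch. The helper names `metric_type` and `build_model_index` are hypothetical, and the slug rule (lowercase, hyphens to underscores) is an assumption inferred from the `type` values in the diff, not a documented convention. The result is printed as JSON purely for inspection.

```python
import json

# Scores copied from the list above.
SCORES = {
    "MATH-500": 98.1,
    "AIME24": 90.8,
    "AIME25": 88.0,
    "LCB": 69.3,
    "GPQA": 74.4,
    "HLE": 14.6,
    "MMLU-Pro": 81.9,
}


def metric_type(name: str) -> str:
    """Hypothetical slug rule: lowercase the name, turn hyphens into underscores."""
    return name.lower().replace("-", "_")


def build_model_index(model_name: str, scores: dict, source_url: str) -> list:
    """Return a model-index list with a single result entry holding every metric."""
    metrics = [
        {"name": name, "type": metric_type(name), "value": value}
        for name, value in scores.items()
    ]
    return [
        {
            "name": model_name,
            "results": [
                {
                    "task": {"type": "text-generation"},
                    "dataset": {"name": "Benchmarks", "type": "benchmark"},
                    "metrics": metrics,
                    "source": {"name": "Model README", "url": source_url},
                }
            ],
        }
    ]


model_index = build_model_index(
    "INTELLECT-3", SCORES, "https://huggingface.co/PrimeIntellect/INTELLECT-3"
)

if __name__ == "__main__":
    # Printed as JSON for inspection; the PR itself stores this as YAML.
    print(json.dumps({"model-index": model_index}, indent=2))
```

Grouping all seven metrics under one generic dataset entry mirrors what this PR does; splitting them into one result per benchmark dataset would be an alternative layout.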

## Why This Helps

Adding structured benchmark metadata enables:

1. **Leaderboard Inclusion** - The Hub can parse the results and, where supported, index them for Hugging Face leaderboards and Papers with Code
2. **Better Discoverability** - Users can search/filter models by benchmark scores
3. **Machine-Readable Data** - Tools and APIs can query model performance programmatically
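
To make the "query programmatically" point concrete, here is a small Python sketch that looks up a score in already-parsed `model-index` data. It assumes the YAML has been loaded into plain dicts and lists by some parser; `MODEL_INDEX` is a hand-written stand-in holding only two of the seven metrics, and `find_metric` and `meets_threshold` are hypothetical helpers, not part of any Hugging Face library.

```python
from typing import Optional

# Hand-written stand-in for parsed `model-index` metadata (two metrics only).
MODEL_INDEX = [
    {
        "name": "INTELLECT-3",
        "results": [
            {
                "task": {"type": "text-generation"},
                "dataset": {"name": "Benchmarks", "type": "benchmark"},
                "metrics": [
                    {"name": "GPQA", "type": "gpqa", "value": 74.4},
                    {"name": "HLE", "type": "hle", "value": 14.6},
                ],
            }
        ],
    }
]


def find_metric(model_index: list, metric_name: str) -> Optional[float]:
    """Return the first value recorded under metric_name, or None if absent."""
    for entry in model_index:
        for result in entry.get("results", []):
            for metric in result.get("metrics", []):
                if metric.get("name") == metric_name:
                    return metric.get("value")
    return None


def meets_threshold(model_index: list, metric_name: str, minimum: float) -> bool:
    """True only when the metric exists and is at least `minimum`."""
    value = find_metric(model_index, metric_name)
    return value is not None and value >= minimum


if __name__ == "__main__":
    print(find_metric(MODEL_INDEX, "GPQA"))
    print(meets_threshold(MODEL_INDEX, "HLE", 20.0))
```

A filter like `meets_threshold` is the kind of check a search or comparison tool could run once the scores live in metadata rather than in a Markdown table.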

## What Doesn't Change

- ✅ Existing README content stays the same
- ✅ Markdown benchmark tables remain unchanged
- ✅ Only adds metadata to the YAML frontmatter
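
One way a reviewer might sanity-check the claims above is to compare the README before and after the change. The Python sketch below is an illustration under stated assumptions: the front matter is delimited by `---` lines, new keys are appended at the end of the existing front matter (as in this PR), and the `OLD` and `NEW` strings are shortened stand-ins rather than the real files. `split_front_matter` and `is_metadata_only_addition` are hypothetical helper names.

```python
def split_front_matter(text: str):
    """Split a model card into (front_matter_lines, body).

    Returns ([], text) when no complete `---` delimited block opens the file.
    """
    lines = text.split("\n")
    if not lines or lines[0].strip() != "---":
        return [], text
    for i in range(1, len(lines)):
        if lines[i].strip() == "---":
            return lines[1:i], "\n".join(lines[i + 1:])
    return [], text  # opening delimiter without a closing one


def is_metadata_only_addition(old: str, new: str) -> bool:
    """True when the body is identical and the old metadata lines are kept as a prefix."""
    old_meta, old_body = split_front_matter(old)
    new_meta, new_body = split_front_matter(new)
    return old_body == new_body and new_meta[: len(old_meta)] == old_meta


# Shortened stand-ins for the README before and after the PR.
OLD = "---\npipeline_tag: text-generation\n---\n\n# INTELLECT-3\n"
NEW = (
    "---\npipeline_tag: text-generation\n"
    "model-index:\n- name: INTELLECT-3\n"
    "---\n\n# INTELLECT-3\n"
)

if __name__ == "__main__":
    print(is_metadata_only_addition(OLD, NEW))
```

The prefix comparison is deliberately strict: it would reject a change that reorders or edits existing keys, which is stronger than what the PR description promises but easy to reason about.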

Thank you for open-sourcing INTELLECT-3! This contribution helps the community compare and discover your work.

Files changed (1)
  1. README.md +33 -0
README.md CHANGED
@@ -14,6 +14,39 @@ language:
 base_model:
 - zai-org/GLM-4.5-Air-Base
 pipeline_tag: text-generation
+model-index:
+- name: INTELLECT-3
+  results:
+  - task:
+      type: text-generation
+    dataset:
+      name: Benchmarks
+      type: benchmark
+    metrics:
+    - name: MATH-500
+      type: math_500
+      value: 98.1
+    - name: AIME24
+      type: aime24
+      value: 90.8
+    - name: AIME25
+      type: aime25
+      value: 88.0
+    - name: LCB
+      type: lcb
+      value: 69.3
+    - name: GPQA
+      type: gpqa
+      value: 74.4
+    - name: HLE
+      type: hle
+      value: 14.6
+    - name: MMLU-Pro
+      type: mmlu_pro
+      value: 81.9
+    source:
+      name: Model README
+      url: https://huggingface.co/PrimeIntellect/INTELLECT-3
 ---
 
 # INTELLECT-3