Hush-cd committed on
Commit 68713e7 · verified · 1 Parent(s): 009fdab

Create README.md

Files changed (1)
  1. README.md +61 -0
README.md ADDED
@@ -0,0 +1,61 @@
+ ---
+ inference: false
+ language:
+ - en
+ - zh
+ tags:
+ - instruction-finetuning
+ task_categories:
+ - text-generation
+ base_model:
+ - Qwen/Qwen2.5-1.5B-Instruct
+ license: cc-by-nc-nd-4.0
+ ---
+ <h1 align="center">
+ 🔍 xVerify-1.5B-I
+ </h1>
+
+ <p align="center">
+ <div style="display: flex; justify-content: center; gap: 10px;">
+ <a href="https://github.com/IAAR-Shanghai/xVerify">
+ <img src="https://img.shields.io/badge/GitHub-Repository-blue?logo=github" alt="GitHub"/>
+ </a>
+ <a href="https://huggingface.co/IAAR-Shanghai/xVerify-1.5B-I">
+ <img src="https://img.shields.io/badge/🤗%20Hugging%20Face-xVerify--1.5B--I-yellow" alt="Hugging Face"/>
+ </a>
+ </div>
+ </p>
+ xVerify is an evaluation tool fine-tuned from a pre-trained large language model, designed specifically for objective questions with a single correct answer. It accurately extracts the final answer from lengthy reasoning processes and efficiently identifies equivalence across different forms of expressions.
+
+ ---
+
+ ## ✨ Key Features
+
+ ### 📊 Broad Applicability
+ Suitable for various objective question evaluation scenarios including math problems, multiple-choice questions, classification tasks, and short-answer questions.
+
+ ### ⛓️ Handles Long Reasoning Chains
+ Effectively processes answers with extensive reasoning steps to extract the final answer, regardless of complexity.
+
+ ### 🌐 Multilingual Support
+ Primarily handles Chinese and English responses while remaining compatible with other languages.
+
+ ### 🔄 Powerful Equivalence Judgment
+ - ✓ Recognizes basic transformations like letter case changes and Greek letter conversions
+ - ✓ Identifies equivalent mathematical expressions across formats (LaTeX, fractions, scientific notation)
+ - ✓ Determines semantic equivalence in natural language answers
+ - ✓ Matches multiple-choice responses by content rather than just option identifiers
+
+ ---
+
+
+ ## 📚 Citation
+
+ ```bibtex
+ @article{xVerify,
+ title={xVerify: Efficient Answer Verifier for Reasoning Model Evaluations},
+ author={Ding Chen and Qingchen Yu and Pengyuan Wang and Wentao Zhang and Bo Tang and Feiyu Xiong and Xinchi Li and Minchuan Yang and Zhiyu Li},
+ journal={arXiv preprint arXiv:2504.10481},
+ year={2025},
+ }
+ ```
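
The equivalence cases listed under "Powerful Equivalence Judgment" in the README above can be made concrete with a small rule-based sketch. This is not how xVerify works (xVerify is a fine-tuned language model, and the README does not describe its prompt format or internals). It is only a naive baseline, under the assumption that answers are short strings, to show what "equivalent across formats" means for letter case, Greek letters, fractions, and scientific notation. The names `GREEK`, `to_number`, `normalize_text`, and `is_equivalent` are hypothetical and do not come from the xVerify repository.

```python
import re
import unicodedata
from fractions import Fraction

# Hypothetical, deliberately tiny mapping; a real table would cover all letters.
GREEK = {"alpha": "α", "beta": "β", "gamma": "γ", "pi": "π"}


def to_number(text):
    """Try to read text as an exact rational number; return None on failure."""
    s = text.strip().replace("$", "").replace(" ", "")
    # LaTeX fraction: \frac{a}{b} -> a/b
    m = re.fullmatch(r"\\frac\{(-?\d+)\}\{(\d+)\}", s)
    if m:
        s = f"{m.group(1)}/{m.group(2)}"
    # LaTeX scientific notation: 1.5\times10^{3} -> 1.5e3
    s = re.sub(r"\\times10\^\{?(-?\d+)\}?", r"e\1", s)
    try:
        return Fraction(s)  # accepts "1/2", "0.5", and "1.5e3"
    except (ValueError, ZeroDivisionError):
        return None


def normalize_text(text):
    """Fold case, drop LaTeX markers, and map Greek letter names to symbols."""
    s = unicodedata.normalize("NFKC", text).strip().casefold()
    s = s.replace("$", "").replace("\\", "")  # "$\alpha$" -> "alpha"
    for name, symbol in GREEK.items():
        s = re.sub(rf"\b{name}\b", symbol, s)
    return s


def is_equivalent(a, b):
    """Compare numerically when both sides parse as numbers, else as text."""
    na, nb = to_number(a), to_number(b)
    if na is not None and nb is not None:
        return na == nb
    return normalize_text(a) == normalize_text(b)
```

Rules like these cover only the first two bullets. Semantic equivalence in natural language and matching a multiple-choice response by its content need an understanding of the question and the options, which is the gap a model-based verifier such as xVerify is meant to fill.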