Flames: Benchmarking Value Alignment of LLMs in Chinese
Paper
•
2311.06899
•
Published
•
2
This is the specified scorer for Flames benchmark – a highly adversarial benchmark in Chinese for LLM's value alignment evaluation. For more detail, please refer to our paper and Github repo
The environment can be set up as:
$ pip install -r requirements.txt
And you can use infer.py to evaluate your model:
python infer.py --data_path YOUR_DATA_FILE.jsonl
The flames-scorer can be loaded by:
from tokenization_internlm import InternLMTokenizer
from modeling_internlm import InternLMForSequenceClassification
tokenizer = InternLMTokenizer.from_pretrained("CaasiHUANG/flames-scorer", trust_remote_code=True)
model = InternLMForSequenceClassification.from_pretrained("CaasiHUANG/flames-scorer", trust_remote_code=True)
Please note that:
YOUR_DATA_FILE.jsonl includes the fields: "dimension", "prompt", and "response".YOUR_DATA_FILE.jsonl.