Bilingual Translation Evaluation Script (EN → KK)

This repository provides an evaluation pipeline for translation models based on the Gemma3ForCausalLM architecture from Hugging Face Transformers. It covers English-to-Kazakh and Russian-to-Kazakh translation, as well as the reverse directions.

🚀 Overview

The script:

  • Loads a fine-tuned model and tokenizer
  • Performs inference on a FLORES-style test set (.jsonl)
  • Computes the BLEU score using NLTK
  • Saves predictions and evaluation results into a JSON file
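The BLEU step can be sketched as follows. This is a minimal illustration, not the repository's actual code: it assumes whitespace tokenization, a single reference per sentence, and NLTK's `corpus_bleu` with `SmoothingFunction().method1`. The script itself may tokenize or smooth differently, and the helper name `compute_bleu` is hypothetical.

```python
from nltk.translate.bleu_score import SmoothingFunction, corpus_bleu


def compute_bleu(predictions, references):
    """Corpus-level BLEU for parallel lists of prediction and reference strings."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    # Assumption: plain whitespace tokenization (the real script may differ).
    hypotheses = [p.split() for p in predictions]
    # corpus_bleu expects, for each hypothesis, a list of tokenized references.
    list_of_references = [[r.split()] for r in references]
    # Smoothing avoids a zero score when some higher-order n-gram has no match.
    smoothing = SmoothingFunction().method1
    return corpus_bleu(list_of_references, hypotheses, smoothing_function=smoothing)


if __name__ == "__main__":
    preds = ["this is a small test ."]
    refs = ["this is a small test ."]
    print(f"BLEU: {compute_bleu(preds, refs):.4f}")
```

NLTK returns BLEU on a 0 to 1 scale; multiply by 100 if you want to compare against scores reported on the 0 to 100 scale.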

βš™οΈ Configuration

Modify these lines at the top of the script as needed:

SRC_LANG = "en"
TGT_LANG = "kk"
MODEL_PATH = "/path/to/your/model"
TEST_FILE = "/path/to/test_file.jsonl"
OUTPUT_JSON = "/path/to/output_file.jsonl"
MAX_NEW_TOKS = 64
DEVICE = "cuda"  # or "cpu"
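How these settings map onto the test file is not spelled out here. One plausible reading, shown purely as an assumption, is that each line of the `.jsonl` file is a JSON object keyed by language code, so `SRC_LANG` and `TGT_LANG` select the source sentence and its reference. The helper `load_pairs` and the record layout are hypothetical; check the keys in your own test file before relying on this.

```python
import json

SRC_LANG = "en"
TGT_LANG = "kk"


def load_pairs(path, src_lang=SRC_LANG, tgt_lang=TGT_LANG):
    """Read (source, reference) pairs from a JSON Lines file.

    Assumed record layout (hypothetical): {"en": "...", "kk": "..."}
    """
    pairs = []
    with open(path, encoding="utf-8") as f:
        for line_no, line in enumerate(f, start=1):
            line = line.strip()
            if not line:
                continue  # tolerate blank lines
            record = json.loads(line)
            if src_lang not in record or tgt_lang not in record:
                raise KeyError(
                    f"line {line_no}: missing '{src_lang}' or '{tgt_lang}'"
                )
            pairs.append((record[src_lang], record[tgt_lang]))
    return pairs
```

Under this layout, swapping the values of `SRC_LANG` and `TGT_LANG` would evaluate the reverse direction on the same file.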

To restrict the script to specific GPUs, set CUDA_VISIBLE_DEVICES before running it:

export CUDA_VISIBLE_DEVICES=2,3,4,5
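With this setting, CUDA renumbers the visible devices from zero, so inside the process `cuda:0` refers to physical GPU 2, `cuda:1` to GPU 3, and so on. The stdlib-only sketch below illustrates that mapping. `visible_gpu_map` is a hypothetical helper, not part of the script, and it assumes the variable holds plain integer indices rather than GPU UUIDs.

```python
import os


def visible_gpu_map(env=None):
    """Map in-process device names to physical GPU indices."""
    env = os.environ if env is None else env
    raw = env.get("CUDA_VISIBLE_DEVICES")
    if raw is None:
        return None  # variable unset: all GPUs visible, numbering unchanged
    # An empty value hides every GPU, which yields an empty mapping here.
    physical = [int(tok) for tok in raw.split(",") if tok.strip()]
    return {f"cuda:{i}": gpu for i, gpu in enumerate(physical)}


if __name__ == "__main__":
    print(visible_gpu_map({"CUDA_VISIBLE_DEVICES": "2,3,4,5"}))
```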

▶️ Run the Script

python eval_blue.py