ibelem committed on
Commit 705da0c · verified
1 Parent(s): 6af591a

Update README.md

Files changed (1)
  1. README.md +13 -3
README.md CHANGED
@@ -1,3 +1,13 @@
- ---
- license: mit
- ---
+ ---
+ license: mit
+ pipeline_tag: text-generation
+ tags: [ONNX, ONNXRuntime, phi3.5, nlp, conversational, custom_code]
+ inference: false
+ ---
+ Based on https://huggingface.co/microsoft/Phi-3.5-mini-instruct
+
+ Convert ONNX model by using https://github.com/microsoft/onnxruntime-genai
+
+ Using command: python -m onnxruntime_genai.models.builder -m microsoft/Phi-3.5-mini-instruct -o Phi-3.5-mini-instruct-onnx -e webgpu -c cache-dir -p int4 --extra_options int4_block_size=32 int4_accuracy_level=4
+
+ The generated external data (model.onnx.data) is larger than 2GB, which is not suitable for ORT-Web. I use an additional Python script to move some data into model.onnx.
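
The README does not include the script that moves data into model.onnx, so the following is only a minimal sketch of one way such a step could work, not the author's actual method. It assumes the `onnx` Python package is installed; the function names `choose_inline_tensors` and `split_external_data`, the default file names, and the size margin are all hypothetical choices made for illustration. The idea is to embed the largest initializers in model.onnx until the remaining external data drops below the limit.

```python
from typing import Dict, Set

# Assumed budget: 2 GiB minus an arbitrary 64 MiB safety margin (hypothetical value).
DEFAULT_LIMIT = 2 * 1024**3 - 64 * 1024**2


def choose_inline_tensors(sizes: Dict[str, int],
                          external_limit: int,
                          inline_limit: int) -> Set[str]:
    """Pick tensors to embed in model.onnx so the external file fits.

    sizes maps tensor name to byte size. Tensors are taken largest first
    until the bytes left for the external file are <= external_limit,
    without letting the embedded bytes exceed inline_limit.
    """
    need = sum(sizes.values()) - external_limit  # bytes that must move inline
    inline: Set[str] = set()
    moved = 0
    for name, size in sorted(sizes.items(), key=lambda kv: kv[1], reverse=True):
        if moved >= need:
            break
        if moved + size <= inline_limit:
            inline.add(name)
            moved += size
    if moved < need:
        raise ValueError("cannot fit both files under the given limits")
    return inline


def split_external_data(src_path: str,
                        dst_path: str,
                        data_name: str = "model.onnx.data",
                        limit: int = DEFAULT_LIMIT) -> None:
    """Re-save a model with part of its weights embedded in the .onnx file.

    dst_path should point into an existing, empty directory, because onnx
    appends to an external data file that is already present there.
    """
    # Imported lazily so the planning function above works without onnx.
    import onnx
    from onnx.external_data_helper import set_external_data

    # onnx.load pulls external data into each tensor's raw_data by default.
    model = onnx.load(src_path)
    sizes = {t.name: len(t.raw_data)
             for t in model.graph.initializer if t.HasField("raw_data")}
    inline = choose_inline_tensors(sizes, limit, limit)

    # Mark every tensor that was not chosen for embedding as external again.
    for t in model.graph.initializer:
        if t.name in sizes and t.name not in inline:
            set_external_data(t, location=data_name)

    # save_model writes the externally marked tensors next to dst_path.
    onnx.save_model(model, dst_path)
```

A call such as `split_external_data("Phi-3.5-mini-instruct-onnx/model.onnx", "out/model.onnx")` would then write both files into `out/` (paths assumed for illustration). Only graph initializers are considered here, and the size check ignores the protobuf overhead of the graph itself, so the margin may need adjusting for a real model.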