Update README.md
Browse files
README.md
CHANGED
|
@@ -39,7 +39,7 @@ This optimization resulted in a **~4x reduction in file size** and significant i
|
|
| 39 |
|
| 40 |
**This is the model intended for direct deployment in mobile applications.**
|
| 41 |
|
| 42 |
-
The original, full-precision (FP32) model can be found at the [main model repository here](
|
| 43 |
|
| 44 |
## Model Description
|
| 45 |
|
|
@@ -62,7 +62,7 @@ import numpy as np
|
|
| 62 |
import scipy.special
|
| 63 |
|
| 64 |
# Load the tokenizer
|
| 65 |
-
tokenizer = AutoTokenizer.from_pretrained("
|
| 66 |
# Load the ONNX session
|
| 67 |
session = ort.InferenceSession("path/to/your/model.quant.onnx")
|
| 68 |
|
|
|
|
| 39 |
|
| 40 |
**This is the model intended for direct deployment in mobile applications.**
|
| 41 |
|
| 42 |
+
The original, full-precision (FP32) model can be found at the [main model repository here](https://huggingface.co/SharpWoofer/distilroberta-sms-spam-detector)
|
| 43 |
|
| 44 |
## Model Description
|
| 45 |
|
|
|
|
| 62 |
import scipy.special
|
| 63 |
|
| 64 |
# Load the tokenizer
|
| 65 |
+
tokenizer = AutoTokenizer.from_pretrained("SharpWoofer/distilroberta-sms-spam-detector-onnx-quantized")
|
| 66 |
# Load the ONNX session
|
| 67 |
session = ort.InferenceSession("path/to/your/model.quant.onnx")
|
| 68 |
|