Apertus-8B-Instruct-2509-LOGEQ-FP8_dynamic

This model applies Logarithmic Equalization (LogEQ) followed by
full FP8 dynamic quantization using the LLM Compressor framework.
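The equalization idea can be shown with a toy, SmoothQuant-style fold in pure Python: per-channel scales derived from logarithmic activation statistics are divided out of the activations and multiplied into the matching weight rows, so the product is unchanged while activation outliers shrink. The scale formula below is illustrative only, not llm-compressor's exact statistic.

```python
import math

def logeq_scales(act_absmax):
    """Illustrative per-channel scales from log-compressed activation maxima.
    (Assumed form for demonstration; not llm-compressor's exact formula.)"""
    return [m / math.log2(2.0 + m) for m in act_absmax]

def matmul(X, W):
    return [[sum(x * w for x, w in zip(row, col)) for col in zip(*W)]
            for row in X]

# Toy activations with an outlier channel, and a small weight matrix
X = [[0.5, 120.0], [0.2, 95.0]]
W = [[0.3, -0.1], [0.02, 0.04]]

s = logeq_scales([max(abs(row[j]) for row in X) for j in range(2)])

# Fold: divide activation channel j by s[j], multiply weight row j by s[j].
# X @ W is mathematically unchanged, but X's dynamic range is reduced.
X_eq = [[x / s[j] for j, x in enumerate(row)] for row in X]
W_eq = [[w * s[i] for w in row] for i, row in enumerate(W)]
```

After the fold, the outlier channel of `X_eq` is an order of magnitude smaller than in `X`, which is what makes the subsequent FP8 quantization less lossy.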

What this means in practice

  • LogEQ smooths and rescales activations using logarithmic statistics,
    reducing activation outliers before quantization.
  • Both weights and activations are quantized to FP8 dynamic format
    (E5M2 or E4M3 depending on layer type).
  • The quantized model is compatible with vLLM FP8 execution and was
    produced without calibration data (data-free activation tracing).
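Dynamic FP8 quantization can be sketched in pure Python: at inference time a per-tensor scale is computed from the current absolute maximum, values are mapped to the nearest E4M3-representable number, then rescaled. The rounding helper below is a simplified model of the E4M3 format (no NaN handling), not vLLM's actual kernel.

```python
import math

E4M3_MAX = 448.0  # largest finite FP8 E4M3 value

def quantize_e4m3(x: float) -> float:
    """Round one value to the nearest FP8 E4M3-representable number
    (simplified: clamps to the finite range, ignores NaN encoding)."""
    if x == 0.0:
        return 0.0
    sign = -1.0 if x < 0 else 1.0
    v = min(abs(x), E4M3_MAX)
    # Normal exponent range is 2^-6 .. 2^8, with 3 mantissa bits;
    # clamping e at -6 also yields the correct subnormal spacing.
    e = max(-6, min(8, math.floor(math.log2(v))))
    step = 2.0 ** (e - 3)  # spacing between representable values in this binade
    return sign * round(v / step) * step

def fp8_dynamic(tensor):
    """Per-tensor dynamic FP8: scale by amax at runtime, quantize, dequantize."""
    amax = max(abs(v) for v in tensor)
    scale = amax / E4M3_MAX if amax > 0 else 1.0
    return [quantize_e4m3(v / scale) * scale for v in tensor]
```

Because the scale is recomputed per tensor at runtime, no calibration pass is needed, which is what makes the method data-free.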

Quantization Details

  • Quantization Scheme: FP8_dynamic
  • Preprocessing: LogarithmicEqualizationModifier
  • Method: Data-free FP8 dynamic quantization
  • Targets: All Linear layers
  • Ignored Layers: lm_head (kept in FP16 for stability)
  • Tool: llm-compressor
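The details above can be assembled into an llm-compressor recipe sketch. This is a configuration outline, not a verified reproduction of the author's run: the modifier import paths, constructor arguments, and the base-model id are assumptions that may differ across llm-compressor versions, so check the library documentation before use.

```python
# Recipe sketch: LogEQ preprocessing + data-free FP8 dynamic quantization.
# NOTE: import paths, modifier arguments, and the model id are assumed.
from llmcompressor.transformers import oneshot
from llmcompressor.modifiers.quantization import QuantizationModifier
from llmcompressor.modifiers.logarithmic_equalization import (
    LogarithmicEqualizationModifier,
)

recipe = [
    # Rescale activation outliers into the weights before quantization
    LogarithmicEqualizationModifier(),
    # FP8 dynamic on all Linear layers, lm_head kept in higher precision,
    # matching the Quantization Details above
    QuantizationModifier(targets="Linear", scheme="FP8_DYNAMIC", ignore=["lm_head"]),
]

oneshot(
    model="Apertus-8B-Instruct-2509",  # base model id (org prefix omitted)
    recipe=recipe,
    output_dir="Apertus-8B-Instruct-2509-LOGEQ-FP8_dynamic",
)
```

Because the scheme is dynamic, `oneshot` needs no calibration dataset here; the resulting checkpoint can then be loaded directly by vLLM's FP8 path.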
  • Model Size: 8B params (safetensors)
  • Tensor Types: BF16 · F8_E4M3