AMD RyzenAI Models mohitsha/timm-resnet18-onnx-quantized-ryzen Updated Mar 21, 2024 mohitsha/transformers-resnet18-onnx-quantized-ryzen Image Classification • Updated Mar 21, 2024 • 24 mohitsha/Llama-2-7b-hf-quantized-brevitas Updated Mar 27, 2024 mohitsha/opt-125m-quantized-brevitas Text Generation • Updated Mar 27, 2024 • 13
FP8 KV Cache Models with FP8 KV Cache Scales mohitsha/Llama-2-70b-chat-hf-FP8-KV Text Generation • Updated Jun 25, 2024 • 19 mohitsha/Llama-2-7b-chat-hf-FP8-KV Text Generation • Updated Jun 25, 2024 • 26 mohitsha/Llama-2-7b-chat-hf-FP8-KV-AMMO Text Generation • Updated Jun 25, 2024 • 24 mohitsha/Llama-2-70b-chat-hf-FP8-KV-AMMO Text Generation • Updated Jun 25, 2024 • 22