Whisper Engines Collection Compiled engines for running Whisper with TRT LLM for much faster inference. • 219 items • Updated 26 days ago
baseten/btest-llama3.1-70b-instruct-NVIDIA-H100-80GB-HBM3-0.15.0-TP1-fp8-checkpoint Updated 27 days ago • 8