bartowski/SmallThinker-3B-Preview-GGUF

Text Generation

Inference Endpoints


Q8_0 is as fast as Q2_* in ollama

#1 opened 7 days ago by