bartowski/SmallThinker-3B-Preview-GGUF
Text Generation
GGUF
PowerInfer/QWQ-LONGCOT-500K
Inference Endpoints
imatrix
conversational
Q8_0 is as fast as Q2_* in ollama
2
#1 opened 7 days ago by aguspiza