metadata
library_name: peft
license: mit
base_model: Qwen/QwQ-32B-Preview
tags:
- lora
- unsloth
- generated_from_trainer
- text-generation-inference
model-index:
- name: QwQ-32b-Preview-bnb-4bit-wTags
results: []
QwQ-32B-Preview LoRA for separating thinking/answer parts
This LoRA file was fine-tuned to make QwQ constantly separate its private thoughts from the final answer using <THINKING>...</THINKING><ANSWER>...</ANSWER>
tags.
A Q4_K_M GGUF version (which can be used as an adapter for Ollama) is available on shakedzy/QwQ-32B-Preview-with-Tags-LoRA-GGUF.