library_name: peft
license: mit
base_model: Qwen/QwQ-32B-Preview
tags:
  - lora
  - unsloth
  - generated_from_trainer
  - text-generation-inference
model-index:
  - name: QwQ-32b-Preview-bnb-4bit-wTags
    results: []

QwQ-32B-Preview LoRA for separating thinking/answer parts

This LoRA adapter was fine-tuned to make QwQ consistently separate its private thoughts from its final answer using <THINKING>...</THINKING><ANSWER>...</ANSWER> tags.
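Because the output follows a fixed tag layout, downstream code can split it with a small parser. The snippet below is a minimal sketch, not part of this repository: the function name `split_thinking_answer` and the fallback behavior for untagged output are assumptions made for illustration. Only the tag names come from the description above.

```python
import re

# Tag names come from the model card; the parsing strategy is an assumption.
_TAGGED = re.compile(
    r"<THINKING>(?P<thinking>.*?)</THINKING>\s*<ANSWER>(?P<answer>.*?)</ANSWER>",
    re.DOTALL,  # let "." match newlines, since both parts may span several lines
)


def split_thinking_answer(text):
    """Return (thinking, answer). thinking is None when the tags are missing."""
    match = _TAGGED.search(text)
    if match is None:
        # Hypothetical fallback: treat the whole output as the answer.
        return None, text.strip()
    return match.group("thinking").strip(), match.group("answer").strip()


if __name__ == "__main__":
    sample = "<THINKING>\nLet me add.\n</THINKING>\n<ANSWER>\n4\n</ANSWER>"
    thinking, answer = split_thinking_answer(sample)
    print("thinking:", thinking)
    print("answer:", answer)
```

The non-greedy match stops at the first closing tag, so a response that mentions the tag names inside its own reasoning may be split incorrectly. A stricter parser may be needed in that case.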

A Q4_K_M GGUF version, which can be used as an adapter for Ollama, is available at shakedzy/QwQ-32B-Preview-with-Tags-LoRA-GGUF.