InternViT-6B + QLLaMA, can be used for image-text retrieval like CLIP
2
#5 opened about 16 hours ago by vitvit
Fix incorrect image embedding when running with a single GPU and 24GB VRAM
1
#3 opened 7 months ago by xdedss